Example of Informatica DX using Power Center & Data Transformation
USE CASE FOR INFORMATICA DX
Informatica DX
·
Informatica B2B Data Exchange facilitates the work of setting
up the transformations and synchronizing various workflows and processes.
·
It also has the capability to understand the dependency of
various workflows and formulate their execution in the way to match and meet
the business requirements.
·
The partners/user can monitor and analyze the process going
on.
Use Case
Requirement Analysis
·
The Business user has a requirement of converting theirMonthly
Invoice statements (in PDF format) and generate the more detailed and
understandable (in text files).
·
The Invoices come from various countries ex: USA, Canada, UK,
India and have their respective states within itself. Let us say 5 states from
each country.
Approach
·
This work can be categorized as:
Ø Transformation of PDF to
Text documents. (Using Informatica Data Transformation)
Ø Creating Workflows for
Each Transformation project. (Using Informatica PC)
Ø Configuration of these
workflows based on Condition match. (Using Informatica DX)
Ø Evaluation of Partner
Name, Account Name based on the bundle of data received. (Using Informatica DX)
·
We also receive the Input PDF in form of a zip folder
(Containing 100K PDF files) with the naming Conventions as:US_20161001_Monthly_Statements.zip
·
The above steps have can be understood as:
Working:
The
data bundle that we receive also has a metadata file in .xml that contains the
information of each PDF file like DocID which is a unique identifier for each
document. So the working can be as follows:
1) Partner Name:It can be extracted from the bundle
name initials that we receive.
Eg: US.
2) Account Name:Account Name can be based on different
states of a particular partner. These partner names can be as: New Jersey, New York
etc.
3) Event Name:Each event can be
configured based on the Account name and Partner name. The events will trigger
subsequent workflows.
Eg: us_newjersey_monthly
Ø Events can be helpful in
analyzing the errors.
Ø There are various filters
that can be used to search the events. Eg: Time, Status, Type etc.
4) Workflow:The workflows corresponding to each
event trigger the execution ad transformation of the PDF. There can be many
sessions that may run in order to execute proper working of workflow.
Eg: wf_us_newjersey_monthly_conversion.
This process may consist of various sessions that may be
configured to work based on the status passed on from previous sessions.
Ø Unzipping of data bundle
and placing the PDF files at a particular source location.Eg: s_unzip_data
Ø Renaming of source files
from BankAccount#_20161001.pdf to BankAccount#_20161001_hhmmss.pdf. Eg: s_rename_files
Ø Session for calling the
Informatica DT Transformation code, this helps in converting the PDF to Text
Document and Place it at a required destination.Eg: s_pdf_to_text_conversion.
Note: The various sessions may be dependent and
interdependent and the execution can be monitored through DX Console.
Sessions may fail based if criteria fail to match
required condition.
Hence
we can conclude that we have following:
ü 4 Partners: India, US, UK,
Canada
ü 20 Accounts: 5 From Each
Partners
ü Workflows would be in
accordance with the Events that are configured based on AN & PN.
ü Sessions would execute on
the basis of status of previous session.
Informatica
DX provides us the platform to configure and monitor proper conversion of PDF
documents. It also helps us to analyze the events and categorize the issue as:
1) DX Configuration level
2) Workflow level
3) Session Level
4) Data Transformation Level
These
converted Text documents are further used by Business for Reporting purpose.