Example of Informatica DX using Power Center & Data Transformation


USE CASE FOR INFORMATICA DX


Informatica DX
·        Informatica B2B Data Exchange facilitates the work of setting up the transformations and synchronizing various workflows and processes.
·        It also has the capability to understand the dependency of various workflows and formulate their execution in the way to match and meet the business requirements.
·        The partners/user can monitor and analyze the process going on.

Use Case
Requirement Analysis
·        The Business user has a requirement of converting theirMonthly Invoice statements (in PDF format) and generate the more detailed and understandable (in text files).
·        The Invoices come from various countries ex: USA, Canada, UK, India and have their respective states within itself. Let us say 5 states from each country.
Approach
·        This work can be categorized as:
Ø  Transformation of PDF to Text documents. (Using Informatica Data Transformation)
Ø  Creating Workflows for Each Transformation project. (Using Informatica PC)
Ø  Configuration of these workflows based on Condition match. (Using Informatica DX)
Ø  Evaluation of Partner Name, Account Name based on the bundle of data received. (Using Informatica DX)
·        We also receive the Input PDF in form of a zip folder (Containing 100K PDF files) with the naming Conventions as:US_20161001_Monthly_Statements.zip
·        The above steps have can be understood as:






Working:
The data bundle that we receive also has a metadata file in .xml that contains the information of each PDF file like DocID which is a unique identifier for each document. So the working can be as follows:
1)     Partner Name:It can be extracted from the bundle name initials that we receive.
Eg: US.
2)      Account Name:Account Name can be based on different states of a particular partner. These partner names can be as: New Jersey, New York etc.
3)     Event Name:Each event can be configured based on the Account name and Partner name. The events will trigger subsequent workflows.
Eg: us_newjersey_monthly
Ø  Events can be helpful in analyzing the errors.
Ø  There are various filters that can be used to search the events. Eg: Time, Status, Type etc.
4)     Workflow:The workflows corresponding to each event trigger the execution ad transformation of the PDF. There can be many sessions that may run in order to execute proper working of workflow.
Eg: wf_us_newjersey_monthly_conversion.
This process may consist of various sessions that may be configured to work based on the status passed on from previous sessions.
Ø  Unzipping of data bundle and placing the PDF files at a particular source location.Eg: s_unzip_data
Ø  Renaming of source files from BankAccount#_20161001.pdf to BankAccount#_20161001_hhmmss.pdf.  Eg: s_rename_files
Ø  Session for calling the Informatica DT Transformation code, this helps in converting the PDF to Text Document and Place it at a required destination.Eg: s_pdf_to_text_conversion.
Note: The various sessions may be dependent and interdependent and the execution can be monitored through DX Console.
Sessions may fail based if criteria fail to match required condition.




Hence we can conclude that we have following:
ü  4 Partners: India, US, UK, Canada
ü  20 Accounts: 5 From Each Partners
ü  Workflows would be in accordance with the Events that are configured based on AN & PN.
ü  Sessions would execute on the basis of status of previous session.
Informatica DX provides us the platform to configure and monitor proper conversion of PDF documents. It also helps us to analyze the events and categorize the issue as:
1)     DX Configuration level
2)     Workflow level
3)     Session Level
4)     Data Transformation Level
These converted Text documents are further used by Business for Reporting purpose.