About the Course
Poor data quality defeats the very purpose of ETL (Extract, Transform and Load). Informatica has an automated data quality assurance process which developers must master to create enhanced productivity.
The Data Quality
Data Quality Management for the Developer will train the students in core data assurance methodologies that help them cleanse, standardize and enhance data for optimal usage. The instructor-led course will span 4 days and will also include advanced topics on data parsing, matching, troubleshooting, exceptions, consolidation management, etc.
What are the intended objectives of this course?
This course intends to make students experts at automated data quality assurance using Informatica. The course will also make students capable of project collaboration, managing reference tables, cleansing, parsing, standardizing Mappings and Mapplets and even identify duplicate records using Classic Data Matching.
Who are the ideal candidates for this course?
The course is best suitable for developers who use Informatica on a regular basis. Candidates with prior experience or fundamental knowledge of the automated data quality process in Informatica can perfect their skills with this course.
Training Format
1) Instructor-Led Online (Virtual) Training
2) Public Classroom Training
3) Self-paced video Training
4) Workshop
Course Duration
a) 4 Days
b) 30 Hours
Course Cost
$500 to $600
Is there a defined agenda for the course?
Yes. The course will follow a definite agenda broken down into several modules. Each module will deal with specific topics related to the data quality assurance process. There will be 14 modules in total.
Module 1: Course Introduction
The first module will explain the course topics, module workflow and how they progress throughout the course duration.
Module 2: Data Quality Process Overview
The second module with deal with the various dimensions of data quality and how they are managed using a Data Quality Management Process Cycle. Other topics like Data Quality Architecture and the various roles and tools used by developer and analysts will also be discussed under this module.
Module 3: Data Quality Projects and Solutions
The third module will elaborate several projects that can possibly benefit from cleansed and standardized data, Data Quality Use Cases, typical projects related to DI/DQ projects, Reporting, Gating and Cleansing projects and also Solution Architecture for Projects with Data Quality.
Module 4: Project Collaboration and Reference Table Management
The 4th module will deal with project collaboration elements and reference table management along with 2 lab sessions for building reference tables and their Analyst review.
Topics covered:
Developer Interface
Understanding Analyst projects, Data Objects, Profiles, Rules, Scorecards, Comments and Tags
Reference Tables and the Data Quality Process
Creating Reference Tables
Workshop
Review a project created by an Analyst
Build Reference Tables
Module 5: Working in the Developer Tool
1) Tasks in the Developer Tool
2) Working with Physical and Logical Data Objects
3) Connecting to a table
4) Importing and flat file
5) Creating logical data objects
6) Developer Transformations
7) Mappings and Mapplets
8) Content sets and their uses
9) Developer Tips and Tricks
Workshop
Create a project and assign permissions
Create a connection to an Oracle table and import a flat file
Build a Logical Data Object
Module 6: Profiling, Mapplets and Rules
1) Column Profiling
2) Mapplets and Scorecards
3) Profiling techniques to debug and improve development
4) Updating Scorecards with Rules
Workshop
1) Create a Rule to measure the Accuracy of data in a field.
2) Using Informatica Analyst, apply the rule to a Scorecard and review the results.
Module 7: Standardizing, Cleansing and Enhancing Data
1) Standardizing, cleansing and enhancing data.
2) Mappings that cleanse, standardize and enhance data
3) Developing standardization maplets
4) Configuring standardization transformations
Workshop
Build Standardization Mapping and Maplets using Standardization Transformations.
Module 8: Parsing Data
1) The Parsing Process
2) Parsing techniques
3) Key parsing transformations
Workshop
1) Perform Parsing using a variety of Parsing Transformations
2) Complete a Standardization Mapping
Module 9: Matching Data
1) Match Data definition
2) The DQ matching process
3) The different stages of Matching
4) Grouping and its effect on matching
5) Grouping methods
6) Grouping results and refining a grouping strategy
7) Match algorithms
Workshop
Build and fine-tune a grouping and matching mapping
Module 10: Manual Exception and Consolidation Management
1) Exception and Duplicate record management
2) Exception Management Process.
3) Populating tables with exception and duplicate record tasks
Workshop
1) Build a Mapping that can be used to identify Exception data
2 )Build a Mapping that can be used to identify Duplicate data
Module 11: Building, Managing and Deploying Workflows
1) Workflows and Workflow Tasks
2) Human Tasks and Steps
3) Identifying exception and duplicate records
4) Deploying and executing workflows
5 )Verifying Tasks in Informatica Analyst
Workshop
1) Build a Workflow to populate the Analyst Inbox with Exception Tasks
2) Build a Workflow to populate the Analyst Inbox with Duplicate Record Tasks
Module 12: Deploying: Executing Mappings outside of the Developer tool
1) Deployment options.
2) Mappings as applications
3) Scheduling mappings, profiles and Scorecards
Workshop
Schedule Mappings to run using Informatica Scheduler
Module 13: Importing and Exporting Project Objects
1) Export/import project use cases
2) Basic and Advanced Import options
3) Exporting a project
Workshop
1) Import a Project using the Basic method.
2) Import a Project using the Advanced Method.
3) Export a Project
Module 14: Troubleshooting
1) Common Developer errors
2) Common Mapping and Transformation configuration issues
3) Common Workflow configuration errors
4) Tips for working with the Developer tool
Workshop (Optional)
Troubleshoot Mapping configuration issues
Informatica Course topics to learn
- PowerCenter 10: Developer Level 2
- Data Discovery and Advanced Profiling 10
- MDM 10.1: Using Informatica Data Director
- Data Quality: Data Quality Management for the Developer
- 365onDemand for Administrators
- Data Quality V10: Administration
- PowerCenter 10: Administration
- 365onDemand Data Quality for Developers
- Data Integration Hub Developer 9.6
- Informatica Developer Tool 10.1 Big Data Management
- MDM 10.1: Configuring Informatica Data Director