Required Skills and Qualifications:
- Minimum of 8 years of hands-on experience with Informatica Cloud Data Quality (CDQ)/ Informatica Data Quality toolset (version 9.7 or higher, preferably 10.x).
- Experience with Informatica PowerCenter is a plus.
Technical Skills:
- Proficiency in CDQ/IDQ development, including data profiling, cleansing, parsing, standardization, verification, matching, and exception monitoring.
- Perform thorough data profiling to identify data quality issues, usage patterns, and metadata characteristics using Informatica CDQ/IDQ, Excel, and other tools. Conduct root cause analysis to address data anomalies.
- Design and implement data quality mappings, rules, and workflows for cleansing, standardization, parsing, validation, and de-duplication using CDQ/IDQ transformations (e.g., Parser, Address Validator, Match, Key Generator).
- Develop and execute ETL jobs for address standardization, email cleanup, name parsing, and other data quality tasks. Integrate CDQ/IDQ processes with Informatica PowerCenter and other ETL tools as needed.
- Create matching plans, configure identity matching algorithms, and analyze duplicates to ensure data consistency and accuracy.
- Strong knowledge of ETL processes, data integration, and transformations (e.g., Address Validator, Parser, Match).
- Expertise in SQL and database management systems (e.g., Oracle, MySQL, SQL Server)
- Expertise in SQL and database management systems (e.g., Oracle, MySQL, SQL Server).
- MDM concepts with respective to CDQ/IDQ
- Analytical Skills: Excellent data analysis and profiling skills, with the ability to identify trends, patterns, and anomalies in complex data sets.
- Communication Skills: Strong verbal and written communication skills to collaborate with stakeholders and present data quality metrics to management.
- Certifications: Informatica certification in CDQ/IDQ or related tools is highly desirable.
Others:
- Knowledge of data governance, MDM concepts with respective to CDQ/IDQ, and agile methodologies.
- Experience with cloud environments or modern data stacks (e.g., Snowflake, Azure) is a Plus.