Perform data validation and cross-checking of data across different sources and lines of business to ensure accuracy and consistency following our cloud migration.
Validate ETL pipelines and business logic to ensure data is transformed and loaded correctly into our cloud data platform.
Design and develop basic ETL flows and data assets using Databricks, Python, and SQL.
Utilize data engineering skills to assist in the design of a summary data layer and other key data assets.
Develop and maintain data documentation, including metadata and column definitions.
Collaborate with business analysts and stakeholders to understand data requirements and translate them into technical solutions.
Skills:
Proven experience in data validation and data quality assurance.
Proficiency in Databricks, Python, and SQL for data manipulation and ETL development.
Experience in designing and implementing basic ETL processes.
Knowledge of cloud-based data platforms (Azure).
Strong analytical skills with the ability to identify and resolve data discrepancies.
Familiarity with data governance principles and best practices