We are hiring an experienced Data Engineer to join our team in Chennai. This is a critical role focused on building scalable data pipelines, supporting the data platform migration to a Common Data Platform (CDP), and ensuring high-quality data delivery for business analytics and operations.
Key Responsibilities:
Data Engineering & Development:
- Design and build robust ETL/ELT pipelines using PySpark and SQL.
- Manage and optimize data within dimension and fact tables.
- Clean, transform, and enrich data across Bronze, Silver, and Gold layers of the data lake.
- Develop and maintain data ingestion and aggregation pipelines for internal and external data sources.
Data Platform & Infrastructure:
- Lead the migration from legacy platforms to the Common Data Platform (CDP) using Talend or Informatica.
- Update and maintain E-R models and perform data cataloging.
- Replicate existing ETL transformations in new environments.
- Support quality assurance activities, including Technical QA and Business QA.
Technical & DevOps Integration:
- Utilize AWS services and cloud-based infrastructure for scalable data management.
- Set up and maintain CI/CD pipelines using GitHub Actions.
- (Preferred) Use Terraform for infrastructure as code (IaC) and Airflow for workflow orchestration.
- Support on-prem and cloud environments and apply Medallion Architecture principles.
Required Skills:
- PySpark & SQL expertise for data processing and analytics.
- Strong experience with ETL tools and data warehousing concepts.
- Hands-on experience with AWS cloud services.