Description

In this role you will be responsible for designing, developing, and maintaining data pipelines in support of data engineering and data management activities. You must be passionate about data engineering and data quality with a solid background in AWS, Databricks, Python, Trino/Starburst and SQL.

 

Your role as a Data Engineer: 

 

  • Design, develop, monitor, and maintain data pipelines in an AWS ecosystem with Databricks, Delta Lake, Python, SQL and Starburst as the technology stack. Collaborate with cross-functional teams to understand data needs and translate them into effective data pipeline solutions.
  • Establish data quality checks and ensure data integrity and accuracy throughout the data lifecycle.
  • Automate testing of the data pipelines and configure as part of CICD
  • Optimize data processing and query performance for large-scale datasets within AWS and Databricks environments.
  • Document data engineering processes, architecture, and configurations.
  • Troubleshooting and debugging data-related issues on the AWS Databricks platform.  
  • Integrating Databricks with other AWS products such as SNS, SQS, and MSK.

 

What we are looking for: 

  • Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
  • Minimum of 5 years of experience in data engineering roles, with a focus on AWS and Databricks.
  • Highly proficient with Databricks, Spark, Starburst/Trino, Python, PySpark and SQL
  • Hands-on experience in Gitlab with CI/CD.
  • Hands-on experience in AWS Services like S3, RDS, Lambda, SQS, SNS, MSK is required.
  • Strong SQL skills to perform data analysis and understanding of source data.
  • Experience with data pipeline orchestration tools
  • Experience with ETL tools such as Informatica PowerCenter is a plus
  • Ability to troubleshoot complex data issues and implement effective solutions.
  • Strong communication and interpersonal skills.
  • Ability to work collaboratively in a team-oriented environment.
  • Proactive in staying updated with industry trends and emerging technologies in data engineering

Education

Bachelor’s or Master’s degree