JD :
● Design and implement scalable data pipelines for feature extraction, transformation, and loading (ETL) using technologies such as Pyspark, Scala, and relevant big data frameworks.
● Govern and optimize data pipelines to ensure high reliability, efficiency, and data quality across on-premise and cloud environments.
● Collaborate closely with data scientists, ML engineers, and business stakeholders to understand data requirements and translate them into technical solutions.
● Implement best practices for data governance, metadata management, and compliance with regulatory requirements.
● Lead a team of data engineers, providing technical guidance, mentorship, and fostering a culture of innovation and collaboration.
● Stay updated with industry trends and advancements in big data technologies and contribute to the continuous improvement of our data engineering practices.
Any Graduate