Job description:
Job Summary:
We are seeking a highly skilled Team Member with 12 years of experience to join our dynamic team.
The ideal candidate will have expertise in Cloudera Data Platform, Cloudera HUE, Cloudera, Scala, Spark, and PySpark.
This hybrid role requires a strong technical background and the ability to work effectively in a collaborative environment.
Responsibilities :
Develop and maintain data pipelines using Cloudera Data Platform, ensuring efficient data processing and storage.
Implement and optimize data workflows with Cloudera HUE to enhance data accessibility and usability.
Utilize Cloudera tools to manage and monitor data clusters, ensuring high availability and performance.
Write and maintain Scala code for data processing tasks, ensuring code quality and performance.
Develop and optimize Spark applications to handle large-scale data processing tasks.
Utilize PySpark for data transformation and analysis, ensuring data integrity and accuracy.
Collaborate with cross-functional teams to understand data requirements and deliver solutions that meet business needs.
Provide technical support and troubleshooting for data-related issues, ensuring minimal downtime and disruption.
Conduct code reviews and provide feedback to ensure adherence to best practices and coding standards.
Participate in the design and implementation of data architecture, ensuring scalability and performance.
Stay updated with the latest industry trends and technologies to continuously improve data solutions.
Document data processes and workflows to ensure knowledge sharing and continuity.
Contribute to the overall success of the team by sharing knowledge and mentoring junior team members.
Qualifications
Must have strong experience with Cloudera Data Platform, Cloudera HUE, and Cloudera tools.
Must be proficient in Scala and have experience writing and maintaining Scala code.
Must have hands-on experience with Spark and PySpark for data processing and analysis.
Nice to have experience with other big data technologies and tools.
Must have excellent problem-solving skills and the ability to troubleshoot complex data issues.
Must be able to work effectively in a hybrid work model and collaborate with remote team members.
Must have strong communication skills and the ability to explain technical concepts to non-technical stakeholders
Certifications Required :
Cloudera Certified Associate (CCA) Spark and Hadoop Developer, Cloudera Certified Professional (CCP) Data Engineer
Any Graduate