Key Skills: Pyspark, Big Data, Python, Data Engineer, Data Engineering, Hadoop, Hive
Roles and Responsibilities:
- Lead the engineering team in analysis, design, coding, and release activities
- Collaborate with business stakeholders, product owners, and architects to deliver successful business outcomes
- Plan and develop comprehensive engineering solutions that meet business objectives
- Ensure the reliability and resiliency of solutions through rigorous testing and review processes
- Advocate for engineering best practices and mentor team members to achieve high performance
- Participate in industry forums to promote innovative technologies and solutions within the bank
- Acquire functional knowledge of the business capabilities being digitized or re-engineered
Skills Required:
- Strong hands-on experience with PySpark for building scalable data processing pipelines
- Deep understanding of Big Data technologies and distributed data architectures
- Proven expertise in data engineering including ETL/ELT processes and data integration workflows
- Proficiency in Python for scripting, automation, and data transformation tasks
- Familiarity with Hadoop ecosystem for storage and batch data processing (nice-to-have)
- Experience in using Hive for querying structured datasets in a Big Data environment (nice-to-have)
Education: Bachelor's Degree in related field