Description

We are looking for a highly capable and self-driven QA Engineer with strong experience in data engineering and cloud-based testing. The ideal candidate will have hands-on expertise in Python, PySpark, SQL, and AWS services, and will be expected to contribute immediately to ongoing projects involving big data pipelines and cloud automation.

Key Responsibilities:
Design, develop, and execute automated test scripts for data pipelines and cloud-based applications.
Validate data transformations and integrity across distributed systems using PySpark and SQL.
Execute and monitor AWS EMR jobs and validate outputs.
Test and troubleshoot AWS Lambda functions and related workflows.
Perform validations using Unix shell scripts and AWS CLI.
Collaborate with developers, data engineers, and DevOps teams to ensure high-quality releases.
Document test plans, test cases, and test results clearly and concisely.

Required Technical Skills:
Candidate must be proficient and able to work independently with the following:
Python – for scripting and automation.
PySpark – for validating distributed data processing jobs.
SQL – for data validation and backend testing.
AWS EMR – ability to run and validate EMR jobs.
AWS Lambda – understanding and testing of serverless functions.
Unix/Linux – command-line operations and scripting.
AWS CLI – for interacting with AWS services programmatically

Education

Any Gradute