Description

We are seeking a highly skilled AWS Cloud Engineer with SRE (Site Reliability Engineering) skills to support and maintain scalable, secure, and highly available cloud infrastructure solutions in AWS. The ideal candidate will be a key member of the Platform engineering team and ensure service uptime, provide infrastructure architecture guidance to support modern cloud-native applications with high performance and resilience.

 

​Key Responsibilities:

  • Architect and implement resilient cloud infrastructure using AWS services such as EC2, ECS/EKS, RDS, Lambda, S3, CloudFormation, etc.
  • Lead SRE practices such as monitoring, alerting, incident response, and post-mortems.
  • Design and enforce infrastructure-as-code (IaC) strategies using Terraform or CloudFormation.
  • Build and maintain CI/CD pipelines for automated deployment and testing.
  • Establish observability frameworks using tools like CloudWatch, Datadog, Prometheus, Grafana, or ELK.
  • Automation of finops processes and scaling capabilities in the AWS environments.
  • Run and drive Proof of concepts for new EKS setup for multiple applications using automation tools like ArgoCD and Kubernetes management platforms. 
  • Proficiency in scripting like python and AWS CLI to create Lambda function to exports and forward/integrate logs as well as S3 files with multiple tools.
  • Define and monitor SLOs/SLIs, conduct capacity planning, and performance tuning.
  • Collaborate with security, development, and operations teams to ensure end-to-end system reliability and compliance.

Required Qualifications:

                •             Bachelor’s degree in Computer Science, Engineering, or related field.

 

                •             8+ years in IT operations or DevOps roles, with 5+ years in a SRE capacity. AWS certification is a plus

                •             Strong expertise with AWS cloud services and architecture.

                •             Proficiency in Terraform, CloudFormation, or similar IaC tools.

                •             Experience with Kubernetes (EKS preferred), containers, and microservices.

                •             Deep understanding of CI/CD tools like Jenkins, GitLab CI, or AWS CodePipeline.

                •             Solid experience in monitoring and alerting systems.

                •             Strong scripting skills (Python, Bash, etc.).

                •             Excellent problem-solving and incident management skills

 

​Preferred Qualifications:

                •             AWS Certifications (e.g., AWS Certified Solutions Architect – Professional, DevOps Engineer).

                •             Experience with multi-account AWS landing zones and service control policies (SCPs).

                •             Knowledge of FinOps, cost optimization, and security best practices.

                •             Familiarity with service mesh (Istio, Linkerd) and GitOps

Education

Bachelor's degree