Description

  1. Provide operational support and troubleshooting for cloud infrastructure deployed on AWS.
  2. Maintain and support Kubernetes clusters (EKS or self-managed), ensuring high availability and performance.
  3. Support infrastructure-as-code configurations and environments built using Terraform.
  4. Respond to incidents and service requests, ensuring timely resolution or escalation.
  5. Monitor system performance using tools like CloudWatch, Splunk or Datadog.
  6. Assist in deploying new environments, services, and applications using CI/CD pipelines.
  7. Collaborate with DevOps and engineering teams to improve automation and reduce manual intervention.
  8. Document runbooks, procedures, and known issues to improve operational readiness.

Required Skills and Qualifications:

  1. 4+ years of hands-on experience with AWS, especially in support and SRE roles.
  2. Experience with Kubernetes (EKS) for managing containerized workloads.
  3. Familiarity with Terraform for managing infrastructure-as-code.
  4. Basic scripting skills in Python and Ansible
  5. Understanding of networking concepts such as VPCs, DNS, Load Balancing, and Security Groups.
  6. Strong troubleshooting and problem-solving skills.
  7. Familiarity with monitoring, alerting, and logging tools.
  8. Good communication skills and ability to work in a collaborative environment.


Nice to Have:

  1. AWS Certified Solution Architect Associate.
  2. Certified Kubernetes Administrator (CKA)
  3. Experience with ITIL practices or ticketing systems (e.g., Jira, ServiceNow).
  4. Exposure to CI/CD tools like Jenkins, Harness and AWS Code pipeline

Education

Any Gradute