Description

Experienced AWS Cloud Operations Engineer. The ideal candidate will be responsible for managing and maintaining our AWS cloud infrastructure, ensuring optimal performance, reliability, and security. This role involves collaborating with development, operations, and security teams to support and enhance our cloud environment.

Key Responsibilities:
Cloud Infrastructure Management:
o Deploy, configure, and manage AWS cloud resources, including EC2 instances, S3 buckets, RDS databases, and VPCs.
o Implement and manage AWS governance policies, including IAM roles and policies, tagging, and cost management.

Monitoring and Optimization:
o Monitor cloud infrastructure performance using AWS CloudWatch, CloudTrail, and other monitoring tools.
o Identify and resolve performance bottlenecks and optimize resource utilization.
o Implement and maintain automated monitoring, alerting, and reporting systems.

Security and Compliance:
o Ensure cloud infrastructure adheres to security best practices and compliance requirements.
o Implement and manage AWS security services, such as AWS IAM, AWS Shield, and AWS Key Management Service (KMS).
o Conduct regular security assessments and audits, and implement remediation plans.

Automation and Scripting:
o Develop and maintain automation scripts using AWS CLI and CloudFormation.
o Automate routine operational tasks, such as resource provisioning, configuration management, and backup processes.
o Expertise in working with Terraform key features such as Infrastructure as code (IaC), Resource Graphs and change automation
o Experience with containerization and related technologies like Kubernetes, for creating development pipelines.
o Experience in managing Kubernetes charts using Helm.
o Experience in configurating and managing source code using GIT, including resolving code merging conflicts.

Incident and Problem Management:
o Respond to and resolve cloud infrastructure incidents and outages in a timely manner.
o Perform root cause analysis and implement corrective actions to prevent recurrence.
o Maintain detailed incident and problem records.

Collaboration and Support:
o Collaborate with development and operations teams to support application deployments and releases.
o Provide technical support and training/guidance to internal teams on AWS cloud services and best practices.
o Participate in on-call rotation for after-hours support as needed.

Qualifications:
· 5+ years of experience in managing AWS cloud infrastructure and services.
· AWS certifications, such as AWS Certified Developer / SysOps Administrator.
· Expertise with DevOps practices and CI/CD pipelines.
· Expertise with tools such as Datadog, Splunk, Wiz, Harness

Education

Bachelor's degree