Key Skills: Kubernetes, Terraform, GitOps, Flux, Docker, Prometheus, Grafana, GitHub Actions, Azure DevOps, AWS, GCP, Azure, CI/CD, Microservices, Service Mesh, Ingress Controller, Infrastructure as Code, Bash, PowerShell, Security Compliance
Roles and Responsibilities:
- Design and management of cloud-native system architectures using Kubernetes, guaranteeing high availability, top performance, and reliability, with a focus on continuous operation and improvements.
- Implement and manage Terraform infrastructure as code to provision and manage cloud resources, ensuring consistency and repeatability across environments.
- Develop and maintain GitOps workflows using tools like Flux or similar, to automate deployment of applications and ensure consistency across environments.
- Design and operation of container-based hosting environments using Kubernetes, including management of pods, services, and deployments.
- Management and configuration of cloud-native networking, including load balancers, ingress controllers, and service meshes.
- Support incident and problem management to identify root cause and fix, define and put in place improvements to avoid the problem occurring again, with a focus on automation and self-healing systems.
- Interact with development teams to understand their requirements, help them to find the best technological solution for their needs, and ensure seamless integration with cloud-native infrastructure.
- Analyze and improve systems' performance & scalability, and automate systems management using tools like Prometheus, Grafana, and Kubernetes dashboard.
- Undertake regular review of IT Security vulnerabilities related to Infrastructure, participate in remediation work to improve Infra security scores, and ensure compliance with organizational security policies.
- Provide 3rd line support where required in an on-call roster covering one or more of our applications, with a focus on resolving complex technical issues and improving overall system.
Experience Requirement:
- 3-8 years of strong experience with Kubernetes, including cluster management, Networking fundamentals, pod management, and service management.
- Experience with infrastructure as code, including provisioning and managing cloud resources.
- Experience with GitOps workflows.
- Strong understanding of cloud-native architectures, including microservices, service meshes, and containerization.
- Experience with continuous integration and continuous delivery (CI/CD) pipelines, including Docker builds.
- Strong problem-solving skills, with the ability to troubleshoot complex technical issues and improve overall system reliability.
- Excellent communication skills, with the ability to work closely with development teams and other stakeholders to ensure seamless integration with cloud-native infrastructure.
- Experience with Terraform/OpenTofu.
- Experience with FluxGitOps tool.
- Familiarity with NGINX Ingress Controller, including configuration, deployment, and troubleshooting, with knowledge of features such as path-based routing, SSL/TLS termination, and load balancing.
- CI/CD tools like GitHub Actions / Azure DevOps pipelines or similar.
- Experience with cloud providers like AWS, GCP, or Azure, including management of cloud resources and services.
- Understanding of IT security principles, including network security, identity and access management, and compliance.
Education: Any Graduation