Key Responsibilities Include:
● Design, implement and operate enterprise-grade, secure, instrumented (alerting/monitoring), fault-tolerant infrastructure.
● Build infrastructure automation tools and frameworks leveraging Terraform/OpenTofu, SaltStack/Ansible, Docker, Kubernetes/Helm, and Bash and Python scripts.
● Continuously evaluate and identify current system bottlenecks and implement solutions to improve the scalability of our infrastructure.
● Work with extended teams to manage systems and infrastructure required for ongoing development and releases.
● Consult with internal and external stakeholders on the best way to accomplish a given task.
What You’ll Bring:
● 5+ years of relevant experience
● Programming experience in one or more languages (e.g., Python)
● Experience using provisioning and configuration management tools
● Experience with containerization and Kubernetes
● Experience with open-source monitoring and alerting systems
● Significant on-prem and cloud (AWS, Azure, or GCP) experience, including familiarity with their design patterns, limitations, and cost-containment techniques
● Prior experience implementing significant technology redesign projects
Any Graduate