Job Description:
• Experience with SRE design for reliability and resiliency, targeting five-nines (99.999%) availability
• Experience working in cloud environments (OCP and AWS EMR).
• Experience with application monitoring tools, observability, and performance assessments.
• Strong experience with DevOps (CI/CD pipelines with Jenkins or similar, Git/GitHub)
• High level of familiarity with the Linux command line and scripting
• Proven skills in high availability and scalability design, as well as performance monitoring and testing
• Experience developing automation solutions in Java, Bash, Python, Perl, or similar languages
• Extremely comfortable with production environments, firewalls, and networking
• Knowledge of architectural patterns at scale, including thoughtfully designed APIs, repeatable delivery pipelines, and efficient computer engineering principles.
• Experience as part of an Agile Engineering or development team
• Strong experience in deploying, observing, alerting on, logging, and monitoring systems (Splunk, Datadog), with a mindset toward predictive analysis.
• Working knowledge of automation tools such as Ansible and Terraform.
Education: Any graduate