- We are seeking a skilled and proactive Operations Engineer with 3 to 8 years of IT experience, including at least 3 years of relevant hands-on expertise in system operations, automation, cloud provisioning, and technical issue resolution.
- You will play a vital role in maintaining the stability, performance, and scalability of our infrastructure and applications, both on-premises and in the cloud.
Essential Job Functions:
- Manage and troubleshoot complex infrastructure and application issues across global environments.
- Provision, configure, and maintain Linux systems in cloud environments, including:
  - Oracle Cloud Infrastructure (OCI)
  - AWS
- Develop and maintain automation scripts using Python, Shell, or similar languages to reduce manual work and improve reliability.
- Support and manage applications built in Java, Python, and Ruby at scale.
- Design, implement, and optimize configuration management, CI/CD, and container tooling, such as:
  - Ansible
  - Puppet
  - Jenkins
  - Docker
  - Kubernetes
- Monitor system performance and availability using modern observability and alerting tools:
  - Nagios
  - Prometheus
  - Grafana
  - Performance Co-Pilot (PCP)
- Ensure system security, performance tuning, and patch management across various environments.
- Collaborate with development, QA, and infrastructure teams to streamline CI/CD pipelines and operational workflows.
- Participate in a global 24x7 on-call rotation and incident response activities.
- Maintain detailed documentation of operational processes and system configurations.
- Contribute to continuous process improvements, root cause analysis, and knowledge base updates.
- Ensure compliance with industry best practices, ITIL, and CMMI standards.
Qualifications:
Technical Skills & Requirements
- Strong experience in Red Hat Enterprise Linux or Oracle Enterprise Linux environments.
- Solid understanding of container and orchestration platforms such as Docker and Kubernetes.
- Working knowledge of Oracle Database, RDBMS concepts, and SQL queries is a plus.
- Proficiency in TCP/IP networking, Linux-based firewalls, VPNs, and routing.
- Familiarity with infrastructure-as-code and Git-based version control.
- Strong analytical and diagnostic skills, particularly in remote troubleshooting scenarios.
- Experience working in Agile/DevOps environments is desirable.
Soft Skills & Attributes
- Strong communication and interpersonal skills; ability to engage with cross-functional teams.
- Self-driven with the ability to work independently and manage priorities effectively.
- Continuous learning mindset and proactive approach to identifying areas of improvement.
- Ability to work under pressure and manage multiple tasks in a dynamic environment.
- Willingness to mentor junior team members and share knowledge.