We are seeking a highly skilled and experienced Senior Systems Administrator with a strong background in managing and operating cloud platforms like OCI, AWS, Azure, or GCP. The ideal candidate should possess excellent troubleshooting, automation, and monitoring skills, along with expertise in managing hybrid infrastructures across globally distributed data centers. This role demands a strong understanding of Linux/Unix systems, cloud-native observability tools, database management, and automation workflows.
What you’ll do & how you’ll make your mark.
1. Cloud Infrastructure Management:
Manage and optimize OCI and other cloud platforms, including Compute, Storage, Networking, and Observability tools.
Implement and manage cloud IAM policies, NSGs, Security Lists.
Automate provisioning, scaling, and management of cloud resources using tools like Terraform and Ansible.
Diagnosing issues with connectivity, performance, and security, using monitoring and logging tools like CloudWatch, Grafana, and Prometheus.
2. System Operations:
Conduct regular patch management and system maintenance to ensure platform stability.
Troubleshoot and resolve hardware, OS, and database issues for Linux/Unix systems.
3. Monitoring & Observability:
Set up and maintain system and application monitoring using tools like Grafana, Prometheus, Zabbix, and OCI Monitoring.
Proactively identify and resolve performance issues using dashboards and alerts.
4. Automation & Documentation:
Automate repetitive tasks using Bash, Python, or Ruby and maintain operations documentation.
Use tools like Terraform for Infrastructure as Code and configuration management.
5. Incident Management:
Handle incidents and escalations, ensuring SLAs and uptime standards are met.
Collaborate with engineering teams for root cause analysis (RCA) and permanent fixes.
6. Database & Application Support:
Manage and troubleshoot databases like MySQL and PostgreSQL, and web servers like Apache and Nginx.
7. Collaboration & Shift Management:
Work with global teams, ensuring smooth hand-offs in a 24x7 environment
Who you are & what you’ll need to succeed.
Advanced english skills
6+ years in managing and operating hybrid infrastructure
Diploma/BE/BCA/MCA/MSc in Computer Science or IT.
Certifications such as OCI Certified Architect Associate, AWS Certified Solutions Architect, or Azure Administrator Associate are a plus.
Hands-on experience with OCI or any other cloud platform (AWS, Azure, or GCP).
Proficiency with Grafana, Prometheus, Zabbix, and similar tools.
Strong fundamentals, including troubleshooting, file systems, and OS tools.
Networking: Knowledge of DNS, HTTP, TCP/UDP, and the OSI model.
Scripting: Proficiency in Bash, Python, or Ruby. (Fundamental)
Experience with Terraform and IaC tools. (Understading is a must)
Database Management: Intermediate-level skills in MySQL, PostgreSQL, and backup/recovery processes.
Experience with DDoS mitigation, web caching (e.g., Nginx, Varnish), and hybrid cloud integration.
Knowledge of virtualization and configuration management tools like Ansible or Puppet.
Advanced troubleshooting of hardware RAID controllers and cloud automation using OCI CLI.
Any Graduate