Skills Required:
Problem Solving, Python, Shell Scripting
Desirable Skills:
Apache Kafka, Apache Pulsar, Ansible, GitHub
About the Role:
Site Reliability Engineer 1
You are responsible for:
Handling daily on-call responsibilities and ensuring timely acknowledgment of alerts.
Following the runbook to troubleshoot and resolve issues effectively.
Responding to user queries and addressing concerns raised through Jira and Zenduty alerts.
Performing minor script modifications to support maintenance activities.
Proactively identifying and escalating critical issues to the appropriate teams when needed.
Ensuring seamless communication and coordination with cross-functional teams.
Documenting and updating processes, runbooks, and resolutions for future reference.
To succeed in this role, you should have the following:
Strong understanding of IT service management processes, including incident and alert handling.
Familiarity with tools such as Jira, Zenduty, and other alerting platforms.
Basic scripting knowledge (e.g., Bash, Python) for minor modifications and troubleshooting.
Ability to follow structured processes and runbooks with attention to detail.
Good problem-solving skills to quickly identify and resolve issues.
Effective communication skills for responding to user queries and collaborating with stakeholders.
A proactive and customer-focused attitude.
Familiarity with maintenance activities and general system administration tasks.
Any Graduate