Description

Job Description

We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and systems administration, with a focus on creating scalable and reliable systems. You will work closely with development and operations teams to ensure the reliability, availability, and performance of our services.

Key Responsibilities

Collaborate with engineering teams to design and implement scalable, robust systems.
Ensure the reliability and performance of our services through monitoring, incident response, and capacity planning.
Develop and maintain automation tools for system provisioning, configuration management, and deployment.
Implement and manage monitoring tools to ensure visibility into the health and performance of our systems.
Lead incident response efforts, perform root cause analysis, and implement preventative measures.
Utilize Infrastructure as Code (IaC) practices to manage and provision infrastructure.
Work closely with development and operations teams to ensure smooth deployments and continuous improvement of processes.
Ensure that our systems are secure and comply with industry standards and best practices.
Create and maintain detailed documentation for systems and processes.


Qualifications

Bachelor’s degree in computer science, Information Technology, or a related field, or equivalent experience.
3+ Years experience as a Site Reliability Engineer or in a similar role.
Experience with cloud platforms (e.g., Azure & AWS Exp).
Strong background in Linux/Unix administration.
Proficiency in programming languages such as Python, Go, or Ruby.
Experience with configuration management tools (e.g., Ansible, Puppet, Chef).
Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).
Knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack, loggly).
Understanding of networking concepts and protocols.
Excellent problem-solving skills and attention to detail.
Strong communication and collaboration skills.
Ability to work in a fast-paced, dynamic environment.
 

Preferred Qualifications

Experience with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
Familiarity with database management (e.g., MySQL, PostgreSQL, MongoDB).
Experience with distributed systems and microservices architecture.
Certification in relevant technologies (e.g., AWS Certified Solutions Architect).


Exp Required: 3-7 Years

 

Education

Any Graduate