About the job
The job description you’ve shared is for a Site Reliability Engineer (SRE) role focused on IT infrastructure, cloud management, and automation. It is a technical position at DFI Retail Group, a major retailer in Asia, in which the candidate designs, builds, and maintains the systems that support the company’s applications and services.
Here are the key responsibilities and qualifications for this role:
Responsibilities:
Automated Systems Implementation: Designing and implementing automated systems for monitoring, alerting, and incident response within Google Cloud and Azure infrastructure.
Atlassian Product Management: Managing Atlassian tools such as Jira, Confluence, and Bitbucket.
Collaboration with Development Teams: Partnering with dev teams to build and deploy highly available (HA) and scalable applications and services.
CI/CD Pipeline Management: Developing and managing continuous integration and continuous deployment (CI/CD) processes.
Performance and Capacity Management: Carrying out capacity planning and performance tuning to keep systems running efficiently.
Documentation: Maintaining comprehensive documentation related to infrastructure, applications, and programs.
On-call Support: Participating in on-call rotations to ensure system availability and provide support for production systems.
Desired Skills and Experience:
Experience: At least 3 years in an SRE or DevOps role with a focus on Linux-based infrastructure.
Infrastructure Knowledge: Strong understanding of infrastructure concepts and practices.
Cloud Platforms: Familiarity with cloud services, particularly Azure, Google Cloud, and AWS.
Scripting: Competency in scripting languages like Python, Ruby, or Bash.
Containerization: Experience with Kubernetes and Docker for managing containerized applications.
Automation and Configuration Management: Knowledge of tools like Ansible, Puppet, or Chef.
Monitoring Tools: Familiarity with monitoring and logging systems such as Prometheus, Zabbix, Grafana, and the ELK stack.
Problem-Solving and Teamwork: Strong problem-solving skills and the ability to collaborate effectively within a team.
Communication: Excellent communication skills to coordinate with different teams and stakeholders.
Additional Information:
The role offers an opportunity to work with a leading retailer in the Pan-Asia region.
The company is an equal opportunity employer and handles personal information in compliance with relevant laws.
Education: Any graduate.