Description

We are looking for a Site Reliability Engineer – Linux, who approaches their work with passion, a hunger for learning and growth, and a steadfast commitment to delivering outstanding results. If you're a team player with a positive mindset, keen to make a meaningful impact, we encourage you to reach out to us!

What you will do:
· Manage distributed infrastructure with open-source technologies across multiple datacenters

· Ensure product SLAs, perform capacity planning, and address critical issues in a 24/7 on-call rotation.

· Explore and implement innovative platforms As a service solution to support and enhance the efficiency of technical SRE teams.

· Utilize data and metrics for decision-making, focusing on security and best practices.

· Prioritize robust automation and scripting to reduce dependence on manual procedures

Who you are:
· Strong understanding of Linux internals, OS fundamentals, and core network principles.

· Basic familiarity with relational databases (PostgreSQL, MySQL) and NoSQL databases (Redis, MongoDB).

· Proficient in container orchestration tools like OpenShift, Kubernetes, Docker Swarm, or Apache Mesos.

· Experienced in administering and troubleshooting configuration management tools such as Puppet, Ansible Tower (AWX), or Chef.

· Hands-on experience in load balancer administration (HAProxy, Nginx, and F5).

· Hands-on experience with caching technologies such as Redis, Nginx+, Varnish, or Memcached.

· Skilled in monitoring and logging stacks such as Grafana, InfluxDB, Graphite, Prometheus, ELK, and Graylog.

· Hands-on experience with web servers like Nginx, Apache, or Tomcat.

· Skilled in at least one scripting language such as Python, Golang.

 

Education

Any Graduate