We are looking for a Site Reliability Engineer – Linux, who approaches their work with passion, a hunger for learning and growth, and a steadfast commitment to delivering outstanding results. If you're a team player with a positive mindset, keen to make a meaningful impact, we encourage you to reach out to us!
What you will do:
· Manage distributed infrastructure with open-source technologies across multiple datacenters
· Ensure product SLAs, perform capacity planning, and address critical issues in a 24/7 on-call rotation.
· Explore and implement innovative platforms As a service solution to support and enhance the efficiency of technical SRE teams.
· Utilize data and metrics for decision-making, focusing on security and best practices.
· Prioritize robust automation and scripting to reduce dependence on manual procedures
Who you are:
· Strong understanding of Linux internals, OS fundamentals, and core network principles.
· Basic familiarity with relational databases (PostgreSQL, MySQL) and NoSQL databases (Redis, MongoDB).
· Proficient in container orchestration tools like OpenShift, Kubernetes, Docker Swarm, or Apache Mesos.
· Experienced in administering and troubleshooting configuration management tools such as Puppet, Ansible Tower (AWX), or Chef.
· Hands-on experience in load balancer administration (HAProxy, Nginx, and F5).
· Hands-on experience with caching technologies such as Redis, Nginx+, Varnish, or Memcached.
· Skilled in monitoring and logging stacks such as Grafana, InfluxDB, Graphite, Prometheus, ELK, and Graylog.
· Hands-on experience with web servers like Nginx, Apache, or Tomcat.
· Skilled in at least one scripting language such as Python, Golang.
Any Graduate