Description

  • Bachelor's degree in Computer Science, Software Engineering, DevOps, or a related technical discipline.
  • 5-8+ years of progressive experience in a hands-on technical role, with a significant focus on observability, monitoring, or DevOps.
  • 3-5 years of dedicated experience designing, implementing, and managing observability solutions in production environments.
  • Proven track record of architecting and delivering scalable and resilient observability platforms.
  • Extensive experience with incident response and post-mortem analysis.
  • Expert-level understanding of Observability Principles: Deep knowledge of the "three pillars" (metrics, logs, traces), distributed tracing, event correlation, and their application in complex systems.
  • Deep Hands-on Expertise with Observability Tools: Proven proficiency in deploying, configuring, and optimizing multiple leading observability platforms (e.g., Prometheus/Grafana, ELK Stack, Jaeger/OpenTelemetry).
  • Cloud-Native & Distributed Systems Expertise: In-depth understanding and hands-on experience with cloud platforms (Azure), containerization (Docker, Kubernetes), service mesh, and microservices architectures.
  • Infrastructure as Code (IaC): Proficient in using tools like Terraform for automating infrastructure provisioning and configuration related to observability.
  • Linux System Administration & Networking: Strong grasp of Linux operating systems, networking protocols, and system-level troubleshooting.
  • Database Knowledge: Familiarity with time-series databases (e.g., Prometheus, InfluxDB) and other relevant data stores for observability data.
  • Troubleshooting & Root Cause Analysis: Exceptional analytical and problem-solving skills, with a systematic approach to diagnosing complex technical issues.
  • Relevant industry certifications in cloud platforms, Kubernetes, or specific observability tools are highly valued.
  • A strong command of English, both spoken and written, is mandatory.

 

Education

Bachelor's degree in Computer Science, Software Engineering, DevOps, or a related technical discipline