Bachelor's degree in Computer Science, Software Engineering, DevOps, or a related technical discipline.
5-8+ years of progressive experience in a hands-on technical role, with a significant focus on observability, monitoring, or DevOps.
3-5 years of dedicated experience designing, implementing, and managing observability solutions in production environments.
Proven track record of architecting and delivering scalable and resilient observability platforms.
Extensive experience with incident response and post-mortem analysis.
Expert-level understanding of Observability Principles: Deep knowledge of the "three pillars" (metrics, logs, traces), distributed tracing, event correlation, and their application in complex systems.
Deep Hands-on Expertise with Observability Tools: Proven proficiency in deploying, configuring, and optimizing multiple leading observability platforms (e.g., Prometheus/Grafana, ELK Stack, Jaeger/OpenTelemetry).
Cloud-Native & Distributed Systems Expertise: In-depth understanding and hands-on experience with cloud platforms (Azure), containerization (Docker, Kubernetes), service mesh, and microservices architectures.
Infrastructure as Code (IaC): Proficient in using tools like Terraform for automating infrastructure provisioning and configuration related to observability.
Linux System Administration & Networking: Strong grasp of Linux operating systems, networking protocols, and system-level troubleshooting.
Database Knowledge: Familiarity with time-series databases (e.g., Prometheus, InfluxDB) and other relevant data stores for observability data.
Troubleshooting & Root Cause Analysis: Exceptional analytical and problem-solving skills, with a systematic approach to diagnosing complex technical issues.
Relevant industry certifications in cloud platforms, Kubernetes, or specific observability tools are highly valued.
A strong command of English, both spoken and written, is mandatory.