Description

Key Responsibilities:

- Execute validation steps for dashboards and alerts based on provided runbooks and checklists.

- Assist in running and verifying migration scripts for dashboards, alerts, or telemetry pipelines.

- Follow standard procedures for validating PromQL queries, including their metrics, labels, and expressions.

- Report issues, anomalies, or missing configurations to the Observability team.

- Help end users troubleshoot metric queries, dashboard widgets, and alerts.

Preferred Skills & Experience:

- Familiarity with monitoring tools such as Grafana, Prometheus, and OpenTelemetry.

- Basic understanding of observability concepts: dashboards, alerts, metrics, and logs.

- Some experience running or modifying Python scripts, including editing JSON/YAML files and making API calls.

- Attention to detail and strong organizational habits.

Nice to Have:

- Experience with Datadog and OpenTelemetry.

 

Education

Any Graduate