Description

Job Description:

  • Design, implement, and manage observability solutions using DataDog (Logs, Metrics, APM, RUM, Synthetics, etc.)
  • Develop real-time dashboards and alerts to monitor critical infrastructure and application health
  • Collaborate with development, SRE, and DevOps teams to identify key metrics and create actionable observability strategies
  • Optimize existing monitoring setups and identify gaps in visibility
  • Integrate DataDog with various tools such as Teams, AWS/GCP/Azure, Kubernetes, and CI/CD pipelines
  • Build automation for alert tuning and anomaly detection
  • Create and maintain Synthetic tests
  • Provide training and documentation to teams to maximize value from DataDog
  • Contribute to incident response and post-mortem analysis using observability insights

Education

Any Graduate