• Design, deploy, configure, and manage the Dynatrace platform to monitor applications, services, servers, networks, and cloud resources.
• Define monitoring strategies, custom metrics, SLOs/SLIs, synthetic tests, and distributed tracing within Dynatrace.
• Develop custom dashboards, anomaly detection, problem detection rules, and service flow visualizations.
• Integrate Dynatrace with ITSM tools (ServiceNow, Jira) and CI/CD pipelines for proactive monitoring.
• Analyze telemetry data to identify performance bottlenecks, availability risks, and system anomalies.
• Lead observability reviews and recommend enhancements to improve system monitoring, alerting, and self-healing capabilities.
• Advocate Dynatrace best practices across development and operations teams (OneAgent deployment, tagging, smartscape, etc.).
• Enable automatic and dynamic baseline creation for anomaly detection.
• Work with cloud (AWS, Azure, GCP) and containerized environments (Kubernetes, OpenShift) to implement cloud-native monitoring.
• Produce regular reports on system health, performance KPIs, and service level adherence.
• Participate in incident response activities and postmortem analyses using Dynatrace-provided insights.
Must Have Technical/Functional Skills
• 3–5+ years hands-on experience with Dynatrace (SaaS or Managed) as a primary monitoring platform.
• Deep understanding of observability pillars: metrics, logs, traces.
• Strong hands-on experience with cloud platforms (AWS, Azure, or GCP) and hybrid environments.
• Expertise in deploying and managing Dynatrace OneAgent, ActiveGate, RUM (Real User Monitoring), and Synthetic Monitoring.
• Familiarity with OpenTelemetry concepts and other observability standards.
• Strong troubleshooting skills in distributed systems, microservices architectures, and containerized workloads (Kubernetes).
• Proficiency with infrastructure-as-code (Terraform) and automation scripting (Python, Shell).
• Good knowledge of ITSM/incident management tools integration
Any Gradute