Seeking a Lead Systems Engineer to support the Systems Monitoring.
Responsible for software tool administration for systems and applications monitoring tools. Expertise with at least one of the Monitoring tools like DataDog and Service Now.
• DataDog Administration experience on Linux platform to instrument Java based applications running on Tomcat Application Server.
• Configuration experience in Infrastructure Monitoring, Network Monitoring and Centralized Logging Or similar Administration experience with ELK Stack – Elasticsearch (search and analytics engine), Logstash (ingest pipeline) and Kibana (visualization and creating dashboards).
• Strong Linux platform (Red Hat) background.
• Automation experience with scripting (Python, Shell, ANSIBLE) preferred.
• Understanding of SSL setup on Linux servers. Installing CA certs etc.
• Experience with Network Monitoring and knowledge on Network components like Switches, Routers, Palo Alto Network utilization SNMP, F5 Load Balancers, WebSeal, Info Blocks, Gigamon, Network Mapping is a plus.
Tasks:
• Manages, configures and maintains the Data Dog tool on Linux platform.
• Responsible for Network Monitoring, Infrastructure/Server Monitoring (Linux, Windows, AIX) using Data Dog, Application, SNMP and Log Monitoring.
• Configure centralized logging of all logs from different sources like WebSphere / Tomcat and Client WebServers on AIX servers to Data Dog on Linux. Knowledge of Load Balancers like F5 to route logs to Log server. Handling different types of Log formats.
• Creates required dashboards with data visualization in Data Dog.
• Responsible for Java Applications' instrumentation with Data Dog, set up health rules and fine tune monitoring in Data Dog.
• Setup End User Monitoring / Browser Real User Monitoring of Data Dog for applications, using Java script injection.
Specific Required Skills:
• 5-8 years strong IT experience and good working knowledge of a variety of technology platforms in a distributed environment including: Microsoft systems (e.g. Windows Server, Active Directory, Exchange, SharePoint), Linux/Unix, VMWare, SQL Server, database architectures, TCP/IP, VPNs, Mainframe, LAN/WAN technologies and architectures
• A minimum of 3 years hands-on experience installing, integrating, managing and maintaining monitoring tools like Data Dog administration and support Or similar Log Management experience with ELK Stack – Elasticsearch (search and analytics engine), Logstash (ingest pipeline), and Kibana (visualization and creating dashboards)
• Experience in writing Shell, Python, Selenium, VuGen scripts
Any Gradute