Real-Time troubleshooting of critical application workflows and incorporate feedback to product development.
Should have good knowledge on splunk , should be able to write queries in splunk and create alerts.
Good knowledge of ITSM framework.
Should be good in analysis , able to find out solutions without much help.
Triage alerts & diagnose/resolve critical issues, manage implementation of changes.
Perform root cause analysis of critical incidents/alerts. Initiate and drive the Techlines in case of outages/major incidents/Batch abends and ensure .Service Restoration in the least time possible.
Act quickly on the application Alerts and Batch Job failures.
Identify manual toil, repetitive issues, and work with stakeholders with improvement plan.
Should have basic experience in .net framework , C# to fix bug
Able to write basic powershell commands.
Knowledge of one or more of Message Brokers such as RabbitMQ, IBM MQ
Knowledge of JIRA, confluence and remedy ticketing systems