Must have: Terraform, Monitoring tools, AWS
Key Competencies -
Batch Monitoring & Scheduling - Proficient in tools like Autosys, Control-M, or TWS for monitoring and scheduling jobs.
Incident Management - Experience with ticketing tools (e.g., ServiceNow), understanding SLAs, triage processes.
Root Cause Analysis (RCA) - Ability to perform RCA on failed batch jobs, using logs and monitoring tools.
Change Management - Familiarity with change control processes, code promotion, and rollback strategies.
Shift Management - Working in 24x7 environments, including handoffs and escalation processes.
Communication Skills - Clear written/verbal communication with stakeholders and L2/L3 teams.
Documentation & Reporting -Creating SOPs, job runbooks, incident postmortems, and performance reports.
Focus on inefficiencies - Ability to identify bottleneck, degradations, failures, and work on tactical, strategic fixes
Focus on Efficiency, Automation, AI and New Tech - Ability to learn new Tech with interest, thinking on making operate space simpler and predictive, looking at automation opportunities and automating, Use Gen AI to build operating efficiencies and measure.
Required skillsets (but not limited to):
Informatica
Hadoop (Cloudera/HDP)
Teradata
Snowflake
SQL and Data Manipulation
SSIS
Mainframe:
Scripting & Automation
Scheduling and Orchestration Tools
Data Quality and Governance Tools
Cloud Platforms
Any Graduate