We are seeking a highly skilled and experienced Production Support Engineer with expertise in Java development, DevOps, and containerization technologies to support critical applications. This role involves providing 24/7 support for production environments, ensuring smooth and efficient operation of applications, diagnosing and resolving issues, and performing system monitoring and enhancements.
The successful candidate will work closely with cross-functional teams to maintain and improve application performance, optimize APIs, and support containerized applications. Strong knowledge of Unix/Linux scripting, Kafka, PostgreSQL, and MS SQL Server is required, along with hands-on experience with Swagger, Keycloak, and Docker for application deployments.
Key Responsibilities:
• System Support & Maintenance: Provide continuous support for the applications, ensuring optimal performance, security, and compliance with data processing requirements.
• Java Development & Support: Develop, maintain, and optimize Java-based applications to meet evolving system and business needs. Troubleshoot and debug issues in production, ensuring minimal downtime and disruption to operations.
• API Integration: Manage and support API endpoints, ensuring seamless communication between systems. Work with Swagger to document, test, and integrate APIs efficiently.
• DevOps Practices: Leverage DevOps tools and methodologies for continuous integration and continuous deployment (CI/CD). Automate build, test, and deployment pipelines to support application delivery.
• Containerization & Deployment: Utilize Docker for the packaging and deployment of containerized applications. Manage Docker container deployments and ensure smooth operation in various environments.
• Unix Scripting & Linux: Write and maintain Unix scripts to automate system tasks, monitoring, and administration. Ensure system reliability in Linux-based environments, including servers and application configurations.
• Database Management & Support: Support and troubleshoot PostgreSQL and MS SQL Server databases, ensuring data integrity, optimization, and availability.
• Kafka Management: Monitor, manage, and optimize Kafka for real-time data streaming and event processing. Address performance issues and system failures, and ensure reliable message delivery.
• Keycloak: Support and manage Keycloak for identity and access management, ensuring secure and efficient user authentication across applications.
• System Monitoring & Incident Management: Proactively monitor application performance, log files, and infrastructure health using tools like Prometheus, Grafana, or other monitoring solutions. Quickly resolve issues by troubleshooting, analyzing logs, and collaborating with development and operations teams.
• Documentation & Reporting: Ensure comprehensive documentation of deployed systems, including APIs, containerized applications, and infrastructure setups. Provide incident reports, root cause analysis, and post-mortem reports when required.
• Collaboration & Cross-team Support: Collaborate with cross-functional teams, including business stakeholders, developers, and infrastructure teams, to resolve production issues, deploy new releases, and ensure application reliability.
• Continuous Improvement: Engage in performance tuning and optimization of Java applications and database queries, contributing to overall system performance and scalability improvements.
Key Qualifications:
• Bachelor’s Degree in Computer Science, Information Technology, Engineering, or a related field.
• 4+ years of experience in production support, with expertise in managing both on-premises and cloud-based systems.
• Hands-on experience with DevOps tools such as Jenkins, Git, and Kubernetes to support continuous integration, deployment, and automation processes.
• Strong proficiency in Java development, including troubleshooting, performance tuning, and optimizing Java-based applications in a production environment.
• Experience with API management and integration, particularly with Swagger for API documentation and testing, as well as Kafka for real-time messaging and event streaming.
• Familiarity with Unix/Linux scripting and administration, with a focus on automating tasks, monitoring system performance, and ensuring system reliability.
• Database management skills with hands-on experience in PostgreSQL and MS SQL Server for data troubleshooting, query optimization, and database support.
• Solid understanding of Linux Operating Systems, including system administration, shell scripting, and performance monitoring.
• Excellent troubleshooting and root cause analysis skills, with the ability to quickly identify, resolve, and document production issues.
• Experience with application monitoring and logging (e.g., using Splunk or ELK stack) to diagnose issues and ensure production systems are running efficiently.
• Experience with Agile methodologies and working in a collaborative environment.