• Provide end-to-end support of production applications, ensuring their stability, reliability, and performance.
• Conduct in-depth problem analysis of application, troubleshoot system errors, and performance issues.
• Perform proactive application stability analysis to investigate performance concerns, system errors and improvement opportunities.
• Take ownership of problem root cause analysis and implement appropriate remediation.
• Lead chronic issue investigations to minimize business impact and maintain system health.
• Drive proactive monitoring review and implementation strategies.
• Collaborate with Engineering, Application Development, and Infrastructure teams to implement break fixes, code updates, configuration changes, and production enhancements.
• Handle application management, business continuity, server patching coordination, vulnerabilities remediation, Splunk monitor setup.
• Lead production incident triaging calls.
• Provide production on-call support, including weekend rotation on a round-robin basis.
• Execute disaster recovery procedures and strategies.
• Respond to and resolve production tickets promptly to meet SLA requirements.
• Experience architecting a large-scale production database platform.
• Strong Knowledge of Postgres production and contingency replication feature and configuration
• Strong Knowledge of Postgres HA Clustered environment.
• Strong hands-on experience on failover/migration and data restores in a HA environment.
• Support Database patching and ability to provide continuous support for the Application.
• Proficient in handling crontabs and data backups using pgBackRest.
• Proficient in handling Kubernetes/OpenShift cluster for PostgreSQL
• Creating and maintaining documentation, troubleshooting playbooks, testing failover and recovery plans.
• Perform regular database maintenance tasks.
• Ability to write ansible playbooks.
• PostgreSQL DBA experience in a 24x7 production environment
• Experience in configuring, managing, and troubleshooting PostgreSQL (Postgres) on Linux
• Experience with database backup and recovery, including implementing disaster recovery standards (Postgres)
• Experience with database design, optimization, and tuning (Postgres)
• Experience with implementing database security concepts, including access, auditing, and encryption.
• Strong experience on OpenShift containers, Apache Kafka.
• Create the objects in the database, such as triggers, indexes, etc.
• Monitor the performance of the database and ensure optimum performance.
• Identify potential issues in the database to solve them early.
• Maintain backups and perform disaster recovery in case a disaster destroys the database.
• Monitor security and prevent any unauthorized access to the database.
• Schedule consistent maintenance on the server
• Maintain database schema.
• Manage database availability.
• Give best practice guidance to the development team.
• Resolve any production data issues.
• Tablespace management.
• Role management.
• Ensure data integrity.
• Maintain the database using different utilities.
• Postgres DBA, strong shell script skills, DB replication, ability to create playbooks, and strong SME in Unix System Administration
• Bachelor's degree or related experience
• Proven experience in Database support or similar role, preferably in production environment.
• Availability for on-call support, including weekend rotations, on a round-robin basis.
• 7+ years of experience in production database operations.
• Excellent communication and collaboration skills for effective work across cross-functional teams.
• Familiarity with database platforms such as Oracle or PostgreSQL.
• Knowledge of F5 Load balancer, GTM, high availability architectures and disaster recovery strategies.
• Certification in Red Hat Linux, DevOps methodologies or related fields.
• Knowledge of Ansible Tower, and CICD concepts, GTM and LTM concepts, understanding of F5 load balancing
Bachelor's degree