Job Description:
Experience:
- 5+ years hands-on admin experience at the platform and application tiers supporting critical Customer Facing applications preferably in the Financial Services Industry.
- 5+ years of experience troubleshooting environments across the entire architecture (i.e., applications to infrastructure).
- 3+ years of hands-on Linux administration experience.
- 3+ experience with Oracle SQL, MongoDB, Redis, Kafka, Flink, Postgres, or similar data technologies.
- 1+ Years supporting and monitoring service load balancing architectures including F5 & VMware AVI.
Hard Skills:
- Site Reliability Engineer (SRE) Skills – Ability to apply candidates expert troubleshooting and optimization skills up and down the full stack, ensuring that critical. applications/services are engineered for scalability, availability & resiliency including graceful degradation of service, fault isolation and quick recovery to minimize customer impact.
- Provide highly advanced technical expertise to maximize efficiency, reliability and value from current solutions, infrastructure, platforms and emerging technologies, showing technical leadership, and driving continuous improvement efforts.
- Ability to identify root-cause issues, articulate corrective actions and improvement opportunities, and design approaches/programs/products to improve overall quality assurance.
- Strong knowledge of Observability/Monitoring tools & their application (i.e., Glassbox, Dynatrace, AppDynamics, Client, BigPanda AIOps, etc.).
- Intermediate/expert level ability to use automation and configuration management tools for provisioning using Puppet, Ansible, Terraform, Chef, Jenkins, GitLab and Liquibase.
- Functional knowledge of programming scripting such as JavaScript, PowerShell, Python, Bash, SQL, .NET, Java, PHP, Ruby, PERL, C++, R, etc.
- Cloud Architect or Engineer Certification (i.e. GCP, Azure, AWS, etc.) – Preferably GCP ACE.
Soft Skills:
- Collaborate well within a Global Platform Engineering Team being technically competent & confident but not arrogant.
- Ability and willingness to quickly learn new technologies & tools and can effectively train candidates peers.
- Ability to adapt to a rapidly changing environment.
- Ability to communicate highly complex technical information clearly and articulately for all levels and audiences.
Job Expectations:
- Flexibility to work in a 7x12 support environment, including weekends and holidays within a Team on-call engineer rotation schedule