Job Responsibilities:
Have experience of these:
- Jenkins
- VMWare
- Python
- Docker
- Kubernetes
Requirements
- Experience w/ GCP is a plus.
- A self-starter, can problem solve and seek out info/answers independently,
- Good communication skills and attention to detail.
- Problem solving with a willingness to learn,
- Good at vendor communications and ability to drive working sessions,
- Good knowledge and experience on Kubernetes (usage of kubectl commands and able to troubleshoot/fix the issues)
- Platform support: Triage of non-prod/prod systems (for prod like L4 level) and understanding of analysis and how to drive toward resolution while participating in large group discussions,
- SPLUNK / New Relic/ monitoring tools: Good understanding including query & dashboard creation and modification,
- Capable of analyzing infrastructure and applications issues.
- Kubernetes cluster management
- Identify, prepare, execute mitigation plans,
- Perform non-prod deployments either manually or through automation,
- Working knowledge of Ansible and / or any other auto deployment tools,
- Perform required deployment verifications after application or services post deployments,
- At least 5 years’ experience in supporting; JAVA Application / services hosted in Linux environments.
- GCP, Anthos, & APIGEE
- Capable of taking deep dives in code to identify possible fixes for platform/services related issues,
- Work as a contributing team member together with other team members in other states and countries,
- Oncall rotation duties,
- Contribute and participate in team knowledge transfer
Primary Focus Skillset
Linux
- Ability to run basic linux command to troubleshooting application bug, network and other application related activities
- Perform patching and maintain the baremetal servers/VMs.
- Install OS in VM /create VM template to create new servers.
- Maintain relation with vendor / submit SR to get vendor involve in resolving issue
Middleware application
- Knowledge on managing webservers like JBOSS EAP or tomcat
- Installation/configuration/troubleshooting activities on JBOSS EAP/Wildfly
- Knowledge on messaging tool and technology
- Installation/configuration/troubleshooting on Artemis Messaging Queues or similar platform.
Kubernetes:
- Ability to perform cluster level administration on K8s platform.
- Creating and maintaining scripts to maintain, monitor and alerts on K8s platform.
- Comfortable with kubectl / YAML
Docker:
- Understanding of docker.
- Good experience in writing docker files.
- Creating images, maintaining docker registry.
Ansible:
- Playbook creation for repeatable tasks
- Perform installation of software (platform and code deployment)
- Take documentation and create roles to install software.
- Create reusable roles and playbooks
- CloudFormation and Terraform
Monitoring Tools:
- NewRelic - APM, Insights, Infrastructure
- Ability to create alerts and dashboards.
- Splunk - Querying and dashboard creation
Application Performance Tuning:
- Participate in load testing and resolve testing bottlenecks.
- Java heap and thread dump analysis
Jenkins:
- knowledge and understanding of Jenkins Pipelines
- Ability to analyze console job log for errors.
OS Support and Troubleshooting:
- Redhat Enterprise Linux
- Amazon Linux
- Alpine Linux
- Windows 2012/2016