DevOps Site Reliability Engineer

Ztek Consulting Inc
Pennington, NJ, USA

Description

Responsibilities:

Looking for DevOps Site Reliability Engineer to work with the development team on DevOps needs with the focus of SRE to ensure system reliability.

Required Skill:

System Monitoring and Alerting. Set up, maintain, and improve monitoring systems to proactively identify and resolve issues.

Performance optimization through identifying bottlenecks and implement solutions to optimize speed and efficiency.

Respond to incidents, diagnose problems, and implement fixes to minimize downtime and ensure system stability.

Managing Release and Deployment processes, ensuring the deployments are safe, reliable and do not disrupt services once it is released.

Work closely with development and production support teams to understand their needs, improve the developer/release experience, and ensure smooth releases.

Define and monitor Service Level Objectives (SLOs) to ensure systems meet performance and reliability requirements.

Analyze system usage patterns and capacity requirements to ensure systems can handle traffic and workloads.

Designs, implements, and maintains tools and process for Continuous Integration and Continuous Delivery (CI/CD).

Well versed with BitBucket flow and version control system, pull requests and other version control related concepts.

Experience with CI/CD tools such as Jenkins, JFrog, Ansible Tower, Datical/Liquibase, SonarQube, Artifactory, XLR.

Knowledge of Application Development Security Framework and Vulnerability remediation.

Should be able to adhere and implement to modern CI/CD concepts and proactively suggest automation solutions to improve.

Possess strong problem-solving skills and the ability to diagnose and fix system issues.

Must have basic knowledge of working with Linux.

Able to work with Java/Python programming language.

Individual should be self-motivated, goal oriented with a high degree of accountability.

Excellent interpersonal and communication skills.

Ability to collaborate with vendor partners/programmers to coordinate delivery of software application.

Desired Skill :-

Knowledge of the Open Shift Platform is plus.

Experience with Cloud Architecture and Operations including migration, resilience, maintainability, and cost efficiency.

Experience with at least one large cloud service providers: Azure, AWS, and/or GCP

Key Skills

Ci/cd Java Jenkins Jfrog Ansible Python Azure Aws Gcp

Education

Any Gradute

Apply Now

Back To Jobs

Posted On: Today
Experience: 10+ years of experience
Openings: 1
Category: Site Reliability Engineer
Tenure: Full-Time Position