Description

Key Responsibilities:

  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Improve reliability, quality, and time-to-market of our suite of software solutions
  • Measure and optimize system performance to push our capabilities forward, get ahead of customer needs, and innovate to improve continually
  • Collaborate with frontend and backend teams to identify and resolve issues across the stack.
  • Build tools and automation to improve observability, deployment, and incident response.
  • Drive reliability best practices across CI/CD pipelines, infrastructure, and application code.

Qualifications:

  • 5+ years of experience in software engineering or SRE roles.
  • 5+ years of experience with observability and monitoring tools (e.g., New Relic, LogRocket, Datadog, etc.)
  • 5+ years of experience with frontend frameworks (React, Angular, Mobile, React Native, Android, iOS)
  • 5+ years of experience with backend frameworks (Node.js, Spring Framework, etc.)
  • 5+ years of experience with database design (SQL and NoSQL)
  • 5+ years of experience with cloud services (Serverless Functions, Blob Storage, Virtual Machines)
  • Familiarity with cloud platforms (AWS or Azure) and containerized environments (Docker, Kubernetes).
  • Solid understanding of CI/CD, incident management, and system design.
  • A passion for reliability, performance, and delivering great user experiences.
  • Bachelor’s Degree or higher in Computer Science/Engineering/Math, or relevant experience

Education

Bachelor’s Degree or higher in Computer Science