Key Responsibilities:
Define and manage a standardized infrastructure platform catalog.
Enforce architectural best practices and design standards across engineering teams.
Design for resiliency and define failure domains to ensure high availability.
Collaborate with core infrastructure engineering teams to develop scalable platform solutions.
Partner with embedded Site Reliability Engineering (SRE) teams to scale platform architecture initiatives.
Required Skillsets:
Strong experience in platform architecture and infrastructure engineering, including:
Host Platforms: Experience with enterprise-scale web hosting and container orchestration platforms (e.g., Kubernetes or similar).
Compute: Proficiency in virtual machines, containers, and serverless computing.
Storage: Familiarity with object and file storage systems.
Databases: Knowledge of SQL, NoSQL, and time-series databases.
Messaging Systems: Experience with distributed messaging platforms (e.g., Kafka, MQ).
Observability: Understanding of logging, metrics, and tracing systems.
Security: Implementation of authentication, secrets management, and audit logging.
Networking: Experience with ingress controllers, service mesh, and DNS configurations
Any Graduate