Job Description:
We are seeking a Senior Azure AI/ML Platform Engineer with hands-on experience in Azure OpenAI, Infrastructure as Code (IAC), and Azure networking. This role focuses on delivering secure and compliant AI/ML solutions within Azure for enterprise data services. The ideal candidate will have extensive experience working in financial services, with a deep understanding of security and regulatory requirements.
You will collaborate with cloud engineering, security, and risk management teams to implement AI/ML services and enable automation through CI/CD pipelines. Strong problem-solving skills and a passion for innovation are essential.
Key Responsibilities:
AI/ML Development & Automation:
-
- Design, implement, and automate pipelines for training and managing AI/ML models (AIOps/MLOps).
- Leverage Azure OpenAI services to develop and deploy solutions aligned with business objectives.
- Develop APIs and Webhooks using Python, Git Actions, and Terraform.
Cloud Infrastructure & Automation:
-
- Utilize Infrastructure as Code (IAC) to provision and manage cloud infrastructure.
- Create and manage Azure Resource Manager (ARM) and Terraform templates.
- Automate cloud services deployment using native Azure CLI and CI/CD pipelines.
Security & Compliance:
-
- Ensure secure data handling and model training in compliance with internal security policies.
- Implement Azure security features, including data protection, RBAC, and authentication.
- Manage Public Key Infrastructure (PKI) and certificates for Azure services.
Azure Networking & Troubleshooting:
-
- Troubleshoot Azure connectivity, DNS, and network configurations (NSG, routing).
- Collaborate with network teams to optimize and maintain secure Azure networking solutions.
Collaboration & Agile Delivery:
-
- Work closely with development teams to integrate AI/ML services into enterprise applications.
- Participate in Agile teams and contribute to DevOps practices within a Scrum framework.
- Continuously improve automation frameworks and tools for self-service AI/ML capabilities.
Mandatory Requirements:
AI/ML Expertise:
-
- 2+ years of hands-on experience with AI/ML platforms in Azure, including Azure OpenAI services.
- Strong understanding of deep learning and natural language models.
Cloud Automation & IAC:
-
- 2+ years of experience developing platform orchestration code using Azure Python SDK, Terraform, and GitHub Runners.
- Expertise in Infrastructure as Code (IAC) and CI/CD pipelines with Git Actions.
Azure Networking & Security:
-
- Proficiency in Azure networking, including DNS, NSG, routing, and connectivity troubleshooting.
- Familiarity with Azure security features and public/private key management (PKI).
Critical Thinking & Problem Solving:
-
- Strong analytical, research, and troubleshooting skills.
- Self-starter with the ability to work independently and collaboratively in a fast-paced environment.