You will design, operate, and secure Azure-based, Kubernetes-centric infrastructure that underpins real-time, model-driven services.
Client Details
This opportunity is with a large organisation in the financial services industry.
Description
• Design, build, and manage production infrastructure on Microsoft Azure with Kubernetes as the core orchestration platform.
• Operate, monitor, and scale Kubernetes-based services, leading incident response and reliability improvements.
• Partner with algorithm and application teams to deploy and run model-serving and inference workloads in production.
• Build and refine CI/CD pipelines (e.g. GitHub Actions, Azure DevOps, GitLab CI) to enable fast, reliable releases.
• Champion infrastructure-as-code, DevOps best practices, and enhanced observability across the engineering stack.
Job Offer
• End-to-end ownership of a high-impact Azure/Kubernetes platform, with direct influence over architecture, tooling, and DevOps practices.
• Cutting-edge exposure to AI/ML and potentially LLM and GPU-based workloads, working closely with algorithm and product teams in real production environments.
• Significant experience as a DevOps, Platform, or SRE Engineer supporting large-scale production systems.
• Deep hands-on expertise with Azure services, Kubernetes, and observability tooling for distributed systems.
• Proven track record building and maintaining CI/CD pipelines and automating infrastructure through code.
• Comfortable collaborating with software and algorithm teams on model-related or AI-driven services; LLM or GPU