Responsibilities
- Own the design, deployment, and day‑to‑day operation of OpenStack and Kubernetes clusters optimised for GPU workloads.
- Build and maintain infrastructure automation using Infrastructure-as-Code (IaC) and GitOps practices.
- Enable reliable GPU workload scheduling through Kubernetes‑native tooling and NVIDIA integrations.
- Ensure high availability and resilience through effective monitoring, logging, and incident response.
- Implement strong security controls, including RBAC and network policies, to ensure tenant isolation across cloud layers.
- Collaborate with DevOps, AI, and Product teams to align infrastructure capabilities with customer needs.
Skills / Must have
- OpenStack: Significant hands‑on experience operating OpenStack in production environments.
- Kubernetes: Strong experience running production‑grade Kubernetes clusters, ideally on bare‑metal or private cloud setups.
- Fundamentals: Solid grounding in Linux, networking, and storage with a practical approach to troubleshooting.
- Automation: Experience with infrastructure automation, CI/CD, and Git‑based workflows.
- Mindset: Ability to thrive in a fast‑moving environment with a strong sense of accountability for outcomes.
Benefits
- 10% performance-related bonus
Salary
- £130,000
