6-Month Contract (Likely Extension) | £520 per day
3 Days Remote & 2 Days Onsite in London
Overview
We are partnering with a leading Financial Services organisation seeking an experienced DevOps Engineer / Site Reliability Engineer (SRE) with deep OpenShift expertise to take ownership of their enterprise container platform.
This is a critical role focused on designing, building, and operating a secure, scalable, and highly available OpenShift environment supporting business-critical services. The successful candidate will act as a subject matter expert (SME) for OpenShift, while applying DevOps and SRE principles to ensure platform reliability, automation, and operational excellence.
Key Responsibilities
Platform Engineering & SRE Operations
- Own the full OpenShift lifecycle (Day 0, Day 1, Day 2 operations)
- Manage upgrades, patching, and version compatibility across OCP, RHCOS, and Operators
- Ensure platform reliability, performance, and high availability in line with SRE principles
- Define and maintain SLIs, SLOs, and error budgets
Security & Compliance
- Implement and maintain robust security controls (RBAC, SCC, network policies, image governance)
- Ensure adherence to regulatory and compliance standards within a Financial Services environment
- Proactively manage vulnerabilities and CVEs
Automation & Infrastructure as Code
- Develop and maintain automation using Ansible, Terraform/OpenTofu, and GitOps (ArgoCD/OpenShift GitOps)
- Drive Infrastructure as Code and CI/CD best practices across the platform
- Promote self‑service and automation‑first engineering culture
Networking, Storage & Integration
- Oversee storage integration (CSI drivers, provisioning, performance tuning)
- Integrate with enterprise services (LDAP/AD, OIDC, DNS, PKI, monitoring, logging, ServiceNow)
Skills & Experience Required
- 12+ years’ experience in Linux/platform engineering
- 3+ years hands‑on experience with OpenShift/Kubernetes in enterprise environments
- Strong experience in a DevOps Engineer or SRE role supporting production platforms
- Proven track record managing production OpenShift clusters at scale
- Strong expertise in Kubernetes architecture (networking, storage, security, CRDs, controllers)
- Solid Linux administration (RHEL/RHCOS) and scripting skills
- Experience with automation tools (Ansible, Terraform/OpenTofu, GitOps)
- Strong understanding of observability tools (Prometheus, Grafana, logging platforms)
- Experience integrating enterprise identity/security (LDAP, AD, OIDC, certificates)
#J-18808-Ljbffr…
