Site Reliability Engineer (SRE)
London (Hybrid – 2-8 days per month in office) • £50,000 per annum
Clear progression to Mid-Level SRE within 18 months
We’re working with a growing, engineering‑led organisation looking to hire a Site Reliability Engineer who enjoys solving real platform problems through automation‑not just firefighting tickets.
The Opportunity
This is an ideal opportunity for someone with 2-3 years of DevOps, Platform Engineering or SRE experience who wants to take ownership of CI/CD, infrastructure‑as‑code, and platform tooling while continuing to build production‑grade coding skills. This role blends hands‑on engineering with platform ownership. You’ll spend your time split between:
- Supporting developers with broken builds and deployments (40%)
- Designing and building automation, CI/CD pipelines, and Terraform infrastructure (60%)
You’ll act as the automation backbone of the platform–reducing manual effort, improving reliability, and enabling engineering teams to move faster.
Key Responsibilities
Developer Support & Troubleshooting (40%)
- Debug failing builds, deployments, and CI/CD pipelines
- Provide Tier 2/3 support via Slack, tickets, and pairing sessions
- Take ownership of incidents, ensuring reliable and timely resolution
Platform Engineering & Automation (60%)
- Design, build, and optimise CI/CD pipelines (GitHub Actions, Jenkins, GitLab CI)
- Develop and maintain Terraform modules for infrastructure‑as‑code
- Build automation tools (CLI tools, scripts, GitHub Apps, self‑service tooling)
- Own observability: dashboards, alerts, monitoring, and runbooks
- Continuously improve platform processes and reduce operational toil
What We’re Looking For
Essential Skills & Experience
- 2-3 years in DevOps, SRE, or Platform Engineering
- Strong Linux troubleshooting and systems knowledge
- Proven experience with Terraform (module design, not just usage)
- CI/CD experience (GitHub Actions, GitLab CI, Jenkins)
- Ability to write production‑quality code in Python or Bash
- Solid networking fundamentals (DNS, load balancers, CDNs)
- Experience with observability tools (NewRelic, Datadog, Prometheus, Grafana)
- Comfortable participating in on‑call rotations
- Experience using AI tools (e.g. ChatGPT, Copilot, Cursor) to enhance productivity
Desirable
- Go, Ansible, or configuration management experience
- Experience working with multiple CDNs (CloudFront, Fastly, Cloudflare)
About You
- You’re a proactive problem‑solver who automates rather than repeats
- You communicate clearly with both technical and non-technical stakeholders
- You stay calm under pressure and take ownership during incidents
- You care about clean, maintainable, production‑quality code
- You actively use AI tools to improve how you build and debug systems
What’s On Offer
- £50,000 salary
- Genuine ownership of CI/CD and platform automation
- Direct collaboration with the Head of Technology
- Clear progression to mid-level SRE within 18 months
- Learning budget and dedicated development time
Why Apply?
This is not a ticket‑driven support role. You’ll be a key technical contributor shaping how the platform operates–working alongside engineers who code and influencing real infrastructure and tooling decisions.
#J-18808-Ljbffr…
