Site Reliability Engineer

Company: Halian | Managed Services, Recruitment Agency & Contract Staffing
Apply for the Site Reliability Engineer
Location:
Job Description:

Senior Site Reliability Engineer (UK or Germany) – Fully Remote – £120k + Benefits

We are hiring experienced Senior Site Reliability Engineers to join a global engineering team supporting a high‑availability, Java‑based platform used by customers worldwide.

This is a permanent, fully remote role open to candidates based in the UK or Germany, offering a competitive package of ~£120k + benefits.

If you are a true SRE (not DevOps-focused) who cares deeply about reliability, stability, incident response, and performance at scale, we want to speak with you.

What You’ll Do

Ensure high availability, scalability, reliability, and security across production environments

Lead live incident response, drive root‑cause analysis, and deliver lasting solutions

Build and maintain SLIs, SLOs, and SLAs

Support a core Java product: patching, SDKs, configuration (YAML), and uptime work

Drive automation using Python, Linux tooling, and IaC

Work closely with security, compliance, and multiple engineering teams

Participate in a 24/7 on‑call rotation (1 week every 4–5 weeks)

Tech Stack & Skills

AWS: EC2, EKS, Load Balancers, VPC — with hands‑on production experience

Linux: Deep troubleshooting & sysadmin fundamentals

Python: Scripting for automation

SRE mindset: Incident management, observability, reliability engineering principle

We’re Looking For

Senior‑level SREs with proven experience running large‑scale, mission‑critical systems

Engineers who love digging into incidents, solving problems properly, and improving systems over time

Professionals who thrive in autonomous, globally distributed teams.

Posted: April 1st, 2026