Site Reliability Engineer

Company: La Fosse
Apply for the Site Reliability Engineer
Location: Greater London
Job Description:

£75,000-85,000 basic + bonus + benefits

A global payments and FX platform is undergoing a major technology transformation, moving from traditional infrastructure towards cloud-native, highly automated, reliability-first engineering.

They’re looking for a Senior Site Reliability Engineer to play a key role in modernising platforms, improving operational resilience, and embedding strong observability and SRE practices across the organisation.

This is a chance to shape the platform direction, not just maintain existing systems.

What you’ll be doing

  • Defining SLOs, SLIs and error budgets for critical services
  • Building and improving observability frameworks (metrics, logs, traces)
  • Leading incident response and post-incident reviews for high-impact events
  • Driving automation, self-healing systems and reliability improvements
  • Leading disaster recovery testing and resilience engineering initiatives
  • Helping modernise legacy infrastructure into cloud-native Azure platforms

Tech environment

  • Terraform / Infrastructure as Code

What they’re looking for

  • Strong background in SRE / Platform Engineering
  • Experience owning production reliability for high-availability systems
  • Deep knowledge of Azure infrastructure
  • Experience with observability, incident management, and automation
  • Comfortable working across engineering, product, and business teams
  • Join during a major cloud and platform transformation
  • Work on high-scale financial systems and payments infrastructure
  • Influence platform reliability and engineering standards

#J-18808-Ljbffr…

Posted: March 17th, 2026