Staff Site Reliability Engineer

Company: The BAE HQ Ltd
Location: London
Job Description:


Responsibilities:

  • As a Staff SRE, you’ll help shape both the strategy and the implementation of our evolving observability capabilities across the Birdie system. You’ll leverage OpenTelemetry and SRE practices such as SLOs to help squads identify issues proactively, before they impact customers.
  • You’ll play a central role in our Incident Management and On-Call “experience”, building automations and driving practices that unify critical system operations and make out-of-hours (OOH) support run smoothly.
  • You’ll act as a Tech Lead for Disaster Recovery, supporting Platform and Product in defining and executing targeted, cross-functional improvements that meet our RPO and RTO targets.
  • You’ll be a key part of our “shift-left” DevOps success: whether it’s security best practices, CI/CD, solid production considerations, or just leveraging AWS to its fullest, you’ll be at the forefront of our non-functional strategies.
  • You’ll work in an embedded model, acting as an expert on short-term projects with a product squad and contributing hands-on to their code, pipelines, and configurations. You’ll also work with your Platform colleagues to better maintain infrastructure and improve developer tools.


Posted: February 4th, 2025