Senior Site Reliability Engineer

Company: Lyrebird Health
Job Description:

The Role

We’re hiring a Senior SRE to own the reliability, scalability, and performance of our production systems as we continue to grow.

At Lyrebird, you won’t just respond to incidents. You’ll design the systems and standards that prevent them. That means building infrastructure that scales cleanly, creating deployment patterns that reduce risk, and ensuring we can detect and resolve issues before they impact users.

This is a broad role that sits across platform engineering, DevOps, and security. You’ll be responsible for ensuring our systems are resilient under load, observable in real time, and able to scale as usage increases.

You’ll play a key role in how we get code from a developer’s machine into production safely, and how we operate those systems once they’re live.

About Us

Lyrebird Health builds AI-powered tools that reduce the administrative burden on clinicians and improve the quality and accessibility of healthcare.

Our platform is used by thousands of clinicians across multiple markets. As we grow, we’re focused on building systems that are reliable, scalable, and trusted in high-stakes environments.

What you’ll do

  • Keep production systems online and restore them quickly when they fail
  • Lead and manage incidents, making high-quality decisions under pressure
  • Design and implement scalable infrastructure and deployment patterns
  • Build and improve CI/CD pipelines and release systems
  • Improve monitoring, telemetry, and observability across the stack
  • Own cloud infrastructure, security, and access controls
  • Work closely with engineers to ensure systems are built to scale from day one

What you’ll bring

  • 5-7 years of experience in SRE, platform engineering, or DevOps roles
  • Strong AWS experience (ECS/Fargate, EC2, Lambda, SQS, IAM)
  • Experience running and scaling production systems
  • Strong understanding of distributed systems and scaling approaches
  • Hands‑on experience with Docker and containerised environments
  • Experience with Kubernetes or ECS

How You Work

  • You take ownership and follow things through
  • You’re proactive and comfortable operating with ambiguity
  • You stay calm and make good decisions during incidents
  • You focus on solving problems end to end
  • You’re willing to roll up your sleeves and get into the detail

This is a critical hire for us as we scale. If you want real ownership over how systems are designed, deployed, and operated, and the opportunity to build reliability into a product used in high‑stakes environments, we’d love to hear from you.

We’re building a team that reflects the diversity of the people who use our product. If you’re from an underrepresented background in tech, we strongly encourage you to apply, even if you don’t meet every requirement.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are made by humans. If you would like more information about how your data is processed, please contact us.


Posted: May 1st, 2026