Short role description (click “Apply Here” to see full listing):
Responsibilities:
- As a Staff SRE, you’ll contribute to influence and shape both the strategy and implementation of our evolving observability capabilities across the Birdie system; you’ll leverage OpenTelemetry and SRE practices like SLOs, to support squads in proactively identifying issues before they impact customers.
- You’ll play a central role in our Incident Management and On-Call “experience”, building automations and driving practices that unify critical system operations and make OOH support run smoothly.
- You’ll act as a Tech Lead for Disaster Recovery and support Platform and Product in defining and executing targeted improvements that cross-functionally achieve RPO and RTO targets.
- You’ll be a key part of our “shift-left” DevOps success, whether it’s security best-practices, CI/CD, solid production considerations or just leveraging AWS to its fullest – you’ll be at the forefront of our non-functional strategies.
- You’ll be working in an embedded model, acting as an expert on short-term projects with a product squad providing hands-on contributions with their code, pipelines, and configurations; along with working with your Platform colleagues in better maintaining infrastructure or improving developer tools.
#J-18808-Ljbffr…