Site Reliability Engineer

Company: Spendesk
Apply for the Site Reliability Engineer
Location: London
Job Description:

About the Team

The Infrastructure team at Spendesk builds the tools, systems, and internal products that empower every engineering team to move faster and more safely. We transform traditional infrastructure into a developer-facing platform focused on enablement, automation, and scalability. Our mission is to simplify complexity, automate manual toil, and provide a platform that drives innovation. We own the CI/CD platform (ArgoCD and GitHub Actions), secrets management, observability tooling, infrastructure provisioning, and developer workflows, foundational to the productivity and reliability of the entire product organization.

About the Role

You will join a dynamic and friendly team working on the next generation of internal tooling and infrastructure at Spendesk. You will be hands‑on with Kubernetes, Go, and AWS to create scalable, reliable, and developer-friendly systems that power our internal platform.

Responsibilities

  • Build and maintain core infrastructure services including provisioning workflows, service orchestration, and deployment automation using Kubernetes, Go, and cloud-native tools.
  • Support and improve our CI/CD platform and Infrastructure Service Orchestrator, enabling product teams to spin up production-ready services in minutes with consistent tooling, observability, and security baked in.
  • Partner closely with product engineers and EMs to identify recurring friction in development workflows and turn those into reusable, documented platform solutions across key domains (secrets management, IaC, monitoring, cost visibility).
  • Improve our observability and incident response workflows, building custom dashboards, refining alerting systems, and supporting internal adoption through clear documentation and onboarding guides.
  • Support the migration of legacy infrastructure, bringing the you build it, you own it mindset to our product squads while maintaining a high level of security and permission controls.
  • Participate in our regular on-call rotation (off-hours) and firefighting (during working hours), leveraging lessons learned to continuously reduce toil through automation, self-healing systems, and smarter alert routing.

Qualifications

  • 3-5 years of engineering experience in DevOps, Infrastructure, or Platform Engineering roles.
  • Hands-on experience with Kubernetes in production, including debugging containerized services at scale.
  • Proficiency in Bash, Python, or Go for building reliable tooling and automation.
  • Experience with AWS and Infrastructure-as-Code (Terraform or similar).
  • A bias for automation: you replace playbooks with code and eliminate toil wherever possible.
  • Strong communication skills: clear documentation and async updates are second nature to you.
  • Fluency in English (spoken and written).

Nice to Have

  • Experience with CI/CD platforms (ArgoCD, GitHub Actions) or observability stacks (Grafana, Prometheus, Datadog).
  • Familiarity with SLOs, incident management, and on-call practices.
  • Background in regulated environments (fintech, GDPR).
  • Contributions to open-source tooling.

Benefits

  • Flexible on-site and remote policy.
  • Latest Apple equipment – the tools you need to excel.
  • Access to Moka.care – for emotional and mental health wellbeing.
  • Great office snacks – to fuel your day.
  • A positive team to work with daily.
  • Location-specific benefits tailored to each market, including health insurance, wellness allowances, commuter support, meal vouchers, and gym memberships.

Diversity & Inclusion

At Spendesk, we’re committed to fostering an environment where all differences are encouraged, supported and celebrated. We’re building our culture for everyone, with everyone. Our goal is to attract and build a diverse, equal and inclusive team, where everyone feels welcome and we truly embrace and encourage people from all backgrounds to apply.

#J-18808-Ljbffr…

Posted: May 31st, 2026