Site Reliability Engineer (SRE)

Company: Uniting People
Location: London
Job Description:

Site Reliability Engineer (SRE) – Market Risk Platform

London (5 days onsite) | Contract | Banking/Finance/Trading | £450-£500 per day

Overview

We are hiring experienced Site Reliability Engineers (SREs) to support a Market Risk platform within a leading financial services environment.

This is an engineering-led transformation role, focused on automation, reliability, and AI-driven operational improvement rather than BAU support.

Success is measured by:

  • Reduced operational toil
  • Faster recovery (MTTR reduction)
  • Safer, faster change delivery
  • Increased automation and self-service
  • Improved platform reliability

Key Responsibilities

Automation Engineering (Core)

  • Build production-grade Python automation for operational workflows
  • Automate environment checks, dependency validation, reruns, restarts, and drift remediation
  • Deliver self-service tools with proper audit, rollback, and safety controls (idempotency, dry-run, approvals)

Process Re-engineering (Core)

  • Redesign incident, change, release, and recovery processes
  • Convert runbooks into automated workflows
  • Remove manual handoffs and operational friction
  • Define KPIs: toil, MTTR, alert volume, change failure rate

Agentic AI (Core)

  • Build agentic workflows for diagnostics, remediation, and orchestration
  • Implement guardrails, human-in-the-loop controls, and evaluation frameworks
  • Productionise AI automation with monitoring and feedback loops

Observability

  • Improve monitoring, logging, and system visibility to enable automation at scale

Required Skills

  • 8+ years of SRE/production engineering experience
  • Strong Python (automation/tooling focus)
  • Experience with distributed systems in production environments
  • Strong Linux troubleshooting (app/system/network layers)
  • Hybrid infrastructure exposure (on-prem + cloud)
  • Kubernetes experience (ops/monitoring/reruns)
  • Strong background in automation and process optimisation
  • Experience with the Athena ecosystem

Agentic AI (Essential)

  • Proven experience with agentic AI or intelligent automation systems
  • Tool integration, guardrails, evaluation, and measurable production impact (toil/MTTR reduction)

Desirable

  • Banking/Finance/Market Risk experience
  • Familiarity with Athena ecosystem or similar (SecDB, Quartz)
  • Exposure to trading, risk, or regulatory platforms

About the Role

A high-impact SRE role in a Market Risk trading environment, focused on eliminating operational toil through automation, AI, and reliability engineering at scale.

Posted: May 23rd, 2026