Company: identifi Global Resources

Apply for the Site Reliability Engineer

Location: Sheffield

Job Description:

Site Reliability / Resilience Engineer

Check all associated application documentation thoroughly before clicking on the apply button at the bottom of this description.

Hybrid: Sheffield, UK(2-3days)

Contract: 6 months rolling contract

Rate: £600/day(Inside IR35) via Umb

Role Overview

We are seeking a Site Reliability / Resilience Engineer to support a large-scale, enterprise technology environment. This role focuses on improving the reliability, availability, and resilience of critical services across complex, distributed systems.

You will work across cloud, infrastructure, and application ecosystems, helping ensure services are observable, recoverable, and aligned with both engineering best practices and regulatory resilience requirements.

Key Responsibilities

Support reliability and resilience across cloud platforms (AWS, Azure, GCP)
Work across infrastructure, networks, data centres, and application platforms
Analyse and map service dependencies and critical service chains
Contribute to the design and implementation of resilience and recovery strategies (RTO/RPO, failover patterns)
Support vulnerability identification and risk reduction activities
Enhance observability, monitoring, and resilience tooling across services
Ensure alignment with UK Operational Resilience Policy Framework (PRA/FCA/Bank of England)
Support ITIL-aligned processes, including incident, change, and release management
Drive improvements in service stability, reliability, and performance

Skills & Experience

Strong experience across enterprise technology environments:
Cloud platforms (AWS, Azure, GCP)
Infrastructure, networking, and data centres
Application platforms and integration layers
Strong understanding xwzovoh of:
Service chain and dependency mapping
Vulnerability and risk management
Recovery models (RTO/RPO) and resilience patterns
ITIL-based service management practices
Experience with enterprise tooling such as ServiceNow
Exposure to observability or monitoring platforms (beneficial but not essential)
Familiarity with UK Operational Resilience frameworks (PRA/FCA/Bank of England)

This is a strong opportunity for someone who combines Site Reliability Engineering principles with a focus on operational resilience, observability, and large-scale enterprise systems.

…

Posted: May 2nd, 2026