Staff / Principal Site Reliability Engineer
ViVA Tech Talent are working with a high-growth, venture-backed technology company building a next-generation cloud platform focused on real-time system behaviour and production intelligence.
Already trusted by major global brands and operating at serious scale, they’re tackling complex problems across distributed systems, cloud infrastructure, and real-time data – and are now expanding their Site Reliability Engineering function in Belfast.
This isn’t a traditional “ops” role.
SREs here are embedded directly into product engineering teams, owning reliability from design through to production – not just reacting when things break. You’ll be working on live, complex systems where performance, observability and scalability genuinely matter.
It’s an SRE with a Software Engineering background – you’ll write code, solve deep production issues, and influence architecture from day one.
Responsibilities
- Partnering closely with engineering teams to design and build reliable, scalable systems from the ground up
- Owning observability, monitoring and alerting across critical services
- Defining and driving SLOs / SLIs to improve system performance and reliability
- Working on infrastructure as code and improving cloud architecture
- Leading and contributing to incident response and post-mortems
- Optimising performance, cost, and resilience across cloud environments
You’ll be working across a modern stack including Kubernetes, multi-cloud environments, distributed systems and high-scale data infrastructure.
Qualifications
- Strong experience in SRE, DevOps or platform engineering roles
- Hands‑on experience with cloud platforms (AWS, GCP or Azure)
- Solid background in infrastructure as code (Terraform, CDK, etc.)
- Experience building or improving observability and monitoring systems
- Strong problem‑solving skills and a mindset for digging into complex production issues
- Comfortable writing code (Python, Go, or similar)
- Fast‑paced, product‑led engineering culture
- Constant feature delivery with real production complexity to solve
- Belfast‑based team with real influence on the global platform
- Hybrid working in stunning offices
- Work on genuinely hard, meaningful engineering problems
- Be part of a high‑calibre, low‑ego team
- Have real ownership and impact, not just operational responsibility
- Join at a stage where you can shape systems, not just maintain them
If you’re someone who enjoys getting deep into systems, solving complex problems, and building reliable platforms at scale, this is a rare opportunity to do exactly that.
#J-18808-Ljbffr…
