Title: Distributed Systems Engineer (HPC)
London (Hybrid – 1 day/week onsite)
Contract (Initial 3 months, extendable to Dec 2026)
We’re looking for an experienced Distributed Systems Engineer to join an exciting High-Performance Computing (HPC) initiative. This is a unique opportunity to help build the core infrastructure behind a next-generation, GPU-accelerated distributed computing platform.
What You’ll Be Working On
You’ll play a key role in developing resilient, high-throughput backend services that orchestrate workloads across CPU and GPU environments.
Your work will include:
- Designing and building distributed services in Rust
- Developing scalable work queue systems (e.g. Apache Pulsar, RabbitMQ)
- Building GPU node orchestration services for data movement
- Creating robust storage lifecycle and indexing services
- Designing performant APIs using gRPC
What We’re Looking For
Essential:
- Strong experience in Rust for backend systems
- Deep understanding of distributed systems architecture
- Experience with message queues (Pulsar, RabbitMQ, etc.)
- gRPC / API design experience
- AWS-based service deployment experience
Nice to Have:
- HPC environments (RDMA, Lustre)
- GPU architecture and CUDA ecosystem
- High-performance data and networking systems
…
