Senior SRE, AI Inference Platform — GPU Scalability

Company: Nebius
Apply for the Senior SRE, AI Inference Platform — GPU Scalability
Location: London
Job Description:

Nebius, headquartered in Amsterdam with a global presence, is seeking an engineer to enhance the reliability and performance of their inference platform, crucial for AI deployment. You will design telemetry pipelines, tune Kubernetes for efficiency, and create resilient systems under load. Candidates should be proficient in technologies like Kubernetes, Prometheus, and Python, and have experience with GPU workloads.

The role offers competitive compensation, career growth opportunities, and a collaborative culture focused on impactful AI projects.

#J-18808-Ljbffr…

Posted: July 1st, 2026