AI Inference Engineer | GPU-Scale Rust/Python | Equity

Company: Perplexity

Location: London

Posted: April 4th, 2026

We are looking for an AI Inference Engineer to join our growing team. We build and run the inference engine behind every Perplexity query, deploying dozens of model architectures at scale under tight latency and cost budgets. Our stack is Rust, Python, CUDA, and CuTe DSL.

Responsibilities

Who We're Looking For

Nice-to-have

Qualifications

Final offer amounts are determined by multiple factors, including experience and expertise.

Equity: In addition to the base salary, equity may be part of the total compensation package.
