RL Research Engineer: Safe, Scalable AI Systems

Company: Anthropic
Apply for the RL Research Engineer: Safe, Scalable AI Systems
Location: London
Job Description:

Anthropic is seeking a Research Engineer specializing in Reinforcement Learning to advance large language model capabilities. This role involves collaborative research and engineering, focused on optimizing core reinforcement learning infrastructure and driving performance through novel methodologies. We’re looking for candidates proficient in Python, with strong experience in machine-learning frameworks and systems design. The position offers a salary range of £260,000–£630,000 GBP and requires a Bachelor’s degree or equivalent, along with a commitment to AI safety and benefits.#J-18808-Ljbffr…

Posted: June 1st, 2026