RL Research Engineer: Safe, Scalable AI Systems

{ “@context”: “http://schema.org”, “@type”: “JobPosting”, “title”: “RL Research Engineer: Safe, Scalable AI Systems”, “description”: “Anthropic is seeking a Research Engineer specializing in Reinforcement Learning to advance large language model capabilities. This role involves collaborative research and engineering, focused on optimizing core reinforcement learning infrastructure and driving performance through novel methodologies. We're looking for candidates proficient in Python, with strong experience in machine-learning frameworks and systems design. The position offers a salary range of £260,000–£630,000 GBP and requires a Bachelor's degree or equivalent, along with a commitment to AI safety and benefits.#J-18808-Ljbffr”, “datePosted”: “2026-05-15”, “hiringOrganization”: { “@type”: “Organization”, “name”: “Anthropic”, “sameAs”: “https://uk.whatjobs.com/pub_api__cpl__432616837__4861?utm_campaign=publisher&utm_medium=api&utm_source=4861&geoID=33” }, “jobLocation”: { “@type”: “Place”, “address”: { “@type”: “PostalAddress”, “addressLocality”: “London” } } }
Company: Anthropic
Apply for the RL Research Engineer: Safe, Scalable AI Systems
Location: London
Job Description:

Anthropic is seeking a Research Engineer specializing in Reinforcement Learning to advance large language model capabilities. This role involves collaborative research and engineering, focused on optimizing core reinforcement learning infrastructure and driving performance through novel methodologies. We’re looking for candidates proficient in Python, with strong experience in machine-learning frameworks and systems design. The position offers a salary range of £260,000–£630,000 GBP and requires a Bachelor’s degree or equivalent, along with a commitment to AI safety and benefits.#J-18808-Ljbffr…

Posted: May 15th, 2026