Research Engineer

Company: ScienceMachine
Apply for the Research Engineer
Location: London
Job Description:

# About the role

Science Machine is building agentic AI software for automating bioanalysis workflows. Our platform helps scientists and bioanalysis teams turn complex analytical processes into reliable, repeatable, AI-assisted workflows across data ingestion, analysis, reporting, review, and compliance.

The Research Engineer executes our research projects. The work spans agent systems, how we quantify their performance, and fine-tuning open models. Research direction comes from product feedback; you turn it into shipped results.

You will work close to the science: formulating problems, designing experiments, shipping implementations, curating data, and interpreting results. You will work with biologists, scientific contractors, and the rest of the team, and act as a technical sparring partner on what to build next.

## What you'll do

– Turn research directions into projects: scoping, experiment design, implementation, data, and results.

– Define how we measure system performance: which metrics matter, which proxies are honest, which signals to trust.

– Build the evaluation infrastructure to put those metrics into practice: benchmarks, harnesses, and the tooling around them.

– Develop the research agent stack: memory and in-context learning, test time compute, and models.

– Fine-tune and post-train open models to improve agent performance.

– Work with biologists, scientific contractors, and annotators to build reproducible training and evaluation data pipelines.

– Track what frontier labs are shipping and bring back what's relevant.

## Essential experience

– Experience building ML/AI systems in a research-adjacent context (industry, lab, or PhD).

– Experience building LLM-powered systems: prompts, context engineering, agent architectures.

– Experience working with evaluations and benchmarks, including in tasks where “correct” is ambiguous.

– Familiarity with model training generally, including the data, optimisation, and evaluation work around it.

– Strong engineering fundamentals. Fluent in Python and comfortable across the AI/ML stack.

– Experience running experiments rigorously, in academia or industry. You think about confounds. You can defend your results.

– Experience with training and evaluation data pipelines, including reproducibility and observability.

## Nice to have

– Experience in a life-science domain (biology, chemistry, medicine, bioinformatics).

– Post-training experience on LLMs.

– Peer-reviewed publications, or other settings where your ideas were stress-tested.

– Open source contributions to scientific or AI tooling.

## Essential qualities

– High ownership: you notice what needs doing and carry it through.

– Skeptical of your own results: you assume a good-looking number is wrong until you understand why it isn't.

– Strong opinions, weakly held: you push back, defend a position, and change your mind when evidence moves.

– Hands-on with the unglamorous parts: data cleaning, contractor coordination, eval annotation, whatever the project needs.

– Reliable under ambiguity: you make progress when problems aren't yet well-defined.

– Curious about science: you want to learn the bioanalysis domain.

– Clear communicator: you can explain technical decisions and results to engineers, scientists, and founders.

Posted: May 28th, 2026