Senior Computational Biologist | ML Engineer

Company: SEQUENTIAL
Apply for the Senior Computational Biologist | ML Engineer
Location: Cambridge
Job Description:

Sequential is building a next-generationAI-drivendiscoveryplatformto identify and designnovelfunctionalactives, including peptides and complex ingredient systems. The platform integrateslarge-scalebiologicaldatasets(>50,000 samples and measurements)spanning multi-omics data, microbiome sequencing, clinical and real-world outcomes. Our goal is to translate biological signals intoactionable compound discovery and optimisation, powering a pipeline across:

  • We are looking for a Senior Computational Biologist with ML Engineering background to help build, bridge, and functionalise the link between AI-powered biological discovery and real-world clinical outcomes. This role sits across biological discovery and scalable ML engineering. You will own key parts of the end-to-end architecture from data to model to evaluation to deployment, and work closely with ML engineers and software engineers to productise the platform into client-ready outputs. This is a senior role with significant autonomy and technical ownership.

The Data You Will Work With

The platform is built on a growing dataset of>50,000 biological samples and measurements, including pairedpre- and post-treatment observations. The data includes multiple modalities such as microbiomesequencing (16S rRNA sequencing, ITS sequencing, shotgun metagenomics), Multi-omics (proteomics, lipidomics, metabolomics), Clinical and observational data (treatment exposure, formulation and ingredient combinations, clinical outcomes, patient metadata). Datasets includelongitudinalmeasurements, enabling analysis ofbiologicalresponsetointerventions(e.g., ingredient exposure, treatment, formulation).

Responsibilities

  • 1) Build the discovery engine (data > signal > candidate)
    • Develop models that identify novel functional actives from multi-omic datasets
    • Detect patterns in biological signatures that correlate with clinical outcomes (e.g., inflammation reduction, microbiome restoration, barrier repair, malodour reduction)
    • Create robust feature representations from:
      • clinical metadata and response data
      • SNP and risk features (where relevant)
  • 2) Predict mechanism + response
    • Build predictive models for:
      • molecule–microbe interactions
      • clinical response forecasting
      • safety and developability scoring
    • Translate model outputs into interpretable mechanistic narratives for R&D teams and external partners.
  • 3) Design and optimise functional complexes
    • Implement multi-objective optimisation and scoring frameworks to balance:
      • safety and stability constraints
      • manufacturability and cost
      • regulatory feasibility
    • Support generation of:
      • repurposed peptides
      • newly discovered natural peptides
  • 4) Productionise the AI product launch
    • Build end-to-end ML pipeline covering ingestion, training, evaluation and deployment
    • Develop APIs/services to serve predictions and ranked candidates into internal tools and client outputs
    • Create evaluation harnesses to compare predicted vs. observed validation outcomes
    • Implement monitoring and governance: drift, data quality checks, model versioning, auditability
  • 5) Collaborate cross-functionally
    • Work closely with biology, formulation, and clinical teams to design experiments and validation loops
    • Partner with product and commercial teams to shape “client-ready” deliverables (e.g., ranked actives, evidence packs, scientific dossiers)
    • Lead and /or partner with ML and software teams to define ownership boundaries code reviews norms, and the path from prototype to a maintained service.

12-Month Mission

Month 0–3: Foundation & Proof of Concept

  • Establish harmonised datasets and core data pipelines with dataset versioning, documented schemas, and baseline QC checks
  • Deliver feasibility screening models for active discovery with an agreed split strategy (sample, cohort, and time splits) and reporting baseline ranking metrics (e.g., hit-rate@K, NDCG@K)
  • Build initial predictive baselines with clear metrics
  • Implement multi-criteria scoring + optimisation with a defined objective, weighting strategy, and ablation plan
  • Extend predictive models and improve candidate ranking performance
  • Develop reproducible experiment tracking and evaluation workflows

Month 6–9: Advanced Validation

  • Compare predictions against real validation outputs and refine models
  • Improve robustness, interpretability, and governance
  • Deliver a performance report suitable for internal and external stakeholders

Month 9–12: Client Readiness & Pilot Launch

  • Finalise v1.0 AI product outputs (ranked candidates + evidence summaries)
  • Support pilot client projects and accelerate validation turnaround time
  • Contribute to the first commercial-ready “hero” functional complex pipeline

What we’re looking for

  • Strong Python and experience building ML systems end-to-end with evidence (links to shipped projects, tools, or repos)
  • Proven ability to work with large, messy real-world datasets and to define leakage-safe validation splits (e.g., sample, time, cohort)
  • Practical knowledge of ML evaluation, validation strategy, and failure modes including error analysis and iteration planning
  • Experience with model development using PyTorch / TensorFlow / JAX
  • Ability to communicate clearly across technical + scientific stakeholders
  • Strong experience leading or coordinating cross-functional teams, and thinking strategically across product and science (prioritisation, tradeoffs, and shipping)
  • Comfort deploying models (batch + real-time inference) into production environments or owning the handoff to engineering with clear interfaces

Strongly preferred

  • Experience with biological or high-dimensional scientific datasets (omics, imaging, microbiome, clinical)
  • Experience with multi-modal learning, embeddings, or representation learning
  • Familiarity with optimisation / ranking / multi-objective scoring systems
  • Interest in mechanism-driven modelling and scientific interpretability
  • Foundation model experience (LLMs and/or biological foundation models) with proof (fine-tuning/adapters or applied FM pipelines, plus how it was evaluated)
  • A strong portfolio of tool or pipeline development (internal platforms, evaluation harnesses, reproducible workflows)

Nice to have

  • Experience building developer-friendly tooling: CLIs, dashboards, APIs, or reusable libraries used by other scientists/engineers
  • Some front-end/product sense (you care about shipping usable tools, not just notebooks)
  • Experience with causal inference, Bayesian methods, or mechanistic simulation

#J-18808-Ljbffr…

Posted: March 20th, 2026