Senior AI Inference Engineer 100% Remote

{ “@context”: “http://schema.org”, “@type”: “JobPosting”, “title”: “Senior AI Inference Engineer 100% Remote”, “description”: “

Responsibilities

  • Deploy machine learning models to edge devices using the frameworks: llama.cpp, ggml, onnx.
  • Collaborate closely with researchers to assist in coding, training and transitioning models from research to production environments.
  • Integrate AI features into existing products, enriching them with the latest advancements in machine learning.

Qualifications

  • Excellent programming skills in C++; experience in Javascript is a bonus.
  • Strong experience with Llama.cpp and ggml inference engines, facilitating the deployment of models to specific GPU architectures.
  • Good understanding of deep learning concepts and model architectures.
  • Experience with transformers, LLMs, Diffusion models.
  • Demonstrated ability to rapidly assimilate new technologies and techniques.
  • A degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track record in AI R&D.

Bonus points if:

  • Experience with Javascript/Typescript.
  • Understanding of the difficulties, nuances and importance of P2P technology.
  • Experience with Vulkan, Metal and OpenCL.
  • Productionized models.

#J-18808-Ljbffr”, “datePosted”: “2026-05-09”, “hiringOrganization”: { “@type”: “Organization”, “name”: “Framework Ventures”, “sameAs”: “https://uk.whatjobs.com/pub_api__cpl__428048837__4861?utm_campaign=publisher&utm_medium=api&utm_source=4861” }, “jobLocation”: { “@type”: “Place”, “address”: { “@type”: “PostalAddress”, “addressLocality”: “” } } }
Company: Framework Ventures
Apply for the Senior AI Inference Engineer 100% Remote
Location:
Job Description:

Responsibilities

  • Deploy machine learning models to edge devices using the frameworks: llama.cpp, ggml, onnx.
  • Collaborate closely with researchers to assist in coding, training and transitioning models from research to production environments.
  • Integrate AI features into existing products, enriching them with the latest advancements in machine learning.

Qualifications

  • Excellent programming skills in C++; experience in Javascript is a bonus.
  • Strong experience with Llama.cpp and ggml inference engines, facilitating the deployment of models to specific GPU architectures.
  • Good understanding of deep learning concepts and model architectures.
  • Experience with transformers, LLMs, Diffusion models.
  • Demonstrated ability to rapidly assimilate new technologies and techniques.
  • A degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track record in AI R&D.

Bonus points if:

  • Experience with Javascript/Typescript.
  • Understanding of the difficulties, nuances and importance of P2P technology.
  • Experience with Vulkan, Metal and OpenCL.
  • Productionized models.

#J-18808-Ljbffr…

Posted: May 9th, 2026