Founding Cloud SRE — AI/ML Platform & GPU Compute

{ “@context”: “http://schema.org”, “@type”: “JobPosting”, “title”: “Founding Cloud SRE — AI/ML Platform & GPU Compute”, “description”: “Icehouseventures is seeking a Staff Cloud Site Reliability Engineer to shape the reliability of large-scale AI systems and GPU compute infrastructure. This founding role involves building and scaling reliability foundations for the AI cloud platform and ensuring cloud infrastructure resilience. Responsibilities include operationalizing SLOs, improving incident response, and creating automation for operations. The position offers a hybrid work model, encouraging collaboration in the London office while allowing remote work.#J-18808-Ljbffr”, “datePosted”: “2026-05-19”, “hiringOrganization”: { “@type”: “Organization”, “name”: “Icehouseventures”, “sameAs”: “https://uk.whatjobs.com/pub_api__cpl__435982832__4861?utm_campaign=publisher&utm_medium=api&utm_source=4861&geoID=33” }, “jobLocation”: { “@type”: “Place”, “address”: { “@type”: “PostalAddress”, “addressLocality”: “London” } } }
Company: Icehouseventures
Apply for the Founding Cloud SRE — AI/ML Platform & GPU Compute
Location: London
Job Description:

Icehouseventures is seeking a Staff Cloud Site Reliability Engineer to shape the reliability of large-scale AI systems and GPU compute infrastructure. This founding role involves building and scaling reliability foundations for the AI cloud platform and ensuring cloud infrastructure resilience. Responsibilities include operationalizing SLOs, improving incident response, and creating automation for operations. The position offers a hybrid work model, encouraging collaboration in the London office while allowing remote work.#J-18808-Ljbffr…

Posted: May 19th, 2026