Data Engineer – Content Analytics

{ “@context”: “http://schema.org”, “@type”: “JobPosting”, “title”: “Data Engineer – Content Analytics”, “description”: “

We’re looking for an experienced Data Engineer to join a high-performing Content Analytics & Products team, supporting the development of data-driven products and insights across the full content lifecycle.

This role sits at the heart of a growing data function, where you’ll design and build scalable data pipelines, enabling both customer-facing features and internal analytics.

What you’ll be doing

  • Designing, building and optimising scalable data pipelines (batch & streaming)
  • Ingesting and transforming data from multiple sources into high-quality datasets
  • Developing robust data models to support analytics and reporting
  • Implementing big data solutions using tools such as Spark, EMR, Redshift and Kinesis
  • Building and maintaining data platforms using infrastructure-as-code (AWS CDK)
  • Creating automated data quality frameworks to ensure accuracy and reliability
  • Writing high-performance SQL for complex data transformations
  • Partnering with business and engineering teams to deliver impactful data solutions
  • Improving performance and cost efficiency across data processing and storage

What we’re looking for

  • Strong experience building and operating data pipelines at scale
  • Solid background in data modelling, warehousing and ETL/ELT
  • Proficiency in SQL and at least one programming language (Python, Java, Scala, etc.)
  • Experience working with large, complex datasets in distributed environments
  • Good understanding of software engineering best practices (CI/CD, testing, version control)

Nice to have

  • Experience with AWS tools (Redshift, S3, Glue, EMR, Kinesis, Lambda)
  • Exposure to Spark or other big data frameworks
  • Experience with non-relational databases or modern data storage solutions

#J-18808-Ljbffr”, “datePosted”: “2026-05-20”, “hiringOrganization”: { “@type”: “Organization”, “name”: “Cpl Life Sciences”, “sameAs”: “https://uk.whatjobs.com/pub_api__cpl__436987782__4861?utm_campaign=publisher&utm_medium=api&utm_source=4861&geoID=33” }, “jobLocation”: { “@type”: “Place”, “address”: { “@type”: “PostalAddress”, “addressLocality”: “London” } } }
Company: Cpl Life Sciences
Apply for the Data Engineer – Content Analytics
Location: London
Job Description:

We’re looking for an experienced Data Engineer to join a high-performing Content Analytics & Products team, supporting the development of data-driven products and insights across the full content lifecycle.

This role sits at the heart of a growing data function, where you’ll design and build scalable data pipelines, enabling both customer-facing features and internal analytics.

What you’ll be doing

  • Designing, building and optimising scalable data pipelines (batch & streaming)
  • Ingesting and transforming data from multiple sources into high-quality datasets
  • Developing robust data models to support analytics and reporting
  • Implementing big data solutions using tools such as Spark, EMR, Redshift and Kinesis
  • Building and maintaining data platforms using infrastructure-as-code (AWS CDK)
  • Creating automated data quality frameworks to ensure accuracy and reliability
  • Writing high-performance SQL for complex data transformations
  • Partnering with business and engineering teams to deliver impactful data solutions
  • Improving performance and cost efficiency across data processing and storage

What we’re looking for

  • Strong experience building and operating data pipelines at scale
  • Solid background in data modelling, warehousing and ETL/ELT
  • Proficiency in SQL and at least one programming language (Python, Java, Scala, etc.)
  • Experience working with large, complex datasets in distributed environments
  • Good understanding of software engineering best practices (CI/CD, testing, version control)

Nice to have

  • Experience with AWS tools (Redshift, S3, Glue, EMR, Kinesis, Lambda)
  • Exposure to Spark or other big data frameworks
  • Experience with non-relational databases or modern data storage solutions

#J-18808-Ljbffr…

Posted: May 20th, 2026