Cloud Infrastructure Engineer

Company: Perlego
Apply for the Cloud Infrastructure Engineer
Location: London
Job Description:

What we do

At Perlego, we are working hard to make education accessible to all. In this digital age, we believe that anyone should be able to learn anything at any time. Knowledge should be more accessible, not locked behind sky-high price tags.

Over the past 9 years, our goal has been to support students across the UK & Europe to access quality books. Our ambition is to expand our support to students globally, specifically looking at the US, and build a product that goes beyond the book, a platform that helps students study smarter and more effectively.

What we’re looking for

We are looking for an experienced Cloud Infrastructure Engineer with a strong background in AWS services and monitoring tools. In this role, you will ensure the availability and reliability of our services. You will be integral to swiftly addressing issues, resolving incidents independently, and thriving in a fast-paced environment.

What you’ll do

As a Cloud Infrastructure Engineer, your main focus will be to ensure our services remain highly available and performant. Key responsibilities include:

Cloud Infrastructure Management

  • Manage and supportAWS infrastructure, focusing on scalability, security, and reliability.
  • Handle deployments, managingCI/CD pipelines for both containerised (Docker/ECS) and serverless (AWS Lambda) applications.
  • Own infrastructure as code — provisioning resources declaratively so environments are reproducible, version-controlled, and safe to change.
  • Ensure effective backup, recovery, and disaster recovery strategies to minimise downtime.
  • Manage operational and analytical data stores (Aurora MySQL, DynamoDB)
  • Drive cost optimisation across the infrastructure — monitoring spend, eliminating waste, and rightsizing resources to balance performance and cost.

Monitoring & Incident Management

  • Monitor and manage platform activity using tools likePrometheus,Grafana, orAWS CloudWatch
  • Respond quickly to alerts and incidents, independently resolving issues and ensuring service uptime.
  • Conduct post‑incident reviews and help improve system resiliency through automation and monitoring enhancements.
  • Review network activity with AWS Security Hub and Cloudflare

Collaboration & Communication

  • Collaborate with cross‑functional teams to implement platform improvements.
  • Work independently and make swift decisions when managing service incidents outside core business hours.
  • Assist in platform security, ensuring adherence to best practices for cloud security and compliance.

Continuous Improvement

  • Automate manual processes to reduce human error and improve efficiency.
  • Continuously enhance monitoring systems, ensuring robust early detection and resolution capabilities.
  • Identify potential performance bottlenecks and contribute to overall platform optimisation.

This role is ideal for you if you possess

  • Experience in Cloud Infrastructure Engineering, DevOps, or a similar field.
  • Strong experience with AWS services and containerised applications
  • Strong experience operating operational data stores (Aurora MySQL, DynamoDB).
  • Expertise in using monitoring tools(e.g. Prometheus, Grafana, CloudWatch) for real‑time platform performance insights.
  • Strong understanding of network security and Cloudflare, VPC and networking fundamentals, with a clear grasp of how traffic and infrastructure components flow together end to end.
  • Hands‑on experience with CI/CD pipeline management for deploying containerised (Docker) and serverless applications, preferably with GitHub Actions
  • Proficiency in Linux‑based operating systems and shell scripting.
  • Familiarity with Infrastructure as Code tools (Terraform, CloudFormation).
  • Experience with incident management, troubleshooting, and platform recovery in high‑pressure environments.
  • Strong communication skills with a proven ability to work both independently and collaboratively

It’s a plus if you have

  • Experience working in a global, distributed team providing off‑hours support.
  • Previous experience with SecOps and cloud security best practices.
  • Familiarity with scaling highly available systems in a fast‑paced, growth‑oriented environment.

Compensation

The salary available for this role is £60,000‑65,000 dependent upon experience.

Flexible

We operate a flexible hybrid working environment. However we would be open to a remote role for the right candidate.

L&D Budget

We value continuous learning and you will have a personal L&D budget for online courses, subscriptions, or books not on Perlego.

Learning Time

All employees have dedicated Learning Time to focus on new skills, projects, or interests outside their day‑to‑day role, including Hackathons.

Work-Life Balance

22 days annual leave + 1 additional day per year of service

Office Reset

The days between Boxing Day and New Year off, additional to annual leave.

Flexi Bank Holidays

Flexibility to swap local bank holidays for religious or cultural days.

Work from overseas

Flexible short‑period remote working overseas, as long as you remain a UK tax resident.

Sabbatical

1-month unpaid sabbatical after 3 years; 1-month paid sabbatical after 5 years.

Personal Days

1 additional day per year for life events.

Health & Wellbeing

Private medical, optical and dental insurance via Vitality.

Cycle to Work Scheme

Social

Regular social events and activities for everyone.

Family time

Competitive matched parental leave and phased return to work.

Workplace Nursery Benefit

#J-18808-Ljbffr…

Posted: June 20th, 2026