Senior Site Reliability Engineer (Observability) Location: London/UK (Remote) Contract: 12 Months Initial Day rate : 55 Per Hour – 62 Per Hour Inside IR35 We are looking for a Senior Site Reliability Engineer with strong experience in Observability, Monitoring and Distributed Systems to support large-scale cloud infrastructure supporting millions of devices globally. The role focuses on building and scaling monitoring, logging and alerting platforms to ensure high availability and performance of cloud services. Manage and scale Prometheus monitoring systems Build and maintain data pipelines using Kafka Develop alerting and monitoring frameworks Develop tools and scripts using Python, Go, Ruby or Bash Work with Linux systems (Debian/Ubuntu) Improve system reliability, performance and scalability 5+ years experience in Site Reliability Engineering / DevOps~ Strong Linux systems experience~ Observability and Monitoring tools experience~ Ansible / Configuration Management~ Programming experience (Python, Go, Ruby or Bash)~ This is an urgent vacancy where the hiring manager is shortlisting for an interview immediately. Randstad Technologies is acting as an Employment Business in relation to this vacancy….
