Universal Music Group invites applications for the Observability Engineer position, an integral part of our global technology team focusing on data‑driven decisions, automation, and continual improvement.
Job Summary
We are seeking a talented and proactive Observability Engineer to design, implement, and enhance observability solutions that ensure the reliability, performance, and scalability of our critical IT systems and applications worldwide.
Responsibilities
- Design, implement, and continuously improve the observability stack across cloud‑native, on‑premise, and hybrid environments, covering monitoring, logging, tracing, and alerting.
- Evaluate, select, and deploy leading observability tools (Dynatrace, AWS CloudWatch, Prometheus, Grafana, ELK Stack, Splunk, OpenTelemetry) and automate pipelines and alerting mechanisms.
- Define, enforce, and advocate observability standards and best practices across engineering and operations teams.
- Create and maintain dashboards and automated alerts to provide real‑time insights into system health, performance, and availability.
- Collaborate with Operations to diagnose and resolve incidents using telemetry data and conduct post‑incident reviews.
- Partner with development, SRE, and infrastructure teams to embed observability throughout the technology lifecycle.
- Analyze telemetry data to identify and resolve performance bottlenecks, optimize resource allocation, and fine‑tune configurations.
- Contribute to compliance and security efforts through effective log management and integration with SIEM systems.
- Work independently and as part of a global team to design and implement robust solutions across a hybrid ecosystem.
- Troubleshoot complex issues based on observability notifications and document processes and best practices.
- Actively contribute to a positive, respectful Observability team culture.
Qualifications
- 3+ years of hands‑on experience in an Observability, Site Reliability Engineering, or DevOps role focused on observability.
- Strong understanding and practical experience with monitoring, logging, and tracing systems.
- Proficiency with industry‑standard observability tools (Dynatrace, AWS CloudWatch, Prometheus, Grafana, ELK Stack, Splunk, Logic Monitor).
- Experience with major cloud platforms (AWS, Azure, or GCP).
- Solid programming and scripting skills (Python, Go, Shell, JavaScript) for automation.
- Understanding of distributed systems, microservices architectures, and cloud‑native environments; experience with Docker/Kubernetes and DevOps principles.
- Familiarity with CI/CD pipelines and automation tools such as Ansible and Terraform.
- Exceptional analytical and problem‑solving abilities with a proactive approach.
- Excellent communication, collaboration, and interpersonal skills.
- Prior experience supporting critical business applications in a large‑scale global enterprise.
- Security awareness and ability to integrate security monitoring into observability processes.
- Bachelor's degree in Computer Science, Engineering, or related field, or equivalent practical experience.
- Self‑motivated with initiative, strong follow‑up skills, and analytical mindset.
- Travel may be required.
Desired Qualifications
- Experience with Chaos Engineering, Canary/Blue‑Green deployment strategies, capacity planning, data analysis, and networking.
- Software engineering background for designing and automating observability workloads.
- Relevant certifications (AWS Certified DevOps Engineer, Kubernetes certifications).
Everyone is welcome to apply for our roles. We are committed to ensuring that no applicant or employee receives less favourable treatment because of gender, race, disability, sexual orientation, religion, belief, age, marital status, background, pregnancy, or caring responsibilities. We also recognise the importance of diversity of thought and fully support people with autism, dyslexia, ADHD, and other neurocognitive variations. If you require reasonable adjustments for the application process, please email UniversalMusicCareers@umusic.com.
#J-18808-Ljbffr