Overview
Solus, part of the Aviva family, is growing our Technology capability and we're looking for a talented Performance and Monitoring Engineer to help us strengthen the stability, reliability and performance of our systems. If you're passionate about monitoring, observability and using data to proactively improve service health, this is a great opportunity to make a real impact across a large, modern technology estate.
Responsibilities
- Be the subject matter expert for monitoring and performance, responsible for designing, implementing and maintaining the tools and dashboards that provide real-time visibility of our infrastructure, applications and cloud services.
- Own and optimise platforms such as LogicMonitor, Azure Monitor, App Insights and Log Analytics.
- Build meaningful dashboards, alerts, telemetry pipelines and performance insights.
- Identify risks, trends and early indicators to prevent incidents before they happen.
- Carry out deep-dive investigations into performance issues and recommend improvements.
- Work with Platform, Operations, Security and Product teams to ensure systems are reliable, available and scalable.
- Automate responses and integrations to improve speed, accuracy and consistency.
- Support major changes, deployments and post-incident reviews with data-driven evidence.
Qualifications
- Strong experience with monitoring and observability tools (LogicMonitor, Azure Monitor, App Insights, Log Analytics, Defender for Cloud).
- Excellent understanding of cloud performance, IaaS/PaaS, networking fundamentals, API performance and capacity modelling.
- Skilled in dashboards, log queries (KQL), custom metrics and performance analysis.
- Ability to diagnose complex issues across infrastructure, networks, applications or databases.
- Confident scripting and automation skills (PowerShell, Azure Automation, Graph API).
- Clear communicator who can simplify technical detail for both technical and non-technical teams.
Desirable qualifications
- Microsoft certifications (AZ-900, AZ-104, AZ-305, AZ-500) or similar.
- Experience with LogicMonitor admin, Grafana or other observability tools.
- Familiarity with SRE concepts (SLIs, SLOs, error budgets).
- Understanding of ITIL processes.
#J-18808-Ljbffr