Salary: £55,000 - 60,000 per year Requirements: Experience managing capacity and performance in IT environments Hands-on experience with AWS and Azure Strong knowledge of ITIL v3/v4 (certification required) Experience with monitoring/observability tools (e.g. Zabbix, Grafana, Kibana, OpenSearch) Knowledge of Windows and Linux server environments Scripting skills (e.g. Python, PowerShell, Node.js) Experience integrating data via APIs, webhooks, or messaging Strong analytical, problem-solving, and stakeholder management skills Desirable: DevOps exposure Desirable: Network infrastructure and communications protocols knowledge Desirable: Experience with social alarm platforms Responsibilities: Take ownership of performance, capacity, and resilience across critical IT services Lead observability across services by ensuring effective monitoring and actionable insights Manage capacity and performance through forecasting and trend analysis Identify risks early and drive improvements in service performance Ensure resilience and availability are built into services from the outset Support continuity planning and risk management Work closely with technical teams and stakeholders to resolve issues Deliver ongoing service improvements Technologies: AWS OpenSearch Azure DevOps Grafana Support ITIL Kibana Linux Network PowerShell Python Windows Zabbix NodeJS Cloud More: We are seeking an IT Service Performance & Reliability Manager to join our team. In this role, you will focus on keeping customer-facing services fast, reliable, and fully observable while driving continuous improvement. If you are looking for a position where you can make a tangible impact on service performance and resilience, we encourage you to apply. last updated 17 week of 2026