Salary: £60,000 - 60,000 per year Requirements: Experience managing capacity and performance in IT environments Hands-on experience with AWS and Azure Strong knowledge of ITIL v3/v4 (certification required) Experience with monitoring/observability tools (e.g. Zabbix, Grafana, Kibana, OpenSearch) Knowledge of Windows and Linux server environments Scripting skills (e.g. Python, PowerShell, Node.js) Experience integrating data via APIs, webhooks, or messaging Strong analytical, problem-solving, and stakeholder management skills Desirable: DevOps exposure Desirable: Network infrastructure and communications protocols knowledge Desirable: Experience with social alarm platforms Responsibilities: Take ownership of performance, capacity, and resilience across critical IT services Focus on keeping customer-facing services fast, reliable, and observable Lead observability across services, ensuring effective monitoring and actionable insights Manage capacity and performance through forecasting and trend analysis Identify risks early and drive improvements Ensure resilience and availability are built into services from the outset Support continuity planning and risk management Work closely with technical teams and stakeholders to resolve issues Deliver ongoing service improvements Technologies: AWS OpenSearch Azure DevOps Grafana Support ITIL Kibana Linux Network PowerShell Python Windows Zabbix NodeJS Cloud More: We are seeking an IT Service Performance & Reliability Manager to join our team and play a crucial role in enhancing service performance and resilience. Our organization is dedicated to innovation and excellence, offering a collaborative work environment where your contributions will have a meaningful impact. We encourage continuous learning and provide opportunities for professional development. If you are passionate about IT service management and want to make a tangible difference, we invite you to apply. last updated 21 week of 2026