Job Description
Life on the team

Location: UK Wide

At Computacenter, you'll be joining a world-class team of over 1,000 skilled professionals within Group Professional Services (GPS). Our teams operate across the UK, Germany, France, and India, delivering complex, enterprise-grade IT solutions and consultancy across infrastructure, cloud, and modern operations.

As a Monitoring & Observability Engineer, you'll work in high-impact delivery teams that support some of the world's most well-known organisations. You'll play a key role in helping our customers achieve greater visibility, performance, and reliability across their IT estates, contributing to their operational success through proactive insight and incident prevention.

What you'll do

- Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk
- Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues
- Integrate monitoring platforms with ITSM tools (e.g. ServiceNow) and CI/CD pipelines to enable proactive alerting and resolution workflows
- Act as a Monitoring & Observability SME within customer delivery teams
- Support incident response activities and postmortems by identifying patterns, root causes, and optimisation opportunities
- Work collaboratively with cross-functional teams to define and implement best practices in observability and monitoring
- Attend customer and project meetings to present monitoring solutions and technical designs
- Proactively identify and highlight risks that could impact solution success

What you'll need

- Strong experience deploying and managing observability platforms including Dynatrace, Grafana, and/or Splunk
- Deep understanding of telemetry signal analysis and performance monitoring
- Experience integrating observability tools with ITSM platforms and DevOps toolchains
- Ability to troubleshoot complex infrastructure and application issues using monitoring insights
- Solid scripting experience (Python, PowerShell) for automation and integration
- Familiarity with cloud platforms (Azure, AWS) and monitoring containerised environments (Kubernetes, OpenShift)
- Strong problem-solving skills and analytical thinking to draw meaningful insights from data
- Excellent communication skills with the ability to convey technical concepts to both technical and non-technical audiences
- Experience working in Agile project environments (Scrum, Kanban, etc.)
- A proactive mindset with a passion for continuous improvement and knowledge sharing

Certifications

- Dynatrace Associate & Pro
- Splunk Core Certified Power User

Desirable Experience

- DevOps or Site Reliability Engineering (SRE) experience
- Automation with Terraform or similar tools
- Building CI/CD pipelines
- Experience with Docker and Kubernetes for packaging and deployment
- Ability to adapt to new technologies in fast-paced environments