In this role you will collaborate with Application Stewards and Site Reliability Engineers to confirm which critical assets are in scope for monitoring verification and uplift.
Your responsibilities will include working with EMAS to analyse Prometheus scrape coverage, exporter deployment, and Grafana dashboard availability for critical applications.
Key Capabilities / Knowledge:
Deep expertise in designing, implementing, and configuring the following:
Prometheus
* PromQL development for complex queries and performance analysis.
* Recording rules, alerting rules, and metric optimisation.
Grafana
* Dashboard and panel design with performance‑focused visualisations.
* Alerting configuration and best‑practice alert routing.
* Synthetic monitoring integrations (e.g., Grafana Synthetic Monitoring / Blackbox exporter).
* Log ingestion and analysis (e.g., Loki).
* Real User Monitoring equivalents or integrations (e.g., Grafana Faro for web telemetry).
Location: Edinburgh (Potentially Bristol or London)
Contract: 6 Months
Hybrid: 2 days in the office, 3 days working from home