Salary: £50,000 - £75,000 per year

Requirements:
Proven experience with Prometheus, including PromQL, and Grafana in production environments
Strong knowledge of Linux-based systems
Experience writing and optimizing PromQL queries for alerts and dashboards
Familiarity with exporters such as node_exporter, blackbox_exporter, and custom exporters
Understanding of Alertmanager configuration and routing
Proficiency with Grafana dashboard creation and templating
Strong troubleshooting skills for infrastructure and application issues
Familiarity with containers, including Docker
Scripting skills with a focus on Python; Bash or Go is also beneficial
Ability to work from our Cardiff office one day per week

Responsibilities:
Design, configure, and maintain Prometheus-based monitoring solutions
Develop and manage metric exporters for application and system-level data
Optimize Prometheus scraping configurations and retention policies
Define and maintain alert rules based on SLIs, SLOs, and performance baselines
Ensure alerts are actionable with minimal false positives
Participate in on-call rotations and incident postmortems as needed
Design and maintain Grafana dashboards for real-time operational insights
Collaborate with engineering and product teams to create tailored visualizations
Provide self-service dashboard capabilities for end users
Monitor infrastructure including servers, containers, databases, and services for uptime, latency, and throughput
Identify bottlenecks and recommend improvements

Technologies: Bash Docker Grafana Linux PLC Prometheus Python Security UX UI Design NodeJS Hardware Support

More:
We are SRT Marine Systems plc, a respected, established, and ambitious multinational company headquartered in the UK.
We are a market leader in international marine surveillance technology and maritime domain awareness solutions that improve security, safety, environmental protection, and sustainability for customers worldwide, from national coast guards to individual vessel owners.

This role is part of a small team working on end-user observability visualization, supported by highly experienced engineers, UX designers, and our lead observability engineer. Internally, the role is titled System Monitoring & Observability Engineer, and it requires working from our Cardiff office one day per week.

We offer a highly competitive salary and benefits package, matched pension contributions up to 5%, 25 days annual leave rising to 28 days with service, and career development opportunities.

We are an equal opportunity employer committed to an inclusive working environment.

Last updated: week 21 of 2026