Senior/Lead SRE opportunity, top tier finance organisation
We are seeking a Senior SRE to join our client as their first SRE and play a pivotal role in constructing a comprehensive observability platform. If successful, you will be responsible for designing, deploying, and maintaining a system that grants visibility into their IT infrastructure and operations.
Your Role:
* Architect and implement a comprehensive observability and traceability platform.
* Identify and address gaps in monitoring coverage, collaborating with cross-functional teams to implement solutions.
* Proactively identify and remediate system performance issues.
* Develop and implement strategies to enhance system reliability and scalability.
* Partner with stakeholders to define and configure alerting mechanisms.
* Champion automation initiatives, utilizing automation tools and frameworks for efficient code deployment and system management.
* Documentation of system configurations and operational procedures.
Qualifications:
* 5+ years experience as a Senior SRE
* Demonstrated expertise in both AWS and Azure.
* In-depth understanding of Windows Server, Linux operating systems, and Kubernetes container orchestration.
* Strong foundation in network monitoring principles, with familiarity of NetFlow and network telemetry streaming a significant advantage.
* 5+ years of experience working with logging, tracing, and metrics platforms (experience with Grafana, Influx, Prometheus, ELK Stack, or Loki is preferred).
* Proven ability to interact with third-party APIs for data integration and analysis.
* Experience with data collection and transformation systems like Open Telemetry.
* Strong scripting and programming skills with the likes of Bash/PowerShell/Python.
What They Can Offer:
* Excellent salary and annual bonus potential, plus private healthcare, 11% pension and family-oriented benefits.
* Being the first SRE into the business with scope to grow the team and evangelise for the SRE mindset, with the opportunity to make a significant impact on business-critical systems.
* Hybrid work environment, with home working setup grant.
* A commitment to continuous learning and professional development.
If you are a passionate SRE with a talent for building robust monitoring solutions, please apply!