SmartSearch’s distinctive Anti-Money Laundering verification software protects our clients by offering the most advanced and comprehensive features available from an AML provider.
We are looking for a Site Reliability Engineer (SRE) who will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure and applications. This role focuses on maintaining and improving system observability, automating operations, and enhancing deployment practices to support business-critical services.
Reporting directly to the Lead Site Reliability Engineer, you will be expected to work independently while collaborating closely with engineering and operations teams. You will be responsible for implementing and maintaining monitoring and logging solutions while producing clear documentation to support the cloud environment. Continuous learning and improving performance based on set targets will be expected.
Please note, you'll be required to be within commutable distance to the Ilkley office for occasional office attendance.
Ensuring system reliability, performance, and scalability through monitoring and automation
Building and maintaining observability solutions using Grafana, Prometheus, Loki, OpenTelemetry
Proactively identifying and resolving performance bottlenecks and infrastructure issues
Automating infrastructure provisioning, configuration management, and deployments
Implementing effective logging, monitoring, and alerting strategies
Working with DevOps engineers to streamline CI/CD pipelines and automate testing
Providing detailed documentation for cloud infrastructure, deployment processes, and best practices
Actively participating in capacity planning and cloud architectural decisions
Experience designing and implementing robust observability, monitoring and logging solutions
Strong proficiency with observability and monitoring tools such as Grafana, Prometheus, and Loki
An understanding of cloud networking architecture and load balancing techniques
Good written and verbal communication skills, with a strong standard of English
Desire to continuously learn and stay updated with technology advancements
Several years’ experience in an SRE, DevOps, or similar role
Knowledge of application performance monitoring solutions like DataDog or NewRelic
Hands-on experience with DevOps practices, including CI/CD pipelines and automated deployments
Understanding of software development, ideally with PHP
Strong automation and scripting abilities with Python, Bash, or Go
Proficiency in capacity planning and performance optimization
We are a multi-award winning Tech company with an aspirational mentality
Some of our most recent recognitions include: named in the renowned RegTech100 list for 2024, listed in the Top 100 Fasted Growing Tech Companies by Northern Tech Awards 2024 as well as being named Technology Provider of the Year by Corporate Finance Awards 2024
There are excellent progression opportunities due to our growth and you will have personal development goals, regular feedback and support
We are a diverse and inclusive team committed to promoting Diversity & Inclusion and Social Responsibility. Through our DE&I group, charitable initiatives and support for local schools, we actively foster a positive Impact on our community
Our comprehensive benefit package includes:
~25 days holiday rising to 30 with each year of service
~ Private Medical Insurance covering dental and optical
~ Company pension scheme
~ Life Assurance – 4x your annual salary
~Employee Assistance Programme
~ Cycle to work scheme
~ On site gym