Overview
Your new day-to-day will see you proactively monitoring critical infrastructure and supporting the live production environment. These systems underpin the efficiency of the business's entire operations, and downtime costs the company serious cash. You’ll ensure things are fixed quickly and that good monitoring is in place so issues are identified and solved before they can cause too much damage. Working with Grafana, Splunk and New Relic, there is loads to learn, loads to get stuck into and a chance to get better at what you do.
You will be monitoring key infrastructure using bespoke tools and responding to alerts from the Network Operations Centre (NOC). You’ll investigate incidents, resolving what you can and escalating when necessary. Communication is key: you’ll work with different teams, stakeholders and the senior team to ensure operations continue smoothly.
Performing daily checks to keep production systems in top shape, coordinating planned maintenance and managing potential scheduling conflicts will be your responsibility. The infrastructure spans Linux and Windows, with both Bash and Python used for automation, so experience with any of these is beneficial. You will be upskilled and trained in these tools too; mentors are there to ensure you are given in-depth knowledge of the systems before being left to sort things on your own!
On offer are a range of benefits designed to reward and support the team. The bonus schemes recognise your hard work and dedication, while the top-tier company pension plan helps secure your future. You are cared about through the Employee Assistance Programme, and you can save money with exclusive discounts at hundreds of retailers. Additionally, your big moments will be celebrated with life event gifts along with long service rewards.
Take your career to the next level: reach out for more details. No CV required initially.
Responsibilities
* Monitor critical infrastructure and support the live production environment.
* Respond to alerts from the NOC; investigate incidents and resolve where possible, escalating when necessary.
* Perform daily checks, coordinate planned maintenance, and manage scheduling conflicts.
* Communicate with multiple teams, stakeholders and the senior team to ensure smooth operations.
* Work with Linux and Windows environments; use Bash and Python for automation (training provided).
* Engage with monitoring tools and dashboards (e.g., Grafana, Splunk, New Relic).
Qualifications
* Experience in monitoring and supporting production infrastructure.
* Familiarity with Linux and Windows environments; scripting experience with Bash or Python is beneficial.
* Experience with monitoring and alerting tools (Grafana, Splunk, New Relic) is advantageous.
* Strong communication skills; ability to collaborate with multiple teams and stakeholders.
* Willingness to learn, be upskilled, and work with mentors to gain in-depth system knowledge.