Our client is a specialist software company formed as a joint venture between two industry powerhouses. They operate at the intersection of AI, financial services, and consulting, specialising in developing the next generation of AI solutions for the financial services market. They deliver complex, low-latency, high-availability systems that underpin mission-critical operations.
About the Role
As a Production Support Engineer, you will take full responsibility for live, mission-critical platforms, ensuring stability, performance, and resilience. You will act as the final technical authority during incidents, collaborating closely with core engineering, infrastructure, and stakeholder teams. Your work will shape system reliability and operational excellence across the business.
What You’ll Be Doing
* Owning and supporting production systems across multiple environments
* Diagnosing and resolving complex, real-time issues in live systems
* Performing root-cause analysis (RCA) and implementing long-term fixes
* Monitoring system health using logs, metrics, and alerting platforms
* Supporting deployments and releases while mitigating risk to production
* Improving observability, resilience, and incident response processes
* Automating repetitive tasks to reduce operational toil
* Participating in on-call rotations for high-priority incidents
* Influencing system design for reliability alongside engineering teams
Ideal Background
* Strong experience in Production Support, Application Support, SRE, or DevOps
* Excellent debugging and problem-solving skills
* Solid knowledge of Linux/Unix systems and networking fundamentals
* Experience with logging, monitoring, and alerting frameworks
* Ability to read, understand, and troubleshoot production code
* Experience supporting high-availability, distributed systems
* Calm, structured approach to incident management
* Desirable: financial services, trading, payments, or complex enterprise systems
* Desirable: scripting/programming (Python, Java, Bash, SQL) and cloud platforms (AWS, Azure, GCP)
* Desirable: CI/CD tooling experience and an interest in moving towards platform engineering
What You’ll Receive
* Highly competitive salary and package
* 25 days annual leave + UK public holidays
* Contributory pension scheme
* Private healthcare, dental, and wellbeing support options
* Critical illness and life assurance cover
* Flexible benefits including cycle-to-work, wellness programs, and more
* Hybrid working and exposure to complex, high-impact projects
Who Should Apply
This role is for support engineers who thrive in high-pressure, mission-critical environments and are ready to take ownership of live production systems. If you’re passionate about operational excellence, solving complex problems in real time, and shaping system reliability, our client wants to hear from you.
#J-18808-Ljbffr