The Client
Tempest Vane Partners is supporting a high-growth, institution-backed fintech scale-up at the forefront of digital asset infrastructure. With global offices and a mission to bring regulated solutions to institutional finance, this firm is redefining how digital assets are secured and managed.
As part of their expansion, they’re looking to bring on a Senior Production Engineer to lead the charge on reliability, resilience, and operational excellence within a complex, high-uptime platform environment.
What You’ll Get
* A superb opportunity to join an institutionally backed, cutting edge Crypto Fintech at the beginning of their journey.
* An opportunity to work closely with the leadership team, contributing to and influencing decision making.
* Opportunities for learning and development, reflecting the importance of lifelong learning.
* Ability to work cross-functionally across the company on major initiatives and be at the forefront of innovative design and development.
* Market leading salary and annual discretionary bonus.
* Pension contributions, in addition to Health Insurance, Life Assurance.
* 25 Annual Leave.
What You’ll Be Doing
This is a hands-on and strategic engineering role where you’ll be responsible for ensuring production stability across a highly dynamic microservices architecture hosted in Azure. You’ll have end-to-end ownership over reliability tooling, incident response, and system performance—working across teams to scale a truly enterprise-grade platform.
Key Responsibilities:
* Leading on production resilience, observability, and incident frameworks.
* Building SLIs/SLOs and advocating for best practices in platform reliability.
* Scaling infrastructure in Azure (AKS, App Insights, Key Vault, etc.).
* Automating recovery, scaling, and monitoring across distributed systems.
* Collaborating with cross-functional teams to align platform strategy and reliability goals.
What You’ll Bring:
* 8+ years in software engineering or SRE/production infrastructure roles.
* Strong experience with Java (Spring) and cloud platforms (ideally Azure).
* Proven track record in building and maintaining mission-critical systems.
* Deep understanding of Kubernetes, observability tooling (Grafana, Prometheus, ELK, etc.), and Infrastructure as Code (Terraform, Bicep).
* Ability to lead technical conversations across Engineering and Product.
Bonus points if you bring:
* Experience in fintech, crypto, or regulated digital infrastructure
* RDBMS performance tuning (MS SQL)
* Knowledge of SLAs/SLOs/chaos engineering and platform risk management