What you’ll be doing
1. Executes the implementation of new software development life cycle automation tools, frameworks and code pipelines (continuous integration/continuous delivery pipelines), applying best practices with a focus on the re-use of application code; demonstrates consistent software delivery practices and produces continuous integration/continuous delivery platform solutions using Amazon Web Services cloud, infrastructure as code (IaC), GitOps and container technologies
2. Coordinates a diverse team and creates the initial test schedule to deliver all aspects of testing to time, budget and quality targets, producing outlines of solutions and defining the depth of testing required
3. Executes the implementation of automation technologies to ensure repeatability, eliminate toil, and reduce the mean time to detect, resolve and repair services
4. Proactively identifies and manages risk through regular assessment and diligent execution of controls and mitigations, raising any concerns early
5. Leads scale testing to measure, tune and optimise system performance
6. Executes metric/monitoring analysis that creates stability, security, and performance improvements
7. Designs, analyses, develops and troubleshoots highly-distributed large-scale production systems spanning on-prem and cloud-based hosting
8. Executes approaches that scale systems sustainably through mechanisms like automation and evolves systems by pushing for changes that improve reliability and velocity
9. Writes and delivers infrastructure as code software to improve the availability, scalability, latency, and efficiency of services
10. Implements robust monitoring and alerting systems and performs root cause analysis and post-mortems with an eye towards future prevention
11. Inspects queue and support processing to ensure early warning of support issues
12. Executes retrospective and preventive actions after each high severity production incident
13. Analyses complex systems from a reliability and resilience perspective and identifies sources of instability in distributed systems
14. Champions, continuously develops and shares knowledge with the team on emerging trends and changes in site reliability engineering best practices and industry standards
15. Mentors other site reliability engineers, helping to improve the team's abilities by acting as a technical resource
The skills you’ll need
Troubleshooting
Infrastructure Configuration
Service Assurance
Application Performance Monitoring & Alerting
Computer Networking
System Administration
Programming/Scripting
Artificial Intelligence Operations (AIOps)
Server Architecture
Cloud Computing
Continuous Integration/Continuous Deployment Automation & Orchestration
Systems Integration
Project/Programme Management
Incident Management
Decision Making
Growth Mindset
Inclusive Leadership
Our leadership standards
Looking in:
Leading inclusively and safely
I inspire and build trust through self-awareness, honesty and integrity.
Owning outcomes
I take the right decisions that benefit the broader organisation.
Looking out:
Delivering for the customer
I execute brilliantly on clear priorities that add value to our customers and the wider business.
Commercially savvy
I demonstrate strong commercial focus, bringing an external perspective to decision-making.
Looking to the future:
Growth mindset
I experiment and identify opportunities for growth for both myself and the organisation.
Building for the future
I build diverse future-ready teams where all individuals can be at their best.
With over 175 years of heritage, BT is now the flagship business brand of BT Group. We’ve brought together our best people and capabilities into a B2B powerhouse serving 1.2 million business customers internationally. We’re a global leader for secure connectivity and collaboration platforms for businesses of all shapes and sizes, from big household names and government departments, right through to sole traders and new start-ups. But it’s not just the technology that matters, it’s what it can do to help them build stronger, smarter, more secure businesses.
We value diversity and inclusion and believe in making a positive impact. We connect for good by championing digital inclusion and equipping people, businesses, and communities with digital skills to thrive. As a member of our team, you will be part of an organisation that celebrates difference, fosters innovation and provides you with opportunities to be your best. With millions of businesses relying on us daily, joining BT means you can be part of a diverse and multi-skilled team that makes a significant impact on society.
A FEW POINTS TO NOTE:
Although these roles are listed as full-time, if you’re in a job share partnership, work reduced hours, or work flexibly in any other way, please still get in touch. We will also offer reasonable adjustments for the selection process if required, so please do not hesitate to inform us.
DON'T MEET EVERY SINGLE REQUIREMENT?
Studies have shown that women and people who are disabled, LGBTQ+, neurodiverse or from ethnic minority backgrounds are less likely to apply for jobs unless they meet every single qualification and criterion. We're committed to building a diverse, inclusive, and authentic workplace where everyone can be their best, so if you're excited about this role but your past experience doesn't align perfectly with every requirement in the job description, please apply anyway - you may just be the right candidate for this or other roles in our wider team.