Technical Resilience Specialist
Job Description
The Technical Resilience Specialist will play a crucial role in enhancing the organisation's resilience posture, focusing primarily on AWS cloud environments. This role combines hands-on technical analysis, project management, and business support to ensure that systems are robust, compliant, and recoverable. The successful candidate will lead resilience initiatives, support testing and compliance activities, and provide expert guidance to both technical and business stakeholders.
Responsibilities
1. Evaluate AWS-based systems and infrastructure for resilience gaps, risks, and compliance with internal standards. Conduct regular reviews and recommend improvements to enhance system resilience.
2. Plan, coordinate, and monitor resilience tests, such as disaster recovery, backup restoration, failover, and cyber-attack scenarios in AWS environments.
3. Lead and support projects focused on Technical Resilience enablement, including the implementation of new resilience controls and processes. Track project progress, manage risks, and ensure timely delivery of project milestones.
4. Provide clear guidance and support to technical and business teams on Technical Resilience policies, standards, and best practices. Develop and deliver training, documentation, and communications to promote awareness and adoption of resilience guidelines.
5. Track resilience metrics and costs, incidents, and compliance status across AWS environments. Prepare regular reports for management, auditors, and other stakeholders.
6. Identify opportunities to improve resilience maturity and support the implementation of enhancements.
Essential Skills
7. Experience in technical resilience, disaster recovery, or related roles.
8. Good expertise with AWS services such as EC2, S3, RDS, IAM, and VPC.
9. Experience running technical projects and delivering outcomes in cloud environments.
10. Excellent communication skills, with the ability to explain technical concepts and provide clear guidance to non-technical stakeholders.
11. Strong analytical and problem-solving abilities.
12. Experience with resilience testing, incident response, and compliance reporting.
Additional Skills & Qualifications
13. AWS certification.
14. Familiarity with ISO27001, NIST, or other resilience/security frameworks.
15. Knowledge of automation, infrastructure-as-code tools, and chaos engineering.
Location
Bracknell, UK
Trading as TEKsystems. Allegis Group Limited, Maxis 2, Western Road, Bracknell, RG12 1RT, United Kingdom. No. 2876353. Allegis Group Limited operates as an Employment Business and Employment Agency as set out in the Conduct of Employment Agencies and Employment Businesses Regulations 2003. TEKsystems is a company within the Allegis Group network of companies (collectively referred to as "Allegis Group"). Aerotek, Aston Carter, EASi, Talentis Solutions, TEKsystems, Stamford Consultants and The Stamford Group are Allegis Group brands.