Job Description
SRE Engineer - Azure - New Role
A leading organisation in the IT consultancy space requires an SRE Engineer to join its growing team! Suitable candidates will have the following experience:
* Hands-on experience of a range of Azure services across all production and non-production environments - primarily the compute, networking, storage, database, costing, security and IAM, and management tools service groups.
* Solid understanding of DevOps/SRE, continuous delivery and related principles with demonstrable experience using complex CI/CD implementations and IAC tools. e.g., Terraform, Bicep and ARM.
* Solid understanding of modern industry practices and how they translate into architectures and working practices.
* The ability to build and maintain strong relationships with stakeholders at all levels, both internally and externally.
* Ability to understand, communicate and present effectively in business and technical contexts at all levels.
* Capable and confident with complexity and the unfamiliar.
* A passion for technology, constantly looking for innovative ways to improve.
* Strives for the highest level of quality and customer satisfaction, whilst focussing on results and outcomes.
* Desirable - Azure certification in relevant area or working towards.
Duties will include the following:
* Work as part of a team to ensure ongoing platform operation, security, reliability, efficiency and governance.
* Support the development and engineering teams in building and supporting highly available, scalable solutions.
* Accountable for delivery and support of production and non-production systems within the Azure ecosystem.
* Continuous improvement of operational processes and tooling for building, deploying, and managing systems.
* Develop and maintain automated tooling and processes for building, deployment and platform/configuration management.
* Maintain and improve CI/CD Pipelines.
* Performance tuning and capacity planning actions to align to client business needs.
* Working in, or supporting, implementation teams on Azure migration projects and managed service onboarding.
* Participate in incident management and post-mortem activities to identify and address root causes of outages and improve system reliability.
* Identify opportunities for improving system performance and reliability and develop plans for implementing improvements.
* Provide guidance and training to other team members on best practices and principles within SRE/DevOPS environments when building and operating reliable and scalable systems.
* Stay up to date with emerging technologies and industry trends related to system reliability and performance.
* Collaborate with development, operations, and other teams to ensure that systems meet reliability and availability goals.
Please send CV for full job description and an informal chat.