Job Description

The Cloud Operations teams are responsible and accountable for Sage's online Product portfolio. The teams ensure our Products and Services are constantly available for our customers to use to run their businesses.

A Cloud Operations Engineer helps bridge the gap between Development and Operations by deploying, administering, and improving infrastructure, as well as enhancing automation using cloud technologies. They work within their product delivery teams to ensure products and services remain available, secure, and performant. As part of a 24/7 operation, the role includes participation in an on-call rota to troubleshoot and resolve service incidents.

Guided by Security, Architecture, and Automation subject matter experts, the Cloud Operations Engineer builds and operates infrastructure, develops operational processes, implements infrastructure as code, automates manual tasks, and strengthens operational capabilities. Responsibilities span disaster recovery, high availability, scalable solutions, infrastructure monitoring, and continuous deployment.
* Take personal accountability for assigned products to ensure they remain available, secure, and performant.
* Lead during high-pressure incident scenarios, balancing technical resolution with clear stakeholder communication.
* Participate in our 24/7 on-call rota.
* Design and implement solutions that mitigate risk, improve customer experience, and enhance operational efficiency.
* Collaborate closely with product delivery teams to ensure designs, standards, and quality expectations are met.
* Operate in a true DevOps environment, understanding the development team's world (source control, builds, backlogs, sprints, Agile) while bringing them closer to operational concerns (infrastructure, OS, security, scripting, monitoring).
* Excellent communication skills, able to explain complex technical issues in simple terms to non-technical audiences and business stakeholders.
* Proficiency in cloud computing, particularly AWS, with experience in distributed systems, containers, and serverless technologies.
* Operating system expertise across Windows and Linux.
* Strong analytical, debugging, and problem-solving skills.
* Experience proactively monitoring systems and leading resolution during high-pressure incidents.
* Scripting skills for automation (PowerShell, Bash, Ruby, Python).
* Ability to work across multiple teams, manage competing priorities, and multitask effectively.
* Proficiency in English, both verbal and written, with the ability to build relationships and influence.
* Exposure to SRE principles, including Service Levels, Error Budgets, and reducing operational toil.
* Experience with Infrastructure as Code (CloudFormation, CDK) and CI/CD pipelines (Azure DevOps, GitHub Actions).
Advert

Working at Sage means you're supporting millions of small and medium-sized businesses globally with technology that helps them work faster and smarter. We leverage the future of AI, meaning business owners spend less time doing routine tasks, like entering invoices and generating reports, and more time pursuing their ambitions.