Jobs
My ads
My job alerts
Sign in
Find a job Career Tips Companies
Find

Software engineering manager, site reliability, cloud incident response

London
Google
Software engineering manager
Posted: 20 October
Offer description

Software Engineering Manager, Site Reliability, Cloud Incident Response

Join to apply for the Software Engineering Manager, Site Reliability, Cloud Incident Response role at Google


Minimum qualifications

* Bachelor's degree or equivalent practical experience.
* 8 years of experience with software development in one or more programming languages (e.g., Python, C, C++, Java, JavaScript).
* 3 years of experience in a technical leadership role; overseeing projects, with 2 years of experience in people management, supervision, or team leadership.
* Experience with cloud services, telemetry systems, and incident response.


Preferred qualifications

* Master's degree or PhD in Computer Science, or a related technical field.
* Experience as a cloud customer.


About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to customer needs, and a fast rate of improvement. Additionally SRE will keep an ever-watchful eye on system capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure, and eliminating work through automation. On the SRE team, you’ll manage complex challenges of scale unique to Google Cloud while leveraging coding, algorithms, complexity analysis, and large-scale system design. SRE’s culture of intellectual curiosity, problem solving, and openness is key to its success. Our organization brings together people from diverse backgrounds, experiences, and perspectives. We encourage collaboration, big thinking, and risk-taking in a blame-free environment. We promote self-direction to work on meaningful projects while learning and growing. The Cloud Incident Response Team supports responders, tooling, and outcomes for Google Cloud Platform (GCP) major incidents. The team collaborates across GCP products, customer-facing teams, and stakeholders to coordinate, mitigate, or resolve issues across all of GCP. Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.


Responsibilities

* Participate in on-call rotation supporting Critical Incident Response for GCP.
* Focus on high-quality customer outcomes and collaboration across GCP teams.
* Create Incident Management at Google (IMAG) training and processes for the incident management lifecycle in partnership with Cloud SRE Tech Leads and Cloud Support leadership team.
* Build systems and tooling to support the team, enhance visibility, improve issue detection, and facilitate communication with customers, stakeholders, and other customer-facing teams.
* Define and elevate risks in Cloud, reduce incident probabilities.

Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google's EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form.

#J-18808-Ljbffr

Apply
Create E-mail Alert
Job alert activated
Saved
Save
Similar job
Software engineering manager
London
Collaborative Search
Software engineering manager
Similar job
Software engineering manager - unified client experience (ucx)
London
Hargreaves Lansdown
Software engineering manager
£100,000 a year
Similar job
Senior software engineering manager
London
ION
Software engineering manager
€110,000 a year
See more jobs
Similar jobs
Google recruitment
Google jobs in London
It jobs in London
jobs London
jobs Greater London
jobs England
Home > Jobs > It jobs > Software engineering manager jobs > Software engineering manager jobs in London > Software Engineering Manager, Site Reliability, Cloud Incident Response

About Jobijoba

  • Career Advice
  • Company Reviews

Search for jobs

  • Jobs by Job Title
  • Jobs by Industry
  • Jobs by Company
  • Jobs by Location
  • Jobs by Keywords

Contact / Partnership

  • Contact
  • Publish your job offers on Jobijoba

Legal notice - Terms of Service - Privacy Policy - Manage my cookies - Accessibility: Not compliant

© 2025 Jobijoba - All Rights Reserved

Apply
Create E-mail Alert
Job alert activated
Saved
Save