Site Reliability Engineer (SRE)
Hybrid working
Who are we?
Toyota Connected Europe aims to create a better world through connected mobility for all. We are a new company focused on integrating big data and a customer-centric approach into all aspects of the mobility experience to make it more personal, convenient, fun, and safe. We develop and enable technologies to delight and simplify the lives of our users and empower them to explore and utilize our services in innovative ways.
Joining us means being part of Toyota Connected Europe’s journey from the start, building our team and products. We are creating teams to inspire, innovate, and develop technologies and products used by millions across diverse backgrounds. We foster a start-up culture where every member acts like an owner, with immediate impact and visibility of their work.
About the role:
Our Cloud Engineering team plays a crucial role in Toyota Connected Europe’s success by providing the necessary tools and processes for global growth and scalability. We aim to enhance agility, effectiveness, and innovation, collaborating with product teams to align on technological and project goals.
As a Site Reliability Engineer, you will manage and improve complex cloud operations for one of the world’s largest automotive companies. You will work in a fast-paced, innovative environment, supporting Toyota Connected Europe teams to create next-generation connected vehicle solutions.
This environment values passion and potential; we are committed to developing talent into superstars.
What you will do:
* Ensure the availability, performance, reliability, and scalability of applications and services.
* Collaborate with Software Engineering to define infrastructure and deployment requirements.
* Proactively identify and resolve production issues, developing tools and scripts to improve operational efficiency.
* Conduct performance reviews and incident post-mortems to enhance system reliability.
* Participate in on-call rotations, applying problem-solving skills to resolve issues quickly.
* Develop and maintain monitoring and alerting systems.
* Improve CI/CD pipelines, automate tasks, and bolster system security.
* Document procedures and policies related to managed systems.
What are we looking for?
* Bachelor’s degree in Computer Science, Information Systems, or related field, or equivalent experience.
* Approximately 3 years of relevant experience managing high-traffic websites, applications, or critical services.
* Strong knowledge of cloud platforms like AWS, GCP, or Azure.
* Proficiency with Infrastructure as Code tools such as Terraform or CloudFormation.
* Experience with containerization (Docker) and orchestration (Kubernetes).
* Understanding of CI/CD pipelines.
* Familiarity with scripting languages like Python, Bash, or Go.
* Experience with monitoring tools such as Datadog, Prometheus, Grafana, or ELK stack.
* Strong problem-solving and communication skills, and the ability to work independently or in a team.
Additional notes
We value diverse backgrounds and perspectives. Even if you don’t meet every listed requirement, we encourage you to apply. We are committed to building and growing talent from all walks of life, believing that different experiences add value to our team.