Ready for a Challenge?
Welcome to Just Eat Takeaway.com, a leading global online food delivery platform where our mission is to empower everyday convenience. Whether it’s a Friday-night feast, a post-gym poke bowl, or a quick grocery run, our tech platform connects tens of millions of customers with hundreds of thousands of restaurant, grocery, and convenience partners worldwide.
About This Role
We are on the lookout for a Principal Engineer to spearhead the design, development, and evolution of our Observability Platform. This role is pivotal in ensuring our systems and engineering teams can scale rapidly. You will leverage Machine Learning (ML) and Artificial Intelligence (AI) to deliver advanced insights that proactively enhance system health, while driving down Mean Time to Detection (MTTD) and Mean Time to Resolution (MTTR). If you are a visionary technologist with deep expertise in observability, monitoring, and distributed systems, we want you to drive strategy, architecture, and execution for a world-class platform!
Your Key Ingredients for Success
Platform Leadership
* Architect, design, and implement a cutting-edge Observability Platform to support metrics, logs, traces, and events at scale.
* Integrate ML/AI-driven solutions to enhance anomaly detection, root cause analysis, and predictive insights.
* Lead the development and adoption of platform capabilities to ensure system health, reliability, and performance.
* Establish and evolve platform standards and best practices to align with the company’s engineering goals.
Strategic Initiatives
* Collaborate with engineering teams to define the observability strategy, ensuring alignment with business and operational objectives.
* Identify and integrate the latest observability technologies, including AI-based analytics, to improve system insights and developer productivity.
* Drive a platform-first mindset, ensuring observability is treated as a foundational capability across all services.
* Implement real-time insights and proactive monitoring powered by AI/ML to reduce detection and resolution times.
Operational Excellence
* Ensure the Observability Platform is highly available, performant, and secure across all environments.
* Optimize data collection, processing, and storage to balance performance with cost efficiency.
* Define SLAs, SLOs, and SLIs for observability services to support reliability engineering practices.
* Continuously improve MTTD and MTTR by leveraging advanced AI/ML models for predictive analysis and automated responses.
Mentorship and Collaboration
* Act as a mentor and technical leader for engineers, fostering a culture of learning, innovation, and excellence.
* Collaborate with stakeholders, including Site Reliability Engineering (SRE), infrastructure, and application teams, to gather requirements and deliver impactful solutions.
* Advocate for observability as a critical enabler of operational success across the organization.
What Will You Bring to the Table?
* Extensive Engineering Experience: Proven experience in building and scaling observability platforms in a cloud-native environment.
* Observability Expertise: Deep understanding of observability pillars (metrics, logs, traces) and related tools (e.g., Prometheus, Grafana, OpenTelemetry, Jaeger, Kibana Elastic Stack).
* AI/ML Proficiency: Hands-on experience integrating ML/AI models into observability systems to drive advanced insights, anomaly detection, and predictive analysis.
* Distributed Systems Knowledge: Strong expertise in designing scalable and reliable systems for high-throughput data collection and processing.
* Programming Skills: Proficiency in one or more languages (e.g., Go, Python, Java, Terraform, Pulumi) with a focus on building robust platforms.
* Cloud Proficiency: Hands-on experience with cloud platforms (e.g., AWS, GCP, Azure) and Infrastructure-as-Code tools (e.g., Terraform, Pulumi).
* Leadership and Mentorship: Experience leading and mentoring multicultural engineering teams, driving technical decisions, and delivering large-scale initiatives.
* Cost Optimization: Familiarity with strategies for managing the costs associated with observability data storage, processing, and analysis.
Desirable Qualifications:
* Expertise in applying AI/ML for proactive alerting, root cause analysis, and predictive scaling.
* Experience with service mesh technologies (e.g., Istio, Linkerd) and their observability implications.
* Contributions to open-source observability or ML/AI projects.
* Proficiency with container technologies (Docker, Kubernetes) and best practices configurations for observability and monitoring.
* Understanding of statistical analysis, data mining, and feature engineering techniques to extract meaningful insights from observability data.
At JET, This is on the Menu:
Our teams forge connections internally and collaborate with some of the best-known brands on the planet, giving us truly international impact in a dynamic environment. Fun, fast-paced, and supportive, the JET culture is about movement, growth, and celebrating every aspect of our JETers. Thanks to them, we stay one step ahead of the competition.
Inclusion, Diversity & Belonging
No matter who you are, what you look like, who you love, or where you are from, you can find your place at Just Eat Takeaway.com. We’re committed to creating an inclusive culture that encourages diversity of people and thinking, where all employees feel they truly belong and can bring their most colorful selves to work every day.
What Else is Cooking?
Want to know more about our JETers, culture, or company? Explore our career site where you can find people's stories, blogs, podcasts, and more JET morsels.