Job Description
Join a breakthrough agentic AI startup with $8M in funding, led by two exceptional founders with Apple exits and DeepMind pedigree. Be the 8th engineer on a proven team revolutionising AI and accelerating innovation with cutting-edge systems and strategic partners. Shape the future, make an impact, and earn equity with real skin in the game.
Are you passionate about building scalable, fault-tolerant backend systems in distributed environments? Do you thrive on solving complex engineering challenges while optimizing deployment and observability?
What You’ll Do
* Design, develop, and operate large-scale distributed backend systems that power mission-critical applications.
* Implement microservices, APIs, and data pipelines using backend languages such as Go, Python, or Java.
* Lead deployment automation efforts through containerization (Docker) and orchestration tools such as Kubernetes.
* Build and maintain CI/CD pipelines enabling seamless, reliable software delivery.
* Architect and implement end-to-end observability with tools like Prometheus, Grafana, Jaeger, and OpenTelemetry, ensuring system transparency and rapid troubleshooting.
* Optimize system performance, scalability, and availability while enabling robust monitoring and alerting across distributed environments.
* Collaborate closely with cross-functional teams across engineering, product, and operations to deliver resilient, high-impact solutions.
What We’re Looking For
* Proven hands-on experience with distributed systems architecture, including deep understanding of reliability, consistency models, and concurrency.
* Expertise in backend development with at least one of Go, Python, Java, or Rust.
* Strong experience with containerization (Docker) and orchestration (Kubernetes, Nomad).
* Skilled in CI/CD tooling and infrastructure as code (Terraform, Helm, Ansible).
* Proficient with observability tools and frameworks (Prometheus, Grafana, Jaeger, OpenTelemetry, ELK stack).
* Ability to design scalable, observable systems that offer actionable insights and support rapid incident response.
* Solid cloud platform experience (AWS, GCP, Azure) and familiarity with automated deployment and rollback.
* Excellent problem-solving abilities, communication skills, and a proactive ownership mindset.
Why Join?
This is your chance to be at the forefront of building the backbone for the next wave of AI-driven, intelligent applications. You’ll work alongside talented engineers in a fast-paced, innovative environment where your contributions have direct and meaningful impact. Embrace autonomy, continuous learning, and the opportunity to solve some of the toughest challenges in distributed backend engineering.