Menlo Security provides secure connectivity for enterprises worldwide.
About The Role
The Platform Infrastructure Engineer builds and operates Menlo’s Infrastructure Platform, enabling secure global connectivity.
Responsibilities
* Design, deploy, and maintain VM and Kubernetes infrastructure on GCP and AWS across dozens of clusters spanning development, staging, and production environments in multiple regions.
* Coordinate with peers on your own team and across other teams to ensure that your work solves the problems we need addressed.
* Build and maintain Infrastructure as Code (IaC) using Terraform modules, managing resources through Spacelift or equivalent Terraform Automation and Collaboration Software (TACOS).
* Implement and manage workflows with sophisticated multi-layer configuration management.
* Build and maintain comprehensive observability solutions using Grafana Cloud, Prometheus/Mimir, and OTel collectors. Design Grafana dashboards, configure alerting rules, and ensure visibility across all platform components.
* Manage certificate lifecycle, DNS automation, ingress controllers, and service mesh networking with Cilium.
* Partner with Engineering, Product, Compliance, and Security teams to design resilient, scalable systems. Consult on capacity planning, disaster recovery, and architectural decisions for cloud‑native applications.
* Identify and eliminate toil through automation. Write scripts, develop tools, and build CI/CD pipelines to improve operational efficiency and reduce manual work.
* Participate in a 24x7 on‑call rotation as part of a globally distributed team, responding to incidents and driving post‑incident reviews.
Requirements
* Bachelor's degree in Computer Science, a related technical field, or equivalent practical experience.
* Proficiency in common programming and scripting languages (Python, Bash, Go).
* Understanding of network topologies, communication protocols (TCP/IP, HTTP/S, UDP, TLS), and enterprise‑grade connectivity solutions.
* Kubernetes expertise including cluster administration, RBAC, networking, workload management, and troubleshooting across production environments.
* Proven experience with Terraform for infrastructure provisioning and management.
* Knowledge of Google Cloud Platform services including GKE, VPC networking, Cloud DNS, Artifact Registry, Secret Manager, IAM, Gemini Code Assist, and Workload Identity.
* Experience with GitOps methodologies and tools.
* Clear understanding of how to use LLM code‑assist tools to build software effectively.
All qualified applicants will receive consideration for employment without regard to race, sex, color, religion, sexual orientation, gender identity, national origin, protected veteran status, or disability.