Overview
We are seeking a Kubernetes Engineer with experience building resilient, scalable container based platforms in dynamic environments. You’ll play a central role in implementing our container orchestration strategy, optimizing Kubernetes clusters for reliability, performance, and developer velocity. This is a hands-on role where architectural insight meets operational excellence - ideal for an engineer who wants to leave their fingerprints on the core of how things run.
Key Responsibilities
Cluster Management: Design, build, and maintain Kubernetes clusters across development, staging, and production environments (EKS a strong plus).
Platform Engineering: Build tooling and abstractions that streamline application deployment and service discovery for developers.
Autoscaling & Performance: Optimize pod scheduling, resource allocation, and horizontal/vertical scaling for high-performance services.
Security & Policy Enforcement: Implement RBAC, network policies, and runtime security tools to enforce safe, compliant workloads.
Deployment Enablement: Enhance Helm charts, Kustomize workflows, and GitOps processes to support fast, safe, and reliable deployments.
Observability: Own the integration and tuning of observability stacks (e.g., Prometheus, Grafana, Loki) for visibility into cluster and application health.
Resilience & Recovery: Support fault-tolerant architectures, runbooks for failover, and high availability strategies.
Collaboration: Partner with developers, QA, and platform teams to evolve infrastructure-as-code and self-service systems that reduce friction and boost autonomy.
About You
Experience:
* 5+ years in Site Reliability, Infrastructure, or DevOps roles with a clear Kubernetes focus
* Deep experience running production workloads on Kubernetes (especially on AWS/EKS)
* Solid understanding of container lifecycle, networking, and orchestration internals
Technical Skills:
* Strong with tools like Helm, Kustomize, ArgoCD, or Flux
* Proficiency with Terraform or Pulumi for provisioning EKS and supporting infrastructure
* Competence in at least one scripting language (Python, Bash, or Go)
* Familiarity with service meshes (Istio, Linkerd) and Kubernetes-native security tools (OPA/Gatekeeper, Kyverno)
Mindset:
* You are proactive and enjoy taking ownership of infrastructure challenges
* You value automation and reducing manual toil wherever possible
* You are comfortable working in fast-paced, collaborative environments
* You communicate clearly and can explain complex infrastructure topics to different audiences
Nice to Have:
* Experience implementing security controls in regulated or compliance-focused environments
* Familiarity with service mesh architectures or advanced Kubernetes networking
* Background supporting multi-region or multi-cloud deployments