Key Responsibilities
* Air-Gapped Cluster Management: Architect, deploy, and maintain enterprise Kubernetes clusters in on-premises data centers and highly secure, air-gapped environments without direct internet connectivity.
* Platform Lifecycle: Manage the end-to-end lifecycle of Kubernetes distributions, including automated provisioning, day-2 operations, patching, upgrading, and backup/disaster recovery strategies.
* Infrastructure as Code (IaC): Develop and maintain automation scripts and configuration management tools to ensure predictable, repeatable deployments across bare‑metal or private cloud infrastructure.
* Security & Compliance: Implement rigorous security hardening protocols, network policies, access controls (RBAC), and continuous vulnerability monitoring tailored for sensitive, secure environments.
* Observability & Performance: Establish comprehensive logging, monitoring, and alerting frameworks to proactively identify bottlenecks and optimize cluster performance and resource utilization.
* Collaboration & Support: Work closely with application and infrastructure teams to streamline containerization strategies and ensure smooth application delivery pipelines.
Job Requirements
Must-Have Qualifications:
* Eligibility: Singapore Citizen (due to strict security clearance requirements for air-gapped systems).
* Experience: Minimum of 6+ years of professional experience in Platform Engineering, DevOps, or Infrastructure Automation.
* Core Technical Focus: Proven track record of deploying and managing Kubernetes within physical Data Centers and production Air-Gapped environments.
* Container Networking & Storage: Deep understanding of container network interfaces (CNI, e.g., Calico, Cilium), load balancers, and persistent storage solutions (CSI) in a non-public cloud context.
* Automation Skills: Proficiency with Infrastructure as Code (IaC) and automation tooling (e.g., Ansible, Terraform, or similar) specifically adapted for offline/on-premises deployment.
Nice-to-Have / Advantages:
* Hands‑on experience with Rancher Kubernetes Engine (RKE/RKE2) or Rancher Manager for multi-cluster management.
* Familiarity with Harvester (open‑source hyperconverged infrastructure solution) or alternative private‑cloud hypervisors.
* Relevant industry certifications such as CKA (Certified Kubernetes Administrator) or CKAD (Certified Kubernetes Application Developer).
#J-18808-Ljbffr