The Company
Glassbox's mission is to reveal the insights that empower organizations to deliver exceptional digital customer experiences.
Glassbox is a leading force in shaping digital experiences. It helps organizations uncover digital issues, boost conversion rates, enhance accessibility, prevent fraud, and more.
Leveraging AI-driven customer intelligence, Glassbox enables enterprises to deliver secure, proactive, and preventative digital experiences. Its solutions are trusted by highly regulated organizations.
We are a growing global team and are looking for a DevOps Engineer.
What You Will Do
* Work with a diverse set of technologies, simplifying complex solutions.
* Collaborate closely with multiple teams across the organization.
* Be part of a team responsible for designing, optimizing, and maintaining a complex, high-scale production environment that handles massive traffic loads.
* Architect, deploy, and maintain robust and scalable cloud infrastructures on AWS and Azure.
* Develop and optimize CI/CD pipelines to support automated deployment, testing, and scaling across multiple environments.
* Implement and manage monitoring, logging, and alerting solutions across cloud platforms to ensure application health and performance.
* Provide advanced troubleshooting and resolution for infrastructure issues in production, development, and testing environments.
What You Will Need
* 4+ years of experience in a DevOps or related engineering role, with a strong background in AWS and cloud-native environments. Proven ability to design, manage, and maintain high-scale production systems, ensuring reliability, performance, and scalability.
* Deep expertise in cloud technologies and application security, with a strong focus on best practices for securing resilient, scalable, and cost-efficient architectures.
* Extensive experience with containerization and orchestration technologies, including Docker, Kubernetes, and Helm.
* Proficiency in automation tools such as Terraform, and hands‑on experience with CI/CD pipelines using tools like Jenkins.
* Strong knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, Loki) to ensure system health, optimize performance, and proactively detect issues.
* Proficiency in scripting (e.g., Bash, JavaScript on Node.js) to automate workflows and enhance operational efficiency.
* Excellent communication, collaboration, and documentation skills, with a proactive approach to problem‑solving.
* A passion for learning new technologies and tackling complex challenges in a fast‑paced environment.
* Hands‑on experience managing big data infrastructure, optimizing performance and scalability for data‑intensive applications.
An Advantage
* Proficiency in database technologies, including Cassandra, Elasticsearch, ClickHouse, and PostgreSQL HA.
* Expertise in Kafka cluster administration, including cross‑region replication and high‑availability configurations.
* Experience with MLOps workflows and infrastructure, including model deployment, monitoring, and scaling in production environments.