Social network you want to login/join with:
Enhance system reliability and performance by designing and implementing monitoring and observability tools. Leverage metrics, logs, and traces to gain deep insights into system behavior, improve developer experiences, and enable teams to troubleshoot and optimize applications effectively.
Responsibilities:
* Build observability tooling into the developer platform to support rapid software development.
* Guide platform users in adopting observability tools and best practices.
* Act as a subject matter expert on observability for colleagues.
* Develop and integrate observability best practices within the platform community.
Requirements:
* Experience with tools like Datadog (preferred), Honeycomb, or Prometheus/Grafana.
* Knowledge of Microsoft Azure and/or Amazon Web Services.
* Familiarity with DevOps concepts and practices.
Bonus Skills:
* Proficiency in a DevOps-friendly programming language, preferably Go.
* SRE experience.
Technologies We Use:
* Kubernetes (including AKS/EKS) and Docker
* Terraform
Bring your authentic self to work! Even if you don’t meet every requirement, we’d love to hear from you and explore if we’re a great fit.
#J-18808-Ljbffr