* Strong expertise in implementing Site Reliability Engineering (SRE) principles.
* Advanced knowledge of establishing observability using tools Dynatrace & Datadog (primary skills).
* Proficiency in automation & scripting using Python &Ansible(primary skills).
* Strong experience with cloud platforms AWS &Azure (primary skills).
* Solid understanding of containerization and orchestration tools likeDockerandKubernetes.
* Proficiency in cloud native distributed systems & microservices architecture.
1. Exposure to AI/ML techniques for predictive analytics and automated problem resolution.
2. Familiarity with CI/CD pipelines & enabling automated release & deployment engineering solutions.
3. Good to have experience with chaos engineering tools likeGremlinorChaos Monkey and implementing automation frameworks for resilience tracking.
4. Ability to manage and prioritize multiple projects in a fast-paced environment.
5. Strong interpersonal and communication skills to...