Excellent opportunity for AWS Site Reliability Engineers to be part of our Cloud Infrastructure & Security services practice. Cognizant Infrastructure Services – Provides IT infrastructure & Cloud services for clients across industry verticals, including both Consulting/Professional and Managed Services, across Enterprise Computing, Cloud services, Security Services, DevOps, Data Centres, End User Computing, Service Desk, Network Services and Environment Management Services.
Role require 5 days work from office.
Key Responsibilities :
Design, code, test, and deliver software to automate manual operational work
Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
Collaborate with development team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes
Identify application patterns and analytics in support of better service level objectives
Design self-healing and resiliency patterns
Design automated software and product upgrades, change management, and release management solutions
Collaborate with senior technical leads and mentor junior engineers.
Design, deploy and manage AWS environments with a focus on automation, scalability and security.
Build and maintain Infrastructure as Code(IaC) using tools such as Terraform.
Monitor and optimize system performance, availability, and security, applying observability best practices.
Key Skills and Experience :
Must have Strong Hands-on exposure in AWS, Terraform, Python/Bash, CI/CD
Experience with Infrastructure as code(IaC) and CI/CD(Bitbucket, Jenkins,spinnaker).
Strong knowledge of containerization and orchestration, including Docker and Kubernetes.
Strong scripting skills in Python or Bash for automation.
Proven experience deploying and managing and deep understanding of AWS cloud infrastructure in secure environments.
Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks, VPC, subnets and security groups.)
Excellent troubleshooting, problem solving and debugging skills.
Bachelor's degree or equivalent experience in software engineering discipline
Nice to have skills - Basic knowledge of AI technologies and prompt engineering to leverage generative AI for enhancing productivity and automating tasks