Exp : 6 + Years
What youll do:
1. Cloud Infrastructure Management:
* Design, deploy, and manage cloud infrastructure on platforms such as AWS, Azure, or Google Cloud.
* Implement best practices for resource provisioning, configuration, and optimization to ensure performance and cost-efficiency.
2. High Availability and Disaster Recovery:
* Architect and implement solutions for high availability and disaster recovery to minimize downtime and ensure business continuity.
* Configure redundancy, failover mechanisms, and automated backups for critical systems and data.
3. Monitoring and Incident Response:
* Set up monitoring, alerting, and logging systems to proactively detect and respond to incidents and performance issues.
* Develop and maintain runbooks and playbooks for incident response and troubleshooting.
4. Performance Optimization:
* Identify performance bottlenecks and optimize cloud infrastructure and applications for improved efficiency and scalability.
* Conduct performance analysis, capacity planning, and tuning to meet performance targets and SLAs.
5. Security and Compliance:
* Implement security controls, policies, and procedures to protect cloud resources and data from unauthorized access and breaches.
* Ensure compliance with industry standards and regulations, conducting regular audits and assessments as needed.
6. Automation and Orchestration:
* Automate routine tasks, workflows, and infrastructure deployments using scripting languages and configuration management tools.
* Implement orchestration and automation frameworks to streamline operations and reduce manual intervention.
7. Collaboration and Knowledge Sharing:
* Collaborate with cross-functional teams to support development, testing, and deployment processes in cloud environments.
* Share knowledge and best practices with team members through documentation, training sessions, and mentorship.
8. Continuous Improvement:
* Drive continuous improvement in cloud operations through analysis, experimentation, and feedback.
* Identify opportunities for innovation, cost optimization, and process enhancements to drive business outcomes.
What you bring:
* Bachelors degree in computer science, Engineering, or related field (or equivalent experience).
* Several years of experience in cloud operations or infrastructure roles, with a focus on AWS, Azure, or Google Cloud.
* Proficiency in cloud infrastructure management tools and services, including compute, storage, networking, and security.
* Strong understanding of cloud architecture principles, design patterns, and best practices.
* Experience with monitoring and logging tools such as AWS CloudWatch, Azure Monitor, ELK stack, or Prometheus/Grafana.
* Knowledge of automation and orchestration tools such as Terraform, Ansible, Puppet, or Chef.
* Familiarity with containerization technologies (e.g., Docker, Kubernetes) and microservices architecture.
* Excellent problem-solving skills, with the ability to troubleshoot complex issues and implement effective solutions.
* Strong communication and collaboration skills, with the ability to work effectively in a cross-functional team environment.
Additional Preferred Skills:
* Certifications in cloud platforms (e.g., AWS Certified Solutions Architect, Microsoft Certified: Azure Solutions Architect Expert, Google Cloud Certified - Professional Cloud Architect).
* Experience with DevOps practices and CI/CD pipelines.
* Knowledge of database administration, network security, and compliance frameworks.
* Familiarity with serverless computing, edge computing, and hybrid cloud environments
Meet Your Team: As a Senior Dev - Operations Engineer, the candidate will be responsible for designing, implementing, and managing cloud infrastructure and services to ensure the availability, reliability, and scalability of mission-critical applications. You will work closely with development, security, and other teams to optimize cloud operations and drive innovation.