Key Responsibilities:
1. Cloud and Infrastructure Projects: Design and implement common deployment, security, operational and cost accounting frameworks for teams working in AWS across the organisation Lead migration projects to the cloud Overhaul and continuously improve infrastructure to ensure scalability, security, reliability and cost-efficiency Develop automated deployment and maintenance methods in collaboration with development teams Liaise effectively with stakeholders across the organisation to get buy-in to changes and improvements
2. Security and Best Practices: Advocate for and implement best security practices within the system engineering and development processes Suggest and implement code or tool enhancements with a focus on security
3. Monitoring and Operations Build and maintain monitoring and alerting using Grafana Cloud Ensure documentation is complete and up-to-date
4. Change, Incident, and Problem Management Investigate and resolve complex problems Represent proposed changes at CAB meetings Liaise with stakeholders to agree on appropriate courses of action Handle escalated user tickets
5. System Administration and Automation: Manage Linux systems (Ubuntu) both on-premises and in the cloud, using automation wherever possible
What We’re Looking For:
6. Essential Skills and Experience: AWS architecture, design and implementation Terraform for developing and managing complex infrastructure as code Strong communication and organisational skills for coordinating with stakeholders, users, other internal teams and suppliers Monitoring technologies (e.g. Grafana) Comfortable working in both agile and ITIL environments Experience with AWS networking and network interoperation between AWS and on-premises systems Linux systems administration (Ubuntu primarily) Strong scripting (e.g. Python, Bash) Containerization and orchestration technologies (Docker & Kubernetes) DevOps tools and application administration (GitLab, Jira & Confluence, Artifactory, Vault or similar tools) Strong general IT skills in system administration, networking and security
7. Preferred Skills and Experience Ansible for automation and orchestration VMware vSphere and vCenter Experience supporting IT in varied environments such as manufacturing and logistics, software development and quality assurance