Description
Key Responsibilities:
* Cloud and Infrastructure Projects:
o Design and implement common deployment, security, operational and cost accounting frameworks for teams working in AWS across the organisation
o Lead migration projects to the cloud
o Overhaul and continuously improve infrastructure to ensure scalability, security, reliability and cost-efficiency
o Develop automated deployment and maintenance methods in collaboration with development teams
o Liaise effectively with stakeholders across the organisation to get buy-in to changes and improvements
* Security and Best Practices:
o Advocate for and implement best security practices within the system engineering and development processes
o Suggest and implement code or tool enhancements with a focus on security
* Monitoring and Operations
o Build and maintain monitoring and alerting using Grafana Cloud
o Ensure documentation is complete and up-to-date
* Change, Incident, and Problem Management
o Investigate and resolve complex problems
o Represent proposed changes at CAB meetings
o Liaise with stakeholders to agree on appropriate courses of action
o Handle escalated user tickets
* System Administration and Automation:
o Manage Linux systems (Ubuntu) both on-premises and in the cloud, using automation wherever possible
What We're Looking For:
* Essential Skills and Experience:
o AWS architecture, design and implementation
o Terraform for developing and managing complex infrastructure as code
o Strong communication and organisational skills for coordinating with stakeholders, users, other internal teams and suppliers
o Monitoring technologies (e.g. Grafana)
o Comfortable working in both agile and ITIL environments
o Experience with AWS networking and network interoperation between AWS and on-premises systems
o Linux systems administration (Ubuntu primarily)
o Strong scripting (e.g. Python, Bash)
o Containerization and orchestration technologies (Docker & Kubernetes)
o DevOps tools and application administration (GitLab, Jira & Confluence, Artifactory, Vault or similar tools)
o Strong general IT skills in system administration, networking and security
* Preferred Skills and Experience
o Ansible for automation and orchestration
o VMware vSphere and vCenter
o Experience supporting IT in varied environments such as manufacturing and logistics, software development and quality assurance