Job Overview
AWS Utility Computing (UC) provides product innovations – from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (Iot), Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services.
Key Responsibilities
* Work proactively to solve potential problems and inefficiencies. Communicate clearly and collaborate with others to deliver results with minimal supervision.
* Participate in 24/7 on‑call rotation to troubleshoot high severity issues.
* Analyze dashboards and investigate metrics with the vision for improvements.
* Troubleshoot and diagnose problems and work on solutions.
* Create and maintain Standard Operating Procedures (SOPs) and runbooks for documentation.
* Discuss radical new approaches to automate operational issues, assess risks and develop creative solutions.
Key Details
You will need to be a UK national and able to obtain and maintain a UK Government Security Clearance. Further details found here: https://www.gov.uk/government/publications/united-kingdom-security-vetting-clearance-levels
Day in the Life
On a typical day engineers might dive deep into understanding the root cause of a customer issue, investigate why a metric is trending the wrong way and consult with senior engineers at Amazon. They own their services and believe in making out‑of‑hours support as painless as possible. To achieve this, they implement Operational Excellence best practices and strive to automate manual processes. They utilise Linux skills to troubleshoot, innovate fixes and workarounds, keep software up‑to‑date and provide data and metrics that help manage the capacity and efficiency of services. They draw on networking knowledge to identify and troubleshoot network connectivity issues. They communicate clearly, collaborate with others, are self‑starters, and are comfortable dealing with ambiguity and change. They are customer‑obsessed, always looking to understand customer pain points and find resolutions quickly and completely.
Basic Qualifications
* Knowledge of networking fundamentals
* Experience working in a 24/7 production environment
* Experience in Linux systems administration and/or development
* Experience working in at least two of these languages: Python, Java, Perl, PHP, Ruby or Bash/Shell
Preferred Qualifications
* Knowledge of configuration management systems, such as Puppet, Chef, Ansible, or related systems
* Experience in site reliability engineering (SRE), systems engineering, systems administration, DevOps, security administration, or network administration
* Experience in network capture and systems troubleshooting
* Experience building scripts, tooling, and automation for large‑scale computing environments
EEO Statement
Amazon is an equal opportunities employer. We believe passionately that a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice (https://www.amazon.jobs/en/privacy_page) to know more about how we collect, use and transfer the personal data of our candidates. Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information.
Contact
Company: AWS EMEA SARL (UK Branch) Job ID: A10402401
#J-18808-Ljbffr