Overview
The region service Messaging and Streaming Team (MAST) is a customer-experience-oriented team looking for a self-motivated, talented engineer to solve complex problems and improve service support. MAST builds and supports messaging and streaming services such as Kinesis Data Streams, Simple Queue Service (SQS), Simple Notification Service (SNS), Amazon MQ, and Amazon Managed Service for Apache Flink (MSF). The systems engineer will create and drive opportunities to automate and simplify daily operations and scale organisational operations.
Key Responsibilities
* Work proactively to solve potential problems and inefficiencies. Communicate clearly and collaborate with others to deliver results with minimal supervision.
* Participate in 24/7 on-call rotation to troubleshoot high severity issues.
* Analyze dashboards and investigate metrics with an eye for improvements.
* Create and maintain Standard Operating Procedures (SOPs) and runbooks.
* Discuss radical new approaches to automating the resolution of operational issues, assess risks, and develop creative solutions.
* Develop strategies for resolving identified problems to prevent future occurrences.
* Assist others in the team.
About the Team
The team has the unique perspective of operating all of the messaging and streaming services, instead of just software components. This enables the team to drive cross‑organization initiatives to remove operational hurdles, optimize software delivery, and eliminate bottlenecks felt by all of AWS. On joining the MAST Engineering team, you will be paired with a peer buddy who will help you quickly come up to speed with the technology, tools, and business problems you’ll be solving.
Utility Computing (UC) is a division of AWS that provides product innovations—from foundational services such as Amazon S3 and Amazon EC2 to newly released products that distinguish AWS services. As a member of the UC organization, you will support the development and management of Compute, Database, Storage, Internet of Things (IoT), Platform, and Productivity Apps services in AWS, including support for customers requiring specialized security solutions.
Qualifications
* Experience writing scripts from scratch to automate manual tasks (BASH, Python, Perl, Ruby, or similar).
* Solid background in Linux with in‑depth troubleshooting skills and ability to solve complex technical problems.
* Knowledge of network fundamentals (DNS, UDP, TCP/IP, HTTP(S), routing, switching).
* Experience owning services that are secure, scalable, reliable, and efficient. Ability to identify operational and security risks and then resolve, mitigate, or escalate them.
* Bachelor’s Degree in Systems Engineering, Computer Science or related field, or equivalent work experience.
* Exposure to cloud computing concepts and design considerations.
* Experience in a 24x7 production environment.
* Experience with monitoring frameworks (e.g., CloudWatch, Datadog, Grafana, Elastic or similar).
Equal‑Opportunity Employer Statement
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify, and build. Protecting your privacy and the security of your data is a top priority for Amazon.