Data Engineer, Prime Video Content Analytics & Products
Job ID: 3142743 | Amazon Development Centre (London) Limited
Prime Video is a premium streaming service that offers customers a vast collection of TV shows and movies, all easy to find in one place. The catalogue includes thousands of popular titles, from Originals and exclusive content to live sports events, with the option to add channels and to rent or buy new releases.
The team works in a dynamic environment where innovating on behalf of our customers is at the heart of everything we do. If this sounds exciting to you, read on.
Key job responsibilities
* Build and optimize data pipelines to ingest and transform data from various sources, including traditional ETL pipelines and event data streams.
* Utilize data from disparate sources to build meaningful datasets for analytics and reporting, focusing on consolidating data from various Prime Video systems.
* Implement big‑data technologies (e.g., Redshift, EMR, Spark, SNS, SQS, Kinesis) to optimize processing of large datasets.
* Develop and maintain the team’s data platform, including infrastructure‑as‑code using AWS CDK.
* Work closely with business stakeholders to understand their needs and translate them into technical solutions.
* Analyze business processes, logical data models, and relational database implementations.
* Write high‑performing SQL queries.
* Design and implement automated data processing solutions and data quality controls.
* Collaborate with software engineers to support the data needs of products.
* Participate in on‑call rotations to support the team’s products and data pipelines.
* Optimize data processing and storage solutions to improve performance and reduce costs.
About the team
The Prime Video Content Analytics & Products team is dedicated to developing software and business intelligence products that streamline the process of planning, configuring, and tracking content launches at every stage of the title lifecycle, from the initial concept through production to post‑launch analysis.
Basic Qualifications
* Knowledge of professional software engineering and best practices for the full software development life cycle, including coding standards, software architectures, code reviews, source control management, continuous deployments, testing, and operational excellence.
* Experience working on and delivering end‑to‑end projects independently.
* Experience building and operating highly available, distributed systems for the extraction, ingestion, and processing of large data sets.
* Experience with data modeling, warehousing, and building ETL pipelines.
* Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS.
* Experience as a data engineer or related specialty (e.g., software engineer, business intelligence engineer, data scientist) with a track record of manipulating, processing, and extracting value from large datasets.
* Experience with SQL.
Preferred Qualifications
* Experience with AWS technologies such as Redshift, S3, AWS Glue, EMR, Kinesis, Firehose, Lambda, and IAM roles and permissions.
* Experience with non‑relational databases/data stores (object storage, document or key‑value stores, graph databases, column‑family databases).
* Experience with Apache Spark / Amazon EMR (Elastic MapReduce).
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
We value your privacy and the security of your data. Please consult our Privacy Notice (https://www.amazon.jobs/en/privacy_page) to learn more about how we collect, use, and transfer your personal data.