Cloud Support Engineers in the Data in Transit domains support customers who are running ETL workload or analyzing large amounts of data using AWS services. As a part of this team, you will be working on a plethora of services such as Glue (ETL service), Athena (interactive query service), Managed Workflows of Apache Airflow, etc.
Understanding of ETL (Extract, Transform, Load) Creation of ETL Pipelines to extract and ingest data into data lake/warehouse with simple to medium complexity Data transformations and troubleshooting ETL job issues.
Understanding of Linux and Networking concepts.
Excellent oral and written communication skills with multi-tasking ability.
Master’s degree in Information Science/Information Technology, Data Science, Computer Science, Engineering, Mathematics, Physics, or a related field OR Bachelor’s degree in the same with 1+ year of experience OR equivalent experience in a technical position.
Key job responsibilities
1. Intermediate expertise in ETL tools such as Talend, Informatica or similar.
2. Knowledge of data management fundamentals and data storage principles.
3. Advanced SQL and query performance tuning skills.
4. Experience integrating and managing large data sets from multiple sources.
5. Ability to read and understand Python and Scala code.
6. Understanding of distributed computing environments.
7. Proficient in Spark, Hive, and Presto.
8. Experience working with Docker.
9. Python, and shell scripting.
10. Customer service experience / strong customer focus.
11. Prior working experience with AWS - any or all of EC2, S3, EBS, Glue, Athena.
12. Experienced with Linux system monitoring and analysis (disk management, memory management, permissions, etc.).
13. Understanding of Networking concepts and protocols (DNS, TCP/IP, DHCP, HTTPS, etc.).
BASIC QUALIFICATIONS
- 2+ years of experience in big data/Hadoop with excellent knowledge of Hadoop architecture and administration and support.
- Be able to read Java code, and basic coding/scripting ability in Java, Perl, Ruby, C#, and/or PHP with Databases (MySQL, Oracle, NoSQL) experience.
- Good understanding of distributed computing environments and excellent Linux/Unix system administrator skills.
PREFERRED QUALIFICATIONS
- Proficient in Hadoop Map-Reduce and its Ecosystem (Zookeeper, HBASE, HDFS, Pig, Hive, Spark, etc).
- Good understanding of ETL principles and how to apply them within Hadoop.
- Prior working experience with AWS - any or all of EC2, S3, EBS, ELB, RDS, DynamoDB, EMR.
Amazon is an equal opportunities employer, and we value your passion to discover, invent, simplify and build. We welcome applications from all members of society irrespective of age, sex, disability, sexual orientation, race, religion or belief.
#J-18808-Ljbffr