Job Description:
We are looking for a highly experienced AWS Data Platform Lead with 10+ years of hands-on experience in data engineering/platform design and strong expertise in building large‑scale, secure data platforms on AWS. The candidate should be proficient in Python (primary language) and hands‑on with key AWS data services including Lake Formation, Glue, Athena, S3, IAM, and KMS. Extensive experience with Terraform, GitHub Actions, and Spark/PySpark is essential.
Key Responsibilities:
* Architect end‑to‑end AWS data platforms (S3, Lake Formation, Glue, Athena, EMR/Spark).
* Design multi‑account data governance, security, and access models.
* Develop scalable ETL/ELT pipelines using Python & PySpark.
* Implement IaC using Terraform and CI/CD using GitHub Actions.
* Optimize data pipelines for performance, quality, and cost.
* Collaborate with security, data science, and analytics teams.
Must-Have Skills:
* 10+ years of experience, including 8+ years hands-on with AWS data and DevOps services.
* Strong Python development background.
* Expertise in Lake Formation, Glue, Athena, S3, IAM, KMS.
* Advanced Spark/PySpark experience.
* Strong Terraform and GitHub Actions CI/CD implementation skills.
* Experience with multi-account AWS architectures and governance.
Nice-to-Have:
* Open table formats (Apache Iceberg, Delta Lake, Apache Hudi), AWS DMS, EMR tuning.
* Data quality & lineage tools (e.g., DataHub).
* Exposure to regulated industries (Banking/Finance).