MUST HAVE: ACTIVE SC CLEARANCE (SECURITY CLEARED CANDIDATES ONLY)
About the Role
We are seeking an experienced Data Test Lead with strong expertise in Databricks and Talend to lead end-to-end data validation, quality assurance, and testing strategies across modern data platforms. The ideal candidate will have a solid background in data engineering testing, ETL validation, and cloud-based data ecosystems.
Key Responsibilities
* Lead and manage data testing initiatives across projects involving Databricks and Talend pipelines
* Design, develop, and implement robust data validation frameworks and testing strategies
* Perform ETL/ELT testing, including source-to-target validation, data reconciliation, and transformation testing
* Collaborate with data engineers, analysts, and business stakeholders to ensure data accuracy and integrity
* Develop and maintain automated test scripts for large-scale data pipelines
* Validate data workflows in cloud environments (AWS/Azure/GCP) integrated with Databricks
* Ensure data quality through profiling, anomaly detection, and governance practices
* Lead defect management, root cause analysis, and resolution tracking
* Mentor and guide junior testers and ensure adherence to testing best practices
Required Skills & Qualifications
* 7+ years of experience in data testing / ETL testing
* Hands-on experience with Databricks (Spark, PySpark, SQL)
* Strong expertise in Talend ETL tools
* Proficiency in SQL and data validation techniques
* Experience with data warehousing concepts (Snowflake, Redshift, BigQuery, etc.)
* Familiarity with test automation frameworks for data pipelines
* Strong understanding of data quality, governance, and lineage
* Experience working in Agile/Scrum environments
Good to Have
* Experience with Python or Scala for data validation
* Knowledge of CI/CD pipelines (Jenkins, Azure DevOps, GitHub Actions)
* Exposure to big data ecosystems and distributed processing
* Certifications in Databricks or cloud platforms