The Position
We are looking for a Senior Data Engineer with advanced expertise in Databricks to lead the development of scalable data solutions across in our asset performance management software, within our Digital Solutions business.
This role involves architecting complex data pipelines, mentoring junior engineers, and driving best practices in data engineering and cloud analytics. You will play a key role in shaping our data strategy which is the backbone of our software and enabling high-impact analytics and machine learning initiatives.
Accountabilities
1. Design and implement scalable, high-performance data pipelines.
2. Work with the lead cloud architect on the design of data lakehouse solutions leveraging Delta Lake and Unity Catalog.
3. Collaborate with cross-functional teams to define data requirements, governance standards, and integration strategies.
4. Champion data quality, lineage, and observability through automated testing, monitoring, and documentation.
5. Mentoring and guidance of junior data engineers. Using you passion for data engineering to foster a culture of technical excellence and continuous learning.
6. Driving the adoption of CI/CD and DevOps practices for data engineering workflows.
7. Stay ahead of emerging technologies and Databricks platform updates, evaluating their relevance and impact.
Knowledge
8. Deep understanding of distributed data processing, data lakehouse architecture, and cloud-native data platforms.
9. Optimization of data workflows for performance, reliability, and cost-efficiency on cloud platforms (particularly Azure but experience with AWS and/or GCP would be beneficial).
10. Strong knowledge of data modelling, warehousing, and governance principles.
11. Knowledge of data privacy and compliance standards (., GDPR, HIPAA).
12. Understanding of OLTP and OLAP and what scenarios to deploy them in.
13. Understanding of incremental processing patterns.
Skills
14. Strong proficiency in Python and SQL. Experience of working with Scala would be beneficial.
15. Proven ability to design and optimize large-scale ETL/ELT pipelines.
16. Building and managing orchestrations.
17. Excellent oral and written communication, both within the team and with our stakeholders.
Experience
18. 5+ years of experience in data engineering, with at least 2 years working extensively with Databricks and orchestrated pipelines” such as DBT, DLT, or workflows using jobs.
19. Experience with Delta Lake and Unity Catalog in production environments.
20. Experience with CI/CD tools and version control systems (., Git, GitHub Actions, Azure DevOps, Databricks Asset Bundles).
21. Experience with real-time data processing, both batch and streaming.
22. Experience of working on machine learning workflows and integration with data pipelines.
23. Experience leading data engineering projects with distributed teams, ideally in a cross functional environment.
Qualifications
24. Databricks Certified Data Engineer Professional or equivalent certification.