Job Description
Seeking a hands-on data platform architect/engineer to reverse-engineer a legacy solution (currently running on a VM) and migrate it to Microsoft Fabric. The goal is to stabilize critical data processes and lay the groundwork for a modular, scalable enterprise data platform.
Expected deliverable: a working Microsoft Fabric-based replication of the legacy solution, with supporting documentation and recommendations for future improvements.
Responsibilities
* Analyze and document the existing legacy system.
* Rebuild and optimize data pipelines in Microsoft Fabric using PySpark.
* Conduct forensic analysis of data transformations and dependencies.
* Collaborate with data architects, engineers, and analysts.
* Troubleshoot data quality and integration issues.
* Provide recommendations for future modularization and scalability.
Essential Skills
* Strong proficiency in PySpark and modern data platforms (e.g., Microsoft Fabric, Azure).
* Familiarity with Azure Data Lake, Synapse, Power BI, and Fabric dataflows.
* Excellent documentation and communication skills.
Desired Skills
* Proactive, analytical, and delivery-focused.
* Able to work independently and collaboratively under tight timelines.
Experience