Role purposeThis role is responsible for developing industrialised optimisation and machine learning models as part of a full-stack product squad that delivers operations decision-support software.
Contract – 12 months (high potential to extend further)Location – HeathrowHybrid – 2 to 3 days onsitePay – Flexible daily rate (inside IR35)Scope
1. As a key member of a product squad and reporting to the Lead Product Data Scientist, a Data Scientist will develop data pipelines, machine learning models, andplex optimization models in the ODS software product suite.
2. The Data Scientist is in charge of modelling and robust implementation of features contributing to an operations decision-support product.
3. In developing a product’s core algorithm, the full-stack Data Scientist role will ensure that their features integrate seamlessly into the product’s technical stack (data ingestion, user interface, orchestration) as well as the business process and use case (, to maximise impact and value.
Accountabilities
4. The Data Scientist has full-stack accountabilities across the full value chain of building an industrialised data-science software product:
5. Understanding a business problem and itsponent processes end to end, and identifying opportunities to make decisions more optimally leveraging decision-support tooling
6. Efficiently conducting analyses and visualisations to identify valuable opportunities for decision-support and to determine trade-offs between different potential feature implementations
7. Prototyping advanced machine learning and optimization models to prove the value of a use case and approach (in Python)
8. Delivering features to industrialise machine learning and optimization models in Python using best-practice software principles (, strict typing, classes, testing)
9. Build automated, robust data cleaning pipelines that follow software best-practices (in Python)
10. Implementing integrations between the core algorithm (machine-learning or optimization) and a workflow orchestration paradigm such as Dagster
11. Implementing software in a cloud-based deployment pipeline with Continuous Integration / Continuous Deployment (CI/CD) principles
12. Building logging, error handling, and automated tests (, unit tests, regression tests) to ensure the robustness of operationally critical decision-support products
13. Deliver features to harden an algorithm against edge cases in the operation and in data
14. Conduct analysis to quantify the adoption and value-capture from a decision-support product
15. Engage with business stakeholders to collect requirements and get feedback
16. Contribute to conversations on feature prioritisation and roadmap, with an understanding of the trade-off between speed vs. long-term value
17. Understand and integrate the product into existing business processes, and contribute to the development and adoption of new business processes leveraging a decision-support product.
18. The Data Scientist is also accountable for ways of working fit for an Agile cross-functional development squad, including:
19. Using Git-versioning best practices for version control
20. Contributing and reviewing pull-requests and product / technical documentation
21. Giving input on prioritisation, team process improvements, optimising technology choices
22. Working independently and giving predictability on delivery timelines
Skills/capabilities
23. Strong knowledge of either machine learning and optimization techniques, incl. supervised (regression, tree methods, etc.), unsupervised (clustering) learning, and operations research (linear, mixed integer programming, heuristics)
24. Fluent in Python (required) and other programming languages (preferred)with strong skills in applying DS, ML, and OR packages (scikit-learn, pandas, numpy, gurobietc.) to solve real-life problems and visualise the oues ( seaborn)
25. Proficient in working with cloud platforms (AWS preferred), code versioning (Git), experiment tracking ( MLflow)
26. Experience with cloud-based ML tools ( SageMaker), data and model versioning ( DVC), CI/CD ( GitHub Actions), workflow orchestration ( Airflow/Dagster) and containerised solutions ( Docker, ECS) nice to have
27. Experience in code testing (unit, integration, end-to-end tests)
28. Strong data engineering skills in SQL and Python
29. Proficient in use of Microsoft Office, including advanced Excel and PowerPoint Skills
30. Advanced analytical skills, including the ability to apply a range of data science and analytic techniques to quickly generate accurate business insights
31. Understanding of the trade-offs of different data science, machine learning, and optimization approaches, and ability to intelligently select which are the best candidates to solve a particular business problem
32. Able to structure business and technical problems, identify trade-offs, and propose solutions
33. Managing priorities and timelines to deliver features in a timely manner that meets business requirements
34. Collaborative team-working, giving and receiving feedback
Qualifications/experience
35. Master’s degree or greater in data science, ML, or operational research, or 2+ years of highly relevant industry experience (required)
36. Extensive working on production ML or optimization software products at scale (required)
37. Experience in developing industrialised software, especially data science or machine learning software products (preferred)
38. Experience in relevant business domains (transportation, airlines, operations, network problems) (preferred)