Your role
We are seeking a highly skilled and motivated Research Software Engineer with expertise in Python and databases to join the AI knowledge management project for 3 years. We are open to applicants at various career stages, with particular interest in individuals who are eager to utilise cutting-edge technologies to address complex challenges in software development and informatics in the context of drug discovery. This position would be embedded within the Open Targets project team in the Saez-Rodriguez Group at the European Bioinformatics Institute and benefit from joint supervision with Sebastian Lobentanzer in the Saez-Rodriguez Group at Heidelberg University Hospital (UKHD).
You will work collaboratively across the project group with other experts in ML/AI, NLP, data integration and product delivery across ChEMBL, ePMC, Open Targets and Heidelberg University Hospital on a common goal to integrate cutting-edge technology for knowledge extraction, representation and interpretation to help drug discovery scientist. As a crucial member of the project team team, you will design, build, and operate cloud-first software that interfaces with large-scale biomedical data and drug discovery. You will contribute to developing informatics tools designed to support identifying and prioritising drug targets. Leveraging cutting-edge technologies and the expertise of our product owners and industry stakeholders, you will work in a dynamic, multidisciplinary, international environment to tackle a wide range of algorithmic and technical challenges.
As a Research Software Engineer you will be instrumental in extending our Open Targets Platform framework to include a modular knowledge graph platform. Your expertise will enhance the robustness and efficiency of our data processing and knowledge representation systems, contributing directly to our open science initiatives.
As part of a dynamic, collaborative, and international team, you will be responsible for:
1. Developing and implementing a knowledge graph framework on top of the existing data lake to improve our data sharing and analysis pipelines to assist drug discovery user stories.
2. Working closely with data provision and analysis engineers up- and downstream of the framework.
3. Working in an open-source environment, contributing to codebases and collaborating on agile development.
4. Writing clean, efficient, and readable Python code to support our internal pipelines and integrate Large Language Models.
Actively disseminating the outcomes of the project to the scientific community and stakeholders through well-crafted presentations and publications, and .
Closing date: 20 May 2024
5. Contract duration: 3 years
6. Grading: 5 or 6 depending on qualification and experience (monthly salary starting at £3,090 or £3,456 after tax) + benefits
7. Reference number: EBI02243
Related
You have
8. Advanced degree (MSc, PhD) in computer science, bioinformatics, software development, or a related field.
9. Strong skills in Python and familiarity with relevant frameworks and tools.
10. Experience with databases and their Python integrations.
11. Proficient in open-source development and version control (e.g., Git).
12. Passionate about collaborative, agile development in a fast-paced environment.
13. Experience in independent problem-solving and examples of resolving complex issues.
14. Fluency in written and spoken English.
15. Ability to effectively communicate ideas or issues and work with team members from multidisciplinary backgrounds.
You might also have