Data engineer

Hinxton

EMBL

Posted: 11 April

Offer description

Your role

We seek a skilled and motivated Data Engineer to join our dynamic team. As a Data Engineer, you will play a crucial role in optimising and enhancing our data pipelines, ensuring efficient data processing, storage and retrieval. You will work closely with cross-functional teams to analyse requirements, propose new data pipeline architectures, and implement solutions to improve performance and scalability.

The tasks for this post include the following:

1. Analyse existing data pipelines and identify areas for improvement, optimisation, and scalability.
2. Work closely with Bioinformaticians and annotators to integrate data pipelines with existing systems and applications.
3. Monitor data pipeline performance, troubleshoot issues, and implement solutions to ensure reliability and efficiency.
4. Stay current with industry trends and best practices in data engineering and recommend new technologies or tools to enhance data infrastructure.
5. Document data pipelines, processes, and workflows for internal reference and knowledge sharing.

The successful candidate will report directly to the PDBe Technical Project Lead as a Technical Officer. This post is an opportunity for the right person to bring IT skills and innovative ideas to help manage the growing volume of structural biology data in the PDB and ensure that the PDBe, PDBe-KB and AFDB services remain sustainable.

Closing date: 12 May 2024

Contract duration: 1 year 8 months (estimated 01/07/2024-31/01/2026)

Grading: Grade 5 or 6 depending on qualification and experience (monthly salary starting at £3,090 or £3,456 after tax) + benefits

Reference number: EBI02234

You have

1. MSc in computer science, IT or a related field, or in bioinformatics with demonstrated IT expertise
2. Expertise in data modelling and advanced SQL
3. Proficiency in Python programming
4. Proficiency in ETL (Extract, Transform, Load) processes and tools for large-scale data processing
5. Strong understanding of relational databases (Oracle, PostgreSQL) and experience optimising database performance
6. Proficiency in data warehousing (Redshift, BigQuery)
7. Strong communication and collaboration skills, with the ability to work effectively in a team environment
8. Proficiency in oral and written English

You might also have

1. PhD in computer science, IT or a related field, or in bioinformatics with demonstrated IT expertise
2. Experience in big data technologies and frameworks, such as Apache Spark, Hadoop or similar platforms
3. Hands-on experience with CI/CD (GitLab CI/GitHub Actions)
4. Familiarity with Java
5. Familiarity with Google Cloud Platform or AWS
6. Familiarity with data modelling techniques for AI (Artificial Intelligence) and ML (Machine Learning) applications
7. Familiarity with Neo4j or other graph databases
8. Familiarity with data visualisation (Tableau, Power BI)
9. Knowledge of, or affinity with, structural biology and bioinformatics
10. Experience working in international teams
