Join to apply for the Software Data Engineer (UK) role at Iambic Therapeutics
Get AI-powered advice on this job and more exclusive features.
Job Summary
In this role, you’ll play a pivotal part in building and optimizing data pipelines that transform large, multi-modal datasets into high-quality training inputs for cutting-edge AI models for drug discovery. You’ll help evolve our data pipeline and storage infrastructure to support faster, more reliable turnarounds for research and development of new models.
You’ll join a multidisciplinary team, collaborating closely with ML scientists, software developers and DevOps engineers to improve the performance and reliability of Python-based workflows. As a key contributor, you’ll participate in the design, testing, and maintenance of core software systems, conduct thoughtful code reviews, and champion engineering best practices—including version control, testing, and documentation.
This role is remote, with preference for candidates on the East Coast or UK.
Key Responsibilities
1. Design and improve data pipelines that process large, multi-modal datasets from a variety of internal and external sources into training datasets for AI models.
2. Evolve our data storage layer to support analytics, schema evolution, reproducibility, and efficient data access.
3. Collaborate with ML engineers to improve the performance and reliability of Python-based data processing workflows.
4. Collaborate on the creation, testing and maintenance of software systems.
5. Code review for pull requests in adjoining areas.
6. Maintenance of and mentorship in software best practices, including version control, testing and documentation.
7. Communicate work clearly in meetings and demos, tailored to the audience.
Qualifications
* Minimum of 8 years of related experience with a Bachelor’s degree; or 6 years and a Master’s degree; or a PhD with 3 years experience; or equivalent experience.
* Proven ability to design flexible, maintainable ETL systems.
* Experience with data pipeline orchestration tools such as Prefect, Airflow, Argo, Databricks, or Spark.
* Understanding of the ML model lifecycle; prior work with scientific or ML workflows is a plus.
* Hands-on experience with multi-terabyte scale data processing.
* Familiarity with AWS; Kubernetes experience is a bonus.
* Knowledge of data lake technologies such as Parquet, Iceberg, AWS Glue etc.
* Strong Python software engineering skills.
* Pragmatic mindset — able to evaluate tradeoffs and find solutions that empower ML researchers to move quickly.
* Background in bioinformatics or chemistry is a plus.
About Iambic Therapeutics
Founded in 2019 and headquartered in San Diego, California, Iambic Therapeutics is disrupting the therapeutics landscape with its unique AI-driven drug-discovery platform. The Iambic platform has been demonstrated to deliver high-quality, differentiated therapeutics to clinical stage with unprecedented speed and across multiple target classes and mechanisms of action. The team is advancing an internal pipeline of clinical assets to address urgent unmet patient needs. Learn more at iambic.ai.
Mission & Core Values
We are committed to diversity and inclusion, fostering an environment where talented individuals from varied backgrounds work together to discover therapeutics and create innovative technologies.
Pay and Benefits
We offer a competitive compensation package, pension contributions, and flexible holiday allowances. Our UK office provides a modern, collaborative environment in the centre of Bristol.
Additional Details
Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Information Technology
#J-18808-Ljbffr