At Plumerai, we make it easy and affordable for developers to add highly accurate AI to embedded devices, enabling them to create amazing new products. We combine our on-device Tiny AI software with cloud-based multimodal LLMs, providing People Detection, Video Search, Familiar Face Identification, AI Captions and more.
Major enterprises deploy our advanced computer vision models on millions of smart‑home cameras, and we’re rapidly expanding into commercial security, retail, assisted living and more. Our solution runs as much as possible on‑device to deliver low‑power, accurate, private AI products, surpassing even Google Nest in accuracy.
We build the most accurate and efficient AI solutions by vertically integrating every layer of the stack—from data collection and curation, through custom training software and model architectures, to multimodal LLMs, pre‑ and post‑processing, and the fastest inference engines. Our team possesses deep theoretical knowledge and a proven ability to ship fast and often.
We are based in London and Amsterdam, have recently raised funding to provide multiple years of runway, and our recurring revenue is growing rapidly.
Role Description
We are looking for a Senior Deep Learning Research Engineer to develop state‑of‑the‑art AI products. The role involves improving training algorithms, integrating multimodal LLMs, building the data pipeline, designing new model architectures, and deploying innovative ML approaches for embedded devices.
What You Will Be Doing
* Combine Tiny AI with multimodal LLMs to enable advanced AI features and optimize deployments for cloud and edge.
* Train and design highly accurate, ultra‑small computer‑vision models with memory footprints as low as 1 MB, enabling complex AI applications on low‑cost, low‑power hardware.
* Improve the data pipeline, model architectures, and training software, applying novel approaches and clever hacks to solve engineering problems.
* Deploy training jobs on our Kubernetes cluster, manage datasets with Snowflake and Dataflow, prototype demos with Streamlit, and use GPUs on GCP for new model training and auto‑labeling.
What You Need
* 5+ years of professional software engineering experience with proficiency in Python.
* Comfortable with frameworks such as PyTorch, TensorFlow, Keras, or JAX.
* Strong experience with computer vision and multimodal LLMs.
* Experience training neural networks that have moved into production.
Nice to Have
* Industry experience with efficient inference deployments (cloud or edge).
* Experience with Deep Reinforcement Learning.
We only consider applicants who are currently based in, or willing to relocate to, London or Amsterdam. We offer flexible working hours and encourage at least two fixed days per week in our offices.
What We Offer
* Competitive salary.
* Generous equity stake in the company.
* Relocation assistance.
* Choice of laptop and equipment.
* 25 days of paid vacation time in addition to bank holidays.
* Opportunity to attend top research conferences such as NeurIPS, ICML, and CVPR.
#J-18808-Ljbffr