Job Description
About A1
A1 is a self-funded, independent AI group, focused on building a new consumer AI application with global impact. We’re assembling a small, elite team of ML, engineering and product builders who want to work on meaningful, high-impact problems.
About The Role
You will shape the core technical direction of A1 - model selection, training strategy, infrastructure, and long-term architecture. This is a founding technical role: your decisions will define our model stack, our data strategy, and our product capabilities for years ahead.
You won’t just fine-tune models - you’ll design systems: training pipelines, evaluation frameworks, inference stacks, and scalable deployment architectures. You will have full autonomy to experiment with frontier models (LLaMA, Mistral, Qwen, Claude-compatible architectures) and build new approaches where existing ones fall short.
What You’ll be Doing
1. Build end-to-end training pipelines: data → training → eval → inference
2. Design new model architectures or adapt open-source frontier models
3. Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation)
4. Architect scalable inference systems using vLLM / TensorRT-LLM / DeepSpeed
5. Build data systems for high-quality synthetic and real-world training data
<...