🚀 Data Engineer | AI Startup | Real-Time Data & ML Pipelines
I’m currently partnered with a next-generation AI startup that’s building real-time intelligence systems capable of understanding and adapting to the world as it happens. They’re at the forefront of streaming data, continuous learning, and large-scale inference - and they’re now looking for a Data Engineer to help power that mission.
This isn’t a maintenance role. It’s a build-it-from-the-ground-up opportunity in a company that’s solving genuinely hard, technical problems around real-time data pipelines and ML infrastructure.
⚙️ The Opportunity
The team is looking for a Data Engineer who can design and scale real-time data architectures and machine learning pipelines that drive continuous model training and experimentation.
You’ll be working in close collaboration with ML engineers, product leads, and platform teams to make sure the company’s AI models are learning, and improving, in production.
🧠What You’ll Be Doing
* Architect and maintain real-time streaming data systems (Kafka, Kinesis, or Flink)
* Build robust feature pipelines using Airflow, Prefect, or Dagster
* Manage and optimise data storage solutions (Snowflake, BigQuery, Redshift, or Delta Lake)
* Automate and scale model training pipelines in close partnership with ML engineers
* Deploy, observe, and improve pipelines using Docker, Kubernetes, Terraform, or dbt
* Champion data reliability, scalability, and performance across the platform
đź§© The Tech Environment
You’ll likely be working with some combination of:
* Languages: Python, Scala, Go
* Streaming: Kafka / Flink / Spark Structured Streaming
* Workflow orchestration: Airflow / Prefect / Dagster
* Data storage & processing: Snowflake / Detabricks / BigQuery / Redshift
* Infrastructure: Docker / Kubernetes / Terraform / dbt
* Monitoring: Prometheus / Grafana / OpenTelemetry
* Cloud: AWS / GCP / Azure
🌍 What They’re Looking For
* Proven experience building scalable data pipelines in real-time or near real-time environments
* Strong background in data architecture, performance tuning, and distributed systems
* Comfort working end-to-end - from data ingestion to model-ready outputs
* An interest in AI, ML ops, and data-driven product development
* Someone who thrives in fast-moving, high-ownership start-up environments
đź’ˇ Why This Role?
* Join a well-funded AI startup with massive growth potential
* Be the first Data Engineer, shaping the entire data backbone
* Work with exceptionally talented engineers and researchers solving real-world AI problems
* Competitive package, plus meaningful equity
* A culture that values autonomy, creativity, and continuous learning
⚡ Interested?
If you’ve built or scaled data pipelines in production and want to be part of something genuinely cutting-edge, I’d love to chat.
Drop me a message here on LinkedIn or send over your details - happy to share more about the team, the vision, and what they’re building.