About Instantly.ai
Instantly.ai is a leading AI-driven sales outreach and lead intelligence platform, powering over 35K B2B companies.
The Role
We’re hiring a Data Engineer to lead the backend infrastructure of SuperSearch, our B2B lead intelligence platform. You will own and maintain large-scale data pipelines using AWS Glue (PySpark), S3, Elasticsearch, and MongoDB. This role is central to improving how users search, filter, and find leads, applying everything from algorithm tuning to semantic enhancements with LLMs and embeddings. You’ll have full ownership of a critical system in a fast-moving and high-growth startup environment.
Responsibilities
* Own and maintain our data processing pipelines using AWS Glue (PySpark) and S3
* Work with large-scale datasets stored in Elasticsearch and MongoDB
* Build robust data transformation, cleaning, and normalization workflows
* Improve the performance and relevance of our search system through algorithmic tuning and semantic enhancements (e.g. LLMs, DeepL, embeddings)
Must-Have Skills
* Solid experience with data engineering on AWS (Glue, S3)
* Strong knowledge of Elasticsearch (query design, aggregations, performance tuning...)
* Proficiency in Python, especially in data wrangling (PySpark, Pandas)
* Experience with data quality, schema evolution, and operational monitoring
* Familiarity with LLMs, embeddings, or search ranking improvements is a plus
* Backend development skills with Node.js or general JavaScript knowledge would be advantageous
Why Instantly.ai?
* High-growth environment: join us on the path to unicorn status.
* Impact & autonomy: you’ll own a marquee area critical to our success. No slow processes of big enterprise companies. We move fast like a young lean startup.
* Collaborative culture: work with top developers and seasoned operators.
Apply now and we'll be in touch shortly