Job Description
Senior AI Data Engineer at Turnitin focuses on building AI and data pipelines, integrating AI across products, and supporting AI research.
Responsibilities
* Design, build, and operate scalable real-time data pipelines for Applied AI model training.
* Deploy and maintain robust data infrastructure using AI techniques and engineering best practices.
* Execute initiatives for collecting, normalizing, and storing data across multiple sources, including external LLM providers.
* Collaborate with AI R&D, Applied AI, and Data Platform teams ensuring seamless data flow and quality.
* Support AI Research & Development efforts by applying advanced data warehousing and engineering technologies.
* Maintain clear communication across teams, ensuring alignment with company vision and sharing data infrastructure insights.
* Stay current with emerging tools and methodologies in AI data engineering.
Required Qualifications
* At least 4 years of experience in data engineering, ideally focused on AI/ML data infrastructure.
* Strong proficiency in Python, SQL, and Infrastructure as Code (Terraform, CloudFormation). Experience with modern orchestration frameworks (Airflow, Prefect, or dbt).
* Proficiency with cloud-native data platforms (AWS, Azure, GCP) and vector databases (Pinecone, Weaviate, Qdrant, or Chroma).
* Experience with MLOps tools and platforms (HuggingFace, SageMaker Bedrock, Vertex AI), experiment tracking (MLflow, Weights & Biases), and model deployment pipelines.
* Experience with Large Language Models, embedding generation, retrieval‑augmented generation, and LLM orchestration frameworks (LiteLLM, LangFuse, LangChain, LlamaIndex).
* Strong problem‑solving, analytical, and communication skills with the ability to design scalable AI data systems and collaborate effectively cross‑functionally.
Desired Qualifications
* 6+ years of data engineering experience with AI and machine learning projects.
* Experience in technical leadership or mentorship.
* Experience in education, EdTech, or academic integrity sectors.
* Experience using AI coding tools (Cursor, Claude Code, GitHub Copilot).
* Familiarity with natural language processing, computer vision, or multimodal AI applications.
* Experience with data visualization (Streamlit) and data reporting.
Characteristics for Success
* A passion for creatively solving complex data problems.
* The ability to work collaboratively and cross‑functionally.
* A continuous learning mindset, always striving to improve skills and knowledge.
* Proven track record of delivering results and ensuring a high level of quality.
* Strong written and verbal communication skills.
* Curiosity about problems, the field, and best solutions.
Benefits
Turnitin offers a competitive Total Rewards package that includes generous time off, health and wellness programs, remote‑centric culture, and comprehensive well‑being support.
Equal Opportunity Employer
Turnitin, LLC is committed to ensuring all persons have equal access to its programs, facilities and employment. All qualified applicants will receive consideration without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.
J-18808-Ljbffr