Job summary
Are you passionate about leveraging AI to solve real-world business challenges? Looking for a talented Data Scientist to join a growing data science function focused on driving innovation through NLP, LLM, and generative AI.
Key skills required for this role
Data Scientist
Important
Data Scientist
Job description
Job Title: Data Scientist - NLP, LLMs & Generative AI
Are you passionate about leveraging AI to solve real-world business challenges? We're looking for a talented Data Scientist to join a growing and forward-thinking data science function focused on driving innovation through NLP, large language models (LLMs), and generative AI.
This is an exciting opportunity to be part of a newly established team, working closely with the Head of Data Science to shape the roadmap, scale AI-driven solutions, and unlock new opportunities across the business. If you're hands-on with Python, experienced in building NLP or LLM-based applications, and thrive in a collaborative, problem-solving environment-this role is for you.
What You'll Do:
1.
Design and deploy AI-powered tools that streamline internal processes and drive growth
2.
Apply advanced NLP, LLMs, and automation techniques to enhance workflows and insights
3.
Identify AI use cases with cross-functional teams and deliver practical, impactful solutions
4.
Experiment with emerging open-source LLMs and generative AI technologies for adoption
5.
Develop RAG (Retrieval-Augmented Generation) pipelines for extracting insights from unstructured data (PDFs, images, emails, etc.)
6.
Support proof-of-concept development and help scale successful solutions across the business
What We're Looking For:
Technical Expertise:
7.
5+ years of hands-on experience with Python for machine learning and data science
8.
Strong background in ML, NLP, and LLMs, including training/fine-tuning models (e.g., Transformers, encoder-decoder architectures)
9.
Experience with open-source LLMs (e.g., LLaMA, Mistral, DeepSeek, Qwen, etc.)
10.
Familiarity with Hugging Face Transformers and related NLP libraries
11.
Experience building RAG pipelines with embedding models and vector search
Application Development:
12.
Comfortable developing APIs using FastAPI or Flask
13.
Experience with tools like Streamlit or Gradio for building interactive AI applications
14.
Demonstrated ability to integrate LLMs into real-world applications
Infrastructure & Deployment:
15.
Experience testing models on local GPU infrastructure
16.
Knowledge of Python environment management and containerization (Docker is a plus)
Data & Version Control:
17.
Proficient with Git for collaborative development
18.
Ability to access and process structured and unstructured data (e.g., SQL, PDFs, images, emails, JSON)
19. Share
manages this role
Matchtech is a STEM Recruitment Specialist, with over 40 years’ experience