LLM Researcher

Slough

Astek

Posted: 17 October

Offer description

We are seeking a highly skilled LLM Researcher to join our client's team, focusing on innovation in Large Language Models (LLMs) and scalable AI deployment.

Key Responsibilities

Research & Development (Primary Focus)

* Design and experiment with LLM architectures and transformer-based models aimed at enhancing language understanding and generation tasks.
* Implement optimization methods, including parameter-efficient fine-tuning and quantization strategies.
* Develop and refine pretraining and fine-tuning pipelines for LLMs, contributing to advancements in natural language processing.

Data Engineering

* Create robust, scalable data pipelines for managing multi-modal datasets; oversee preprocessing, cleaning, and curation workflows essential for efficient LLM training and inference.

Model Development & Evaluation

* Lead the development and evaluation of LLMs, leveraging expertise in transformers, self-supervised learning, and large-scale distributed training.
* Optimize training pipelines for enhanced performance, compute efficiency, and responsible AI alignment.

LLM Deployment & Scalability

* Manage the end-to-end deployment of LLMs and foundation models in production environments, focusing on system integration, inference, and optimization.

Cross-functional Collaboration

* Engage with research scientists, ML engineers, and product teams to align technical advances with business needs.
* Mentor junior researchers and engineers, promoting best practices in LLM development, scalable model deployment, and evaluation processes.

Required Skills & Qualifications

* Experience: 3+ years in NLP, transformer models, deep learning, and scalable AI pipelines, with demonstrated involvement in LLM development, fine-tuning, or deployment.
* Technical Skills:
  * Proficiency in Python and frameworks such as PyTorch, TensorFlow, or JAX.
  * Strong familiarity with distributed computing and data engineering tools (e.g., SLURM, Apache Spark, Airflow).
  * Hands-on experience with LLM training, fine-tuning, and deployment (e.g., Hugging Face, LLaMA-Factory, NVIDIA NeMo).

Preferred Qualifications

* Advanced degree (MS/PhD) in Computer Science, AI, or a related field.
* First-author publications at top-tier conferences.
* Experience with cloud platforms (AWS, GCP, Azure) and MLOps tools.

Core Competencies

* Strong problem-solving and analytical skills focused on enhancing model performance.
* Excellent communication and collaboration abilities to work effectively within cross-functional teams.
* Close attention to detail and a commitment to high standards in research and development.
* An eagerness to stay informed about the latest advancements in LLM technologies and methodologies.
