We are seeking a highly skilled LLM Researcher to join our client team, focusing on innovation in Large Language Models (LLMs) and scalable AI deployment.
Key Responsibilities
Research & Development (Primary Focus)
* Design and experiment with LLM architectures and transformer-based models aimed at enhancing language understanding and generation tasks.
* Implement optimization methods, including parameter-efficient finetuning and quantization strategies.
* Develop and refine pretraining and fine-tuning pipelines for LLMs, contributing to advancements in natural language processing.
Data Engineering
* Create robust, scalable data pipelines for managing multi-modal datasets; oversee preprocessing, cleaning, and curation workflows essential for efficient LLM training and inference.
Model Development & Evaluation
* Lead the development and evaluation of LLMs, leveraging expertise in transformers, self-supervised learning, and large-scale distributed training.
* Optimize training pipelines for enhanced performance, compute efficiency, and responsible AI alignment.
LLM Deployment & Scalability
* Manage the end-to-end deployment of LLMs and foundation models in production environments, focusing on system integration, inference, and optimization.
Cross-functional Collaboration
* Engage with research scientists, ML engineers, and product teams to align technical advances with business needs.
* Mentor junior researchers and engineers, promoting best practices in LLM development, scalable model deployment, and evaluation processes.
Required Skills & Qualifications
* Experience: 3+ years in NLP, transformer models, deep learning, and scalable AI pipelines, with demonstrated involvement in LLM development, fine-tuning, or deployment.
* Technical Skills:
* Proficiency in Python and frameworks such as PyTorch, TensorFlow, or JAX.
* Strong familiarity with distributed computing and data engineering tools (e.g., SLURM, Apache Spark, Airflow).
* Hands-on experience with LLM training, fine-tuning, and deployment (e.g., Hugging Face, LLamafactory, NVIDIA NeMo).
Preferred Qualifications
* Advanced degree (MS/PhD) in Computer Science, AI, or a related field.
* Publications in top-tier conferences as the main author.
* Experience with cloud platforms (AWS, GCP, Azure) and MLOps tools.
Core Competencies
* Strong problem-solving and analytical skills focused on enhancing model performance.
* Excellent communication and collaboration abilities to work effectively within cross-functional teams.
* High attention to detail and commitment to maintaining high standards in research and development.
* An eagerness to stay informed about the latest advancements in LLM technologies and methodologies.