The role involves post-training of LLMs, model alignment, server operation for checkpoint routing, and building evaluation pipelines.
Job Responsibilities:
1. Advanced post-training of large language models (e.g. SFT, RLHF/RLAIF, continual pretraining).
2. Aligning models for reliable JSON-schema function calls and external tool usage.
3. Design, deploy, and operate Model Context Protocol (MCP) servers that handle checkpoint routing, manage context windows, and enforce safety gates.
4. Experience in distributed training and inference with DeepSpeed/FSDP, LoRA/QLoRA, mixed precision, and performance tuning on vLLM or Triton clusters.
5. Build offline and live eval pipelines for alignment, factuality, grounding, and hallucinations.
Qualifications
1. Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
2. 3+ years of experience in developing and optimizing large language models.
3. Proven track record in implementing advanced post-training techniques (SFT, RLHF, RLAIF, continual pretraining).
4. Hands-on experience with distributed training frameworks (DeepSpeed, FSDP) and optimization techniques (LoRA, QLoRA, mixed precision).
5. Familiarity with model alignment, JSON-schema function calls, and external tool integration.
6. Experience in building and maintaining evaluation pipelines for model performance assessment.
7. Proficiency in Python and relevant machine learning frameworks (e.g., PyTorch, TensorFlow).
8. Strong understanding of distributed systems and high-performance computing.
9. Experience with model deployment and inference optimization on vLLM or Triton clusters.
10. Knowledge of JSON-schema and API development.
Top Skills
Deepspeed
Fsdp
Lora
Python
PyTorch
Qlora
TensorFlow
Triton
Vllm
Similar Jobs
Artificial Intelligence • Productivity • Software • Automation
Join Zapier's Events Team as a Backend Engineer to enhance foundational event and queuing services, ensuring reliability and scalability. Collaborate with a tight-knit team, advocate for user experience, and manage tasks effectively while exploring new technologies.
Top Skills:
AuroraAvroAWSGoKafkaLambdaPythonRedisS3SqsTerraformTypescript
Cloud • Security • Software • Cybersecurity • Automation
The Senior Product Manager will own the strategy, roadmap, and adoption of GitLab Dedicated, collaborating with multiple teams to drive enterprise adoption and ensure service quality in regulated environments.
Top Skills:
AIDevsecopsSaaS
Cloud • Security • Software • Cybersecurity • Automation
The Security Manager for PSIRT will lead vulnerability response efforts, improve security processes, and mentor teams while ensuring effective communication across departments.
Top Skills:
AIDevsecops
What you need to know about the Belfast Tech Scene
If asked to name the birthplace of the RMS Titanic, you might not say Belfast. Similarly, if asked to name Europe's leading destination for foreign direct investment in new software development, Belfast might not come to mind. Yet, both are true. The city has emerged as a tech powerhouse, recently ranked among the best in the U.K. for tech careers — especially for software developers. It also leads the U.K. with the highest percentage of software development jobs advertised.


