We are looking for a Large Language Model (LLM) Algorithm Engineer to join our EM Labs team in Changsha, China. The role involves production deployment and optimization of LLM environments, LLM application development, and the design of distributed deployment solutions based on NVIDIA hardware architecture.
Requirements
- Deploy and optimize LLM environments in production
- Develop LLM applications
- Design distributed deployment solutions based on NVIDIA hardware architecture (NVLink/NVSwitch)
- Build a multi-modal GPU cluster management system
- Optimize models and engineer them for deployment
- Design prompt engineering strategies combined with Retrieval-Augmented Generation (RAG)
- Lead model fine-tuning using parameter-efficient fine-tuning techniques such as LoRA/QLoRA
- Develop enterprise-grade internal toolchains
- Design external customer systems
- Build collaborative multi-agent online assistants using multi-agent task allocation mechanisms
Benefits
- 20 days of annual leave
- 13 public holidays
- 10 sick leave days per year
- 22 weeks of paid maternity leave
- 4 weeks of paternity leave
- Monthly lunch allowance
- English courses
- Onsite gym
- Access to online learning platforms
- Budget for external training