We are searching for an outstanding researcher to work on Large Language Model (LLM) research team, exploring alternative avenues to unlock new capabilities in language models, and innovating new learning paradigms.
Requirements
- PhD in Computer Science or Computer Engineering (or equivalent experience)
- At least 6 years of research experience in artificial intelligence, machine learning, natural language processing, computer vision or related subjects
- Excellent knowledge of theory and practice of deep learning and natural language processing
- Background in LLM training, alignment, and evaluation is expected
- Excellent programming skills in Python and PyTorch
- Hands-on experience with large-scale model training including data preparation and model parallelization (tensor and pipeline) is required
- Excellent communications skills
Benefits
- Eligible for equity and benefits