Mistral AI is a dynamic, collaborative team passionate about AI and its potential to transform society. We are seeking a Research Engineer - ML track to build and optimize large-scale learning systems that power our open-weight models.
Requirements
- Master's or PhD in Computer Science (or equivalent proven track record)
- 4+ years working on large-scale ML codebases
- Hands-on experience with PyTorch, JAX, or TensorFlow; comfortable with distributed training and cluster tooling (DeepSpeed / FSDP / SLURM / K8s)
- Experience in deep learning, NLP or LLMs; bonus for CUDA or data-pipeline chops
- Strong software-design instincts: testing, code review, CI/CD
Benefits
- Competitive salary
- Healthcare: Medical/Dental/Vision covered for you and your family
- Pension: 401(k) (6% matching)
- PTO: 18 days
- Transportation: Reimbursement of office parking charges, or $120/month for public transport
- Sport: $120/month reimbursement for gym membership
- Meal stipend: $400 monthly allowance for meals (this arrangement may change as we grow)
- Visa sponsorship
- Coaching: BetterUp coaching, offered on a voluntary basis