Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. We believe that a diverse range of perspectives is a requirement for building great products. As a Data Engineer specializing in pretraining data, you will play a pivotal role in developing the data pipeline that underpins Cohere’s advanced language models.
Requirements
- Strong software engineering skills, with proficiency in Python and experience building data pipelines.
- Familiarity with data processing frameworks such as Apache Spark, Apache Beam, Pandas, or similar tools.
- A passion for bridging research and engineering to solve complex data-related challenges in AI model training.
Benefits
- An open and inclusive culture and work environment
- Weekly lunch stipend, in-office lunches & snacks
- Full health and dental benefits
- 100% Parental Leave top-up for up to 6 months
- Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
- Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
- 6 weeks of vacation (30 working days!)
- Co-working stipend