Pathway is building a post-transformer frontier model that solves AI's fundamental memory problem. As an AI Benchmark & Datasets Engineer/Researcher, you will design and execute rigorous benchmarks and define dataset standards. The role involves collaborating closely with the R&D team to build the evaluation infrastructure that guides the evolution of Pathway’s post-transformer models.
Requirements
- Published at least one paper at NeurIPS, ICLR, or ICML as lead author or made significant conceptual & code contributions
- Significantly contributed to an LLM training effort that became newsworthy
- Spent at least 6 months working in a leading Machine Learning research center
- ICPC World Finalist or an IOI, IMO, or IPhO medalist in High School
- Experience with ML/LLM evaluation, data science, or technical product roles
- Ability to read papers, leaderboards, and Github repos and turn them into clear, repeatable benchmark specs
- Comfortable talking with engineers and customers and translating between technical detail and business value
- Care about high-quality data, reproducible experiments, and crisp documentation
- Respectful of others
- Fluent in English
Benefits
- Generous Paid Time Off
- 401k Matching
- Retirement Plan
- Relocation Assistance
- Tuition Reimbursement