Pathway builds the first post-transformer frontier model that solves AI's fundamental memory problem. We're building what comes after transformers, enabling true continuous learning, infinite context reasoning, and real-time adaptation. Our breakthrough architecture outperforms Transformer and provides the enterprise with full visibility into how the model works.
Requirements
- Proactively identify, prioritize, and curate relevant public and client-driven benchmarks across our target use cases and markets.
- Evaluate candidate benchmarks for clarity, data quality, evaluation methodology, and fit with our model roadmap.
- Run benchmarks with baseline models to validate setup, uncover edge cases, and de-risk R&D runs.
- Hand off âbenchmark-readyâ packages to R&D (specs, data, evaluation scripts, expected metrics, constraints)
- Maintain a shared vocabulary and documentation around benchmarks, datasets, and evaluation formats that GTM and R&D can both use.
- Track and organize benchmark results, model leaderboards, and âwhat good looks likeâ for different customers and scenarios.
- Contribute to demos and publicâfacing proof points based on benchmark outcomes.