Anthropic's RL Data team builds the systems that produce high-quality reinforcement learning data for Claude: data collection pipelines, human feedback tooling, the execution environments RL tasks run in, and the quality assurance that keeps training data trustworthy at scale.
Requirements
- Strong software engineering skills and proficiency in at least one modern programming language — we mostly use Python and TypeScript, and care more that you pick new tools up quickly than that you know our exact stack
- Experience designing, building, and running backend systems or infrastructure
- Effective use of AI tools in your own day-to-day work
- Willingness to own problems end-to-end, including the parts that aren't engineering
- Proactive, open communication: you can be trusted to run a workstream, and to escalate early when something's off
- Comfort iterating quickly in ambiguous, fast-changing situations
- Care about the societal impacts of your work
Benefits
- Competitive compensation
- Optional equity donation matching
- Generous vacation and parental leave
- Flexible working hours
- Lovely office space