Anthropic is a public benefit corporation headquartered in San Francisco that works to build reliable, interpretable, and steerable AI systems.
As a Societal Impacts research scientist on the Models Research Pod, you'll close the loop between observing Claude's behavior and improving it at the model level. You'll use observational tools like Clio to analyze real-world usage patterns and build evaluations that assess whether Claude provides safe responses aligned with its Constitution.