Anthropic is an AI research company focused on developing reliable, interpretable, and steerable AI systems. Their flagship product, Claude, is an AI assistant designed for tasks of any scale. The company's research encompasses natural language processing, human feedback mechanisms, scaling laws, reinforcement learning, code generation, and interpretability, setting them apart with a commitment to transparency and control in AI development.
Anthropic is seeking a Research Engineer to contribute to their Horizons team. The role focuses on advancing reinforcement learning, specifically agentic models and improving reasoning abilities. This involves building infrastructure, designing training environments, and driving performance improvements across the stack. The team's mission is to create reliable and beneficial AI systems.
Anthropic is an AI research company focused on developing reliable, interpretable, and steerable AI systems. Their flagship product, Claude, is an AI assistant designed for tasks of any scale. The company's research encompasses natural language processing, human feedback mechanisms, scaling laws, reinforcement learning, code generation, and interpretability, setting them apart with a commitment to transparency and control in AI development.
Anthropic