We are seeking a Research Scientist/Engineer to develop techniques to minimize hallucinations and enhance truthfulness in language models. The successful candidate will design and implement novel data curation pipelines, develop specialized classifiers, and create comprehensive honesty benchmarks and evaluation frameworks.

Requirements

Design and implement novel data curation pipelines to identify, verify, and filter training data for accuracy
Develop specialized classifiers to detect potential hallucinations or miscalibrated claims made by the model
Create and maintain comprehensive honesty benchmarks and evaluation frameworks
Implement search and retrieval-augmented generation (RAG) systems to ground model outputs in verified information
Design and deploy human feedback collection specifically for identifying and correcting miscalibrated responses
Design and implement prompting pipelines to generate data that improves model accuracy and honesty
Develop and test novel RL environments that reward truthful outputs and penalize fabricated claims
Create tools to help human evaluators efficiently assess model outputs for accuracy

Benefits

Competitive compensation and benefits
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
Lovely office space

Requirements

Design and implement novel data curation pipelines to identify, verify, and filter training data for accuracy
Develop specialized classifiers to detect potential hallucinations or miscalibrated claims made by the model
Create and maintain comprehensive honesty benchmarks and evaluation frameworks
Implement search and retrieval-augmented generation (RAG) systems to ground model outputs in verified information
Design and deploy human feedback collection specifically for identifying and correcting miscalibrated responses
Design and implement prompting pipelines to generate data that improves model accuracy and honesty
Develop and test novel RL environments that reward truthful outputs and penalize fabricated claims
Create tools to help human evaluators efficiently assess model outputs for accuracy

Benefits

Competitive compensation and benefits
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
Lovely office space

Research Scientist/Engineer, Honesty

About the Company

Job Description

Requirements

Benefits

Similar Jobs

Research Scientist/Engineer, Honesty

Research Engineer / Scientist, Alignment Science

Research Engineer / Scientist, Alignment Science, London

Research Scientist/Engineer, Honesty

About the Company

Job Description

Requirements

Benefits

Similar Jobs

Research Scientist/Engineer, Honesty

Research Engineer / Scientist, Alignment Science

Research Engineer / Scientist, Alignment Science, London

Job Details

About Anthropic