SourceScore
SourceScore VERITAS · verified claim92% confidence

TruthfulQA benchmark introduced in paper: TruthfulQA: Measuring How Models Mimic Human Falsehoods (Lin et al., 2021).

Subject
TruthfulQA benchmark
Predicate
introduced_in_paper
Object
TruthfulQA: Measuring How Models Mimic Human Falsehoods (Lin et al., 2021)
Primary source · preprint · 2021-09-08
TruthfulQA: Measuring How Models Mimic Human Falsehoods arXiv (Lin, Hilton, Evans — University of Oxford + OpenAI)
Last verified 2026-05-31 · 3 sources · 824f830889daf33eView full claim →