TruthfulQA benchmark introduced in paper: TruthfulQA: Measuring How Models Mimic Human Falsehoods (Lin et al., 2021). — SourceScore VERITAS embed · SourceScore

SourceScore VERITAS · verified claim92% confidence

TruthfulQA benchmark introduced in paper: TruthfulQA: Measuring How Models Mimic Human Falsehoods (Lin et al., 2021).

TruthfulQA benchmark

introduced_in_paper

TruthfulQA: Measuring How Models Mimic Human Falsehoods (Lin et al., 2021)

Primary source · preprint · 2021-09-08

TruthfulQA: Measuring How Models Mimic Human Falsehoods — arXiv (Lin, Hilton, Evans — University of Oxford + OpenAI)

Last verified 2026-05-31 · 3 sources · 824f830889daf33eView full claim →