The TruthfulQA benchmark was introduced in the paper "TruthfulQA: Measuring How Models Mimic Human Falsehoods" (Lin et al., 2021).
Subject
TruthfulQA benchmark
Predicate
introduced_in_paper
Object
TruthfulQA: Measuring How Models Mimic Human Falsehoods (Lin et al., 2021)
Primary source · preprint · 2021-09-08
TruthfulQA: Measuring How Models Mimic Human Falsehoods — arXiv (Lin, Hilton, Evans — University of Oxford + OpenAI)