MMLU-Pro benchmark introduced in paper: MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark (Wang et al., 2024).
Subject
MMLU-Pro benchmark
Predicate
introduced_in_paper
Object
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark (Wang et al., 2024)
Primary source · preprint · 2024-06-03
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark — arXiv (Yubo Wang et al. — TIGER-Lab)