QuALITY

Question Answering with Long Input Texts, Yes!

Dataset Information
Modalities
Texts
Introduced
2021
Homepage

Overview

QuALITY (Question Answering with Long Input Texts, Yes!) is a multiple-choice question answering dataset for long document comprehension. The dataset consists of context passages in English that have an average length of about 5,000 tokens, much longer than typical current models can process. Unlike in prior work with passages, the questions are written and validated by contributors who have read the entire passage, rather than relying on summaries or excerpts.

Variants: QuALITY

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Question Answering RAPTOR + GPT-4 (June 2023) RAPTOR: Recursive Abstractive Processing for … 2024-01-31

Research Papers

Recent papers with results on this dataset: