PopQA is an open-domain QA dataset with 14k QA pairs with fine-grained Wikidata entity ID, Wikipedia page views, and relationship type information.
Source: https://paperswithcode.com/paper/when-not-to-trust-language-models
Image Source: https://arxiv.org/pdf/2212.10511v1.pdf
Variants: PopQA
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Question Answering | SelfRAG-7b | Self-RAG: Learning to Retrieve, Generate, … | 2023-10-17 |
Question Answering | SelfRAG-13b | Self-RAG: Learning to Retrieve, Generate, … | 2023-10-17 |
Recent papers with results on this dataset: