WikiHop

Dataset Information
Modalities
Texts
Languages
English
Introduced
2017
License
Homepage

Overview

WikiHop is a multi-hop question-answering dataset. The query of WikiHop is constructed with entities and relations from WikiData, while supporting documents are from WikiReading. A bipartite graph connecting entities and documents is first built and the answer for each query is located by traversal on this graph. Candidates that are type-consistent with the answer and share the same relation in query with the answer are included, resulting in a set of candidates. Thus, WikiHop is a multi-choice style reading comprehension data set. There are totally about 43K samples in training set, 5K samples in development set and 2.5K samples in test set. The test set is not provided. The task is to predict the correct answer given a query and multiple supporting documents.

The dataset includes a masked variant, where all candidates and their mentions in the supporting documents are replaced by random but consistent placeholder tokens.

Source: Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs
Image Source: http://qangaroo.cs.ucl.ac.uk/

Variants: WikiHop

Associated Benchmarks

This dataset is used in 2 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Question Answering LUKE-Graph LUKE-Graph: A Transformer-based Approach with … 2023-03-12
Question Answering BigBird-etc Big Bird: Transformers for Longer … 2020-07-28
Question Answering Longformer-large Longformer: The Long-Document Transformer 2020-04-10
Question Answering MultiHop (Chen et al., [2019a]) Multi-hop Question Answering via Reasoning … 2019-10-07
Paraphrase Identification StructBERTRoBERTa ensemble StructBERT: Incorporating Language Structures into … 2019-08-13
Question Answering CFC Coarse-grain Fine-grain Coattention Network for … 2019-01-03
Question Answering MHPGM + NOIC Commonsense for Generative Multi-Hop Question … 2018-09-17
Question Answering MHQA Exploring Graph-structured Passage Representation for … 2018-09-06
Question Answering Coref-GRU Neural Models for Reasoning over … 2018-04-16
Question Answering BiDAF Constructing Datasets for Multi-hop Reading … 2017-10-17

Research Papers

Recent papers with results on this dataset: