WikiBio

Wikipedia Biography Dataset

Dataset Information
Modalities: Texts
Languages: English
Introduced: 2016

Overview

This dataset gathers 728,321 biographies from English Wikipedia and is intended for evaluating text generation algorithms. For each article, it provides the first paragraph and the infobox, both tokenized.
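
To make the structure described above concrete, here is a minimal sketch of how one example (a tokenized infobox paired with a tokenized first paragraph) might be held in memory and flattened into a sequence for a table-to-text model. The class name `BiographyRecord`, the function `linearize_infobox`, the field names, and the sample values are all hypothetical illustrations; they are not the dataset's actual file format or an official loader.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class BiographyRecord:
    """Hypothetical in-memory form of one example (not the on-disk format)."""
    infobox: Dict[str, List[str]]  # field name -> tokenized field value
    first_paragraph: List[str]     # tokenized first paragraph (generation target)


def linearize_infobox(infobox: Dict[str, List[str]]) -> List[Tuple[str, int, str]]:
    """Flatten an infobox into (field, position, token) triples.

    Positions start at 1 within each field, so a model can tell where
    in a field value each token occurred.
    """
    triples = []
    for field, tokens in infobox.items():
        for position, token in enumerate(tokens, start=1):
            triples.append((field, position, token))
    return triples


# Made-up record for illustration only; it is not taken from the dataset.
example = BiographyRecord(
    infobox={"name": ["jane", "doe"], "occupation": ["engineer"]},
    first_paragraph=["jane", "doe", "is", "an", "engineer", "."],
)

source_triples = linearize_infobox(example.infobox)
```

In this sketch the infobox is the model input and the first paragraph is the reference text that generated output would be compared against.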

Variants: WikiBio

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task | Model | Paper | Date
Table-to-Text Generation | MBD | Controlling Hallucinations at Word Level … | 2021-02-04
Table-to-Text Generation | Field-gating Seq2seq + dual attention | Table-to-text Generation by Structure-aware Seq2seq … | 2017-11-27
Table-to-Text Generation | Field-gating Seq2seq + dual attention + beam search | Table-to-text Generation by Structure-aware Seq2seq … | 2017-11-27
Table-to-Text Generation | Table NLM | Neural Text Generation from Structured … | 2016-03-24

Research Papers

Recent papers with results on this dataset: