WikiHow

Dataset Information
Modalities: Texts
Languages: English
Introduced: 2018
License
Homepage

Overview

WikiHow is a dataset of more than 230,000 article and summary pairs extracted and constructed from an online knowledge base written by different human authors. The articles span a wide range of topics and represent a high diversity of writing styles.

Source: WikiHow: A Large Scale Text Summarization Dataset

Variants: WikiHow, wikihow
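
As a rough illustration of the article and summary pairs described in the overview, the sketch below models one record and measures how much shorter the summary is than the article. The class name `ArticleSummaryPair`, its field names, the helper `compression_ratio`, and the toy record are all assumptions made for this example; they are not the dataset's actual schema or contents.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ArticleSummaryPair:
    """One hypothetical record: a full article and its reference summary."""
    article: str
    summary: str


def compression_ratio(pair: ArticleSummaryPair) -> float:
    """Return article length divided by summary length, in whitespace tokens."""
    article_tokens = len(pair.article.split())
    summary_tokens = len(pair.summary.split())
    if summary_tokens == 0:
        raise ValueError("summary must contain at least one token")
    return article_tokens / summary_tokens


# Toy record written for this sketch; it is not taken from the dataset.
example = ArticleSummaryPair(
    article=(
        "Fill the kettle with fresh water. Bring the water to a boil. "
        "Pour it over the tea bag and wait three minutes."
    ),
    summary="Boil water. Steep the tea.",
)

if __name__ == "__main__":
    print(f"compression ratio: {compression_ratio(example):.2f}")
```

Whitespace tokenization is only a convenient stand-in here; a real summarization pipeline would normally use the tokenizer of the model being evaluated.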

Associated Benchmarks

This dataset is used in 2 benchmarks:

Recent Benchmark Submissions

Task                            Model                         Paper                                      Date
Text Summarization              BertSum                       Abstractive Summarization of Spoken and …  2020-08-21
Abstractive Text Summarization  BertSum                       Abstractive Summarization of Spoken and …  2020-08-21
Text Summarization              MatchSum (BERT-base)          Extractive Summarization as Text Matching  2020-04-19
Text Summarization              Pointer-generator + coverage  WikiHow: A Large Scale Text …              2018-10-18

Research Papers

Recent papers with results on this dataset: