GovReport

Dataset Information
Modalities
Texts
Languages
English
Introduced
2021
License
Unknown
Homepage

Overview

GovReport is a dataset for long document summarization, with significantly longer documents and summaries. It consists of reports written by government research agencies including Congressional Research Service and U.S. Government Accountability Office.

Compared with other long document summarization datasets, government report dataset has longer summaries and documents and requires reading in more context to cover salient words to be summarized.

Variants: GovReport

Associated Benchmarks

This dataset is used in 2 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Text Summarization BART-LS Adapting Pretrained Text-to-Text Models for … 2022-09-21
Text Summarization FactorSum Factorizing Content and Budget Decisions … 2022-05-25
Extractive Text Summarization MemSum (extractive) MemSum: Extractive Summarization of Long … 2021-07-19
Extractive Text Summarization HEPOS Efficient Attention: Attention with Linear … 2018-12-04

Research Papers

Recent papers with results on this dataset: