GovReport is a dataset for long document summarization. It consists of reports written by government research agencies, including the Congressional Research Service and the U.S. Government Accountability Office.
Compared with other long document summarization datasets, GovReport has significantly longer documents and summaries, and it requires reading more of the context to cover the salient words to be summarized.
Variants: GovReport
This dataset is used in 2 benchmarks:
Task | Model | Paper | Date |
---|---|---|---|
Text Summarization | BART-LS | Adapting Pretrained Text-to-Text Models for … | 2022-09-21 |
Text Summarization | FactorSum | Factorizing Content and Budget Decisions … | 2022-05-25 |
Extractive Text Summarization | MemSum (extractive) | MemSum: Extractive Summarization of Long … | 2021-07-19 |
Extractive Text Summarization | HEPOS | Efficient Attention: Attention with Linear … | 2018-12-04 |