ReadingBank

Dataset Information
Introduced
2021
License
Unknown
Homepage

Overview

ReadingBank is a benchmark dataset for reading order detection built with weak supervision from WORD documents, which contains 500K document images with a wide range of document types as well as the corresponding reading order information.

Variants: ReadingBank

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Reading Order Detection TPP (LayoutMask) Reading Order Matters: Information Extraction … 2023-10-17
Reading Order Detection LayoutReader LayoutReader: Pre-training of Text and … 2021-08-26

Research Papers

Recent papers with results on this dataset: