ROOR

Dataset Information
Modalities
Images, Texts
Languages
English
Introduced
2024
License
Homepage

Overview

ROOR is a reading order prediction (ROP) benchmark which annotates layout reading order as ordering relations.

Layout reading order is typically formulated as a permutation of layout elements, i.e. a sequence containing all the layout elements.
However, multiple cases have reflected that this formulation does not adequately convey the complete reading order information in the layout, which may negatively affect the utilization of this signal in downstream VrD tasks.

Therefore, this work investigate the properties of layout reading order, conceptualizing it with terms Immediate Succession During Reading(ISDR) and Generalized Succession During Reading(GSDR), and formulate each of which as an ordering relation over layout elements.
Then, ROOR provides the annotation of the ISDR relationship over layout segments, based on the layout annotation of EC-FUNSD.
Overall, ROOR comprises 199 samples including 10,662 segments, 31,297 words and 10,967 annotated reading order linking pairs.
We hope the construction of this benchmark could facilitate the development of automated ROP methods of the improved task form.

Variants: ROOR

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Reading Order Detection LayoutLMv3-GlobalPointer (large) Modeling Layout Reading Order as … 2024-09-29
Reading Order Detection LayoutLMv3-GlobalPointer (base) Modeling Layout Reading Order as … 2024-09-29
Reading Order Detection TPP (LayoutLMv3-base) Reading Order Matters: Information Extraction … 2023-10-17
Reading Order Detection LayoutReader LayoutReader: Pre-training of Text and … 2021-08-26

Research Papers

Recent papers with results on this dataset: