The paper used 500 scanned Electronic Theses and Dissertation cover pages (i.e., front pages). The dataset contains several intermediate datasets, briefly discussed in the paper.
Variants: ETD500
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Key Information Extraction | CRF-visual | Automatic Metadata Extraction Incorporating Visual … | 2021-07-01 |
Recent papers with results on this dataset: