The dataset contains 1000 whole scanned receipt images with annotations for the competition on Scanned Receipts OCR and Key Information Extraction (SROIE).
Image source: https://arxiv.org/pdf/2103.10213.pdf
Variants: SROIE
This dataset is used in 2 benchmarks:
Task | Model | Paper | Date |
---|---|---|---|
Key Information Extraction | RORE (GeoLayoutLM) | Modeling Layout Reading Order as … | 2024-09-29 |
Key Information Extraction | ChatGPT 3.5 SpatialFormat | LAPDoc: Layout-Aware Prompting for Documents | 2024-02-15 |
Key Information Extraction | LayoutLMv2-LARGE (excluding OCR mismatch) | LayoutLMv2: Multi-modal Pre-training for Visually-Rich … | 2020-12-29 |
Key Information Extraction | LayoutLMv2-LARGE | LayoutLMv2: Multi-modal Pre-training for Visually-Rich … | 2020-12-29 |
Key Information Extraction | LayoutLMv2-BASE | LayoutLMv2: Multi-modal Pre-training for Visually-Rich … | 2020-12-29 |
Recent papers with results on this dataset: