SROIE

Dataset Information
Languages
English
Introduced
2021
License
Unknown
Homepage

Overview

The dataset consists of 1,000 whole scanned receipt images with annotations, created for the competition on Scanned Receipts OCR and key Information Extraction (SROIE).
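To make the key information extraction task more concrete, here is a minimal sketch of how predictions for a receipt might be scored against its annotation. Everything specific in it is an assumption rather than something stated on this page: the four field names (company, date, address, total), the exact-match comparison, and the helper names score_receipt and f1_score are illustrative only, and the example receipt is invented.

```python
from typing import Dict, Iterable, Tuple

# Assumed field names for a key-information annotation (not given on this page).
FIELDS = ("company", "date", "address", "total")


def score_receipt(pred: Dict[str, str], gold: Dict[str, str]) -> Tuple[int, int, int]:
    """Return (correct, predicted, expected) field counts for one receipt.

    A field counts as correct only if the predicted string equals the
    annotated string exactly, after stripping surrounding whitespace.
    """
    predicted = sum(1 for f in FIELDS if pred.get(f, "").strip())
    expected = sum(1 for f in FIELDS if gold.get(f, "").strip())
    correct = sum(
        1
        for f in FIELDS
        if gold.get(f, "").strip()
        and pred.get(f, "").strip() == gold[f].strip()
    )
    return correct, predicted, expected


def f1_score(pairs: Iterable[Tuple[Dict[str, str], Dict[str, str]]]) -> float:
    """Micro-averaged F1 over (prediction, annotation) pairs."""
    correct = predicted = expected = 0
    for pred, gold in pairs:
        c, p, e = score_receipt(pred, gold)
        correct += c
        predicted += p
        expected += e
    precision = correct / predicted if predicted else 0.0
    recall = correct / expected if expected else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical receipt: the total is wrong and the address is missing.
gold = {"company": "ACME STORE", "date": "01/02/2019",
        "address": "1 MAIN ROAD", "total": "12.50"}
pred = {"company": "ACME STORE", "date": "01/02/2019", "total": "12.60"}

print(score_receipt(pred, gold))
print(round(f1_score([(pred, gold)]), 4))
```

Exact string matching is the strictest reasonable choice for a sketch like this; a real evaluation may normalise case, punctuation, or OCR mismatches differently, which is one reason the same model can appear with more than one score.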

Image source: https://arxiv.org/pdf/2103.10213.pdf

Variants: SROIE

Associated Benchmarks

This dataset is used in 2 benchmarks:

Recent Benchmark Submissions

Task | Model | Paper | Date
Key Information Extraction | RORE (GeoLayoutLM) | Modeling Layout Reading Order as … | 2024-09-29
Key Information Extraction | ChatGPT 3.5 SpatialFormat | LAPDoc: Layout-Aware Prompting for Documents | 2024-02-15
Key Information Extraction | LayoutLMv2LARGE (Excluding OCR mismatch) | LayoutLMv2: Multi-modal Pre-training for Visually-Rich … | 2020-12-29
Key Information Extraction | LayoutLMv2LARGE | LayoutLMv2: Multi-modal Pre-training for Visually-Rich … | 2020-12-29
Key Information Extraction | LayoutLMv2BASE | LayoutLMv2: Multi-modal Pre-training for Visually-Rich … | 2020-12-29

Research Papers

Recent papers with results on this dataset: