ML Research Wiki / Benchmarks / Document Image Classification / RVL-CDIP

RVL-CDIP

Document Image Classification Benchmark

Performance Over Time

📊 Showing 29 results | 📏 Metric: Accuracy

Top Performing Models

Rank Model Paper Accuracy Date Code
1 Roberta base RoBERTa: A Robustly Optimized BERT Pretraining Approach 90.06 2019-07-26 📦 huggingface/transformers 📦 pytorch/fairseq 📦 PaddlePaddle/PaddleNLP
2 EAML EAML: Ensemble Self-Attention-based Mutual Learning Network for Document Image Classification 0.00 2023-05-11 -
3 DocFormerBASE DocFormer: End-to-End Transformer for Document Understanding 0.00 2021-06-22 📦 shabie/docformer
4 LayoutLMV3Large LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking 0.00 2022-04-18 📦 huggingface/transformers 📦 microsoft/unilm 📦 pwc-1/Paper-9 📦 MindSpore-scientific-2/code-14
5 LiLT[EN-R]BASE LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding 0.00 2022-02-28 📦 huggingface/transformers 📦 jpwang/lilt 📦 pwc-1/Paper-9 📦 MindSpore-scientific-2/code-14 📦 MS-P3/code3
6 LayoutLMv2LARGE LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding 0.00 2020-12-29 📦 huggingface/transformers 📦 PaddlePaddle/PaddleOCR 📦 microsoft/unilm
7 TILT-Large Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer 0.00 2021-02-18 📦 uakarsh/TiLT-Implementation
8 DocFormer large DocFormer: End-to-End Transformer for Document Understanding 0.00 2021-06-22 📦 shabie/docformer
9 LayoutLMv3BASE LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking 0.00 2022-04-18 📦 huggingface/transformers 📦 microsoft/unilm 📦 pwc-1/Paper-9 📦 MindSpore-scientific-2/code-14
10 Donut OCR-free Document Understanding Transformer 0.00 2021-11-30 📦 clovaai/donut 📦 impira/docquery 📦 MindCode-4/code-3 📦 code-implementation1/Code9 📦 2023-MindSpore-1/ms-code-2

All Papers (29)

Analysis of Convolutional Neural Networks for Document Image Classification

2017
AlexNet + spatial pyramidal pooling + image resizing