| Roberta base | RoBERTa: A Robustly Optimized BERT Pretraining Ap… | 90.06 | 2019-07-26 |
| EAML | EAML: Ensemble Self-Attention-based Mutual Learni… |  | 2023-05-11 |
| DocFormerBASE | DocFormer: End-to-End Transformer for Document Un… |  | 2021-06-22 |
| LayoutLMV3Large | LayoutLMv3: Pre-training for Document AI with Uni… |  | 2022-04-18 |
| LiLT[EN-R]BASE | LiLT: A Simple yet Effective Language-Independent… |  | 2022-02-28 |
| LayoutLMv2LARGE | LayoutLMv2: Multi-modal Pre-training for Visually… |  | 2020-12-29 |
| TILT-Large | Going Full-TILT Boogie on Document Understanding … |  | 2021-02-18 |
| DocFormer large | DocFormer: End-to-End Transformer for Document Un… |  | 2021-06-22 |
| LayoutLMv3BASE | LayoutLMv3: Pre-training for Document AI with Uni… |  | 2022-04-18 |
| Donut | OCR-free Document Understanding Transformer |  | 2021-11-30 |
| TILT-Base | Going Full-TILT Boogie on Document Understanding … |  | 2021-02-18 |
| LayoutLMv2BASE | LayoutLMv2: Multi-modal Pre-training for Visually… |  | 2020-12-29 |
| LayoutXLM | LayoutXLM: Multimodal Pre-training for Multilingu… |  | 2021-04-18 |
| StrucTexTv2 (large) | StrucTexTv2: Masked Visual-Textual Prediction for… |  | 2023-03-01 |
| Pre-trained LayoutLM | LayoutLM: Pre-training of Text and Layout for Doc… |  | 2019-12-31 |
| DoPTA | DoPTA: Improving Document Layout Analysis using P… |  | 2024-12-17 |
| StrucTexTv2 (small) | StrucTexTv2: Masked Visual-Textual Prediction for… |  | 2023-03-01 |
| VLCDoC | VLCDoC: Vision-Language Contrastive Pre-Training … |  | 2022-05-24 |
| TransferDoc | GlobalDoc: A Cross-Modal Vision-Language Framewor… |  | 2023-09-11 |
| Multimodal (ResNet50) | Multimodal Side-Tuning for Document Classification |  | 2023-01-16 |
| DiT-L | DiT: Self-supervised Pre-training for Document Im… |  | 2022-03-04 |
| Pre-trained EfficientNet | Improving accuracy and speeding up Document Image… |  | 2020-06-16 |
| Transfer Learning from VGG16 trained on Imagenet | Document Image Classification with Intra-Domain T… |  | 2018-01-29 |
| Multimodal (MobileNetV2) | Multimodal Side-Tuning for Document Classification |  | 2023-01-16 |
| DiT-B | DiT: Self-supervised Pre-training for Document Im… |  | 2022-03-04 |
| BEiT-B | BEiT: BERT Pre-Training of Image Transformers |  | 2021-06-15 |
| Transfer Learning from AlexNet, VGG-16, GoogLeNet and ResNet50 | Cutting the Error by Half: Investigation of Very … |  | 2017-04-11 |
| AlexNet + spatial pyramidal pooling + image resizing | Analysis of Convolutional Neural Networks for Doc… |  | 2017-08-10 |
| DeiT-B | Training data-efficient image transformers & dist… |  | 2020-12-23 |