IIIT5k

Scene Text Recognition Benchmark

Showing 16 results | Metric: Accuracy

Top Performing Models

| Rank | Model | Paper | Accuracy (%) | Date | Code |
|---|---|---|---|---|---|
| 1 | CLIP4STR-L (DataComp-1B) | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | 99.60 | 2023-05-23 | VamosC/CLIP4STR |
| 2 | DTrOCR 105M | DTrOCR: Decoder-only Transformer for Optical Character Recognition | 99.60 | 2023-08-30 | arvindrajan92/DTrOCR |
| 3 | CLIP4STR-L | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | 99.50 | 2023-05-23 | VamosC/CLIP4STR |
| 4 | CLIP4STR-B (DataComp-1B) | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | 99.50 | 2023-05-23 | VamosC/CLIP4STR |
| 5 | CPPD | Context Perception Parallel Decoder for Scene Text Recognition | 99.30 | 2023-07-23 | PaddlePaddle/PaddleOCR, topdu/openocr |
| 6 | CLIP4STR-B | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | 99.20 | 2023-05-23 | VamosC/CLIP4STR |
| 7 | MGP-STR | Multi-Granularity Prediction for Scene Text Recognition | 98.80 | 2022-09-08 | AlibabaResearch/AdvancedLiterateMachinery, topdu/openocr |
| 8 | CCD-ViT-Small(ARD_2.8M) | Self-supervised Character-to-Character Distillation for Text Recognition | 98.00 | 2022-11-01 | tongkunguan/ccd |
| 9 | CCD-ViT-Base(ARD_2.8M) | Self-supervised Character-to-Character Distillation for Text Recognition | 98.00 | 2022-11-01 | tongkunguan/ccd |
| 10 | S-GTR | Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition | 97.50 | 2021-12-24 | adeline-cs/GTR |

All Papers (16)