SVT

Scene Text Recognition Benchmark

Performance Over Time

Showing 34 results. Metric: Accuracy
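The page does not say how Accuracy is computed. Scene text recognition benchmarks commonly report word-level accuracy: the percentage of cropped word images whose predicted string exactly matches the ground truth. The sketch below illustrates that convention. The function names `normalize` and `word_accuracy` are hypothetical, and the case-insensitive, alphanumeric-only comparison is an assumption about the evaluation protocol, not something this page confirms for every listed model.

```python
import re


def normalize(text: str) -> str:
    """Lowercase and keep only ASCII letters and digits (assumed protocol)."""
    return re.sub(r"[^a-z0-9]", "", text.lower())


def word_accuracy(predictions, targets, normalize_text=True):
    """Return the percentage of predictions that exactly match their target."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if not targets:
        raise ValueError("at least one sample is required")

    correct = 0
    for pred, target in zip(predictions, targets):
        if normalize_text:
            pred, target = normalize(pred), normalize(target)
        # A word counts as correct only if the whole string matches.
        correct += pred == target
    return 100.0 * correct / len(targets)


if __name__ == "__main__":
    preds = ["Hotel", "cafe", "st0p", "exit"]
    truth = ["HOTEL", "cafe", "stop", "exit"]
    print(word_accuracy(preds, truth))                        # 75.0
    print(word_accuracy(preds, truth, normalize_text=False))  # 50.0
```

A single wrong character makes the whole word count as an error, so small differences between top entries can correspond to only a few test images.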

Top Performing Models

| Rank | Model | Paper | Accuracy (%) | Date | Code |
|---|---|---|---|---|---|
| 1 | CLIP4STR-H (DFN-5B) | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | 99.10 | 2023-05-23 | VamosC/CLIP4STR |
| 2 | DTrOCR 105M | DTrOCR: Decoder-only Transformer for Optical Character Recognition | 98.90 | 2023-08-30 | arvindrajan92/DTrOCR |
| 3 | CLIP4STR-B* | An Empirical Study of Scaling Law for OCR | 98.76 | 2023-12-29 | large-ocr-model/large-ocr-model.github.io |
| 4 | MGP-STR | Multi-Granularity Prediction for Scene Text Recognition | 98.60 | 2022-09-08 | AlibabaResearch/AdvancedLiterateMachinery, topdu/openocr |
| 5 | CLIP4STR-L (DataComp-1B) | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | 98.60 | 2023-05-23 | VamosC/CLIP4STR |
| 6 | CPPD | Context Perception Parallel Decoder for Scene Text Recognition | 98.50 | 2023-07-23 | PaddlePaddle/PaddleOCR, topdu/openocr |
| 7 | CLIP4STR-L | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | 98.50 | 2023-05-23 | VamosC/CLIP4STR |
| 8 | CLIP4STR-B | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | 98.30 | 2023-05-23 | VamosC/CLIP4STR |
| 9 | CCD-ViT-Base(ARD_2.8M) | Self-supervised Character-to-Character Distillation for Text Recognition | 97.80 | 2022-11-01 | tongkunguan/ccd |
| 10 | CCD-ViT-Small(ARD_2.8M) | Self-supervised Character-to-Character Distillation for Text Recognition | 96.40 | 2022-11-01 | tongkunguan/ccd |

All Papers (34)