SVT

Street View Text Dataset

Dataset Information
Modalities
Images
License
Unknown
Homepage

Overview

The Street View Text (SVT) dataset was harvested from Google Street View. Image text in this data exhibits high variability and often has low resolution. In dealing with outdoor street level imagery, we note two characteristics. (1) Image text often comes from business signage and (2) business names are easily available through geographic business searches. These factors make the SVT set uniquely suited for word spotting in the wild: given a street view image, the goal is to identify words from nearby businesses.

Note: the dataset has undergone revision since the time it was evaluated in this publication. Please consult the ICCV2011 paper for most up-to-date results.

Source: Street View Text Dataset

Image source: Street View Text Dataset

Variants: SVT

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Scene Text Recognition CLIP4STR-B* An Empirical Study of Scaling … 2023-12-29
Scene Text Recognition DTrOCR 105M DTrOCR: Decoder-only Transformer for Optical … 2023-08-30
Scene Text Recognition CPPD Context Perception Parallel Decoder for … 2023-07-23
Scene Text Recognition DiffusionSTR DiffusionSTR: Diffusion Model for Scene … 2023-06-29
Scene Text Recognition CLIP4STR-H (DFN-5B) CLIP4STR: A Simple Baseline for … 2023-05-23
Scene Text Recognition CLIP4STR-B CLIP4STR: A Simple Baseline for … 2023-05-23
Scene Text Recognition CLIP4STR-L (DataComp-1B) CLIP4STR: A Simple Baseline for … 2023-05-23
Scene Text Recognition CLIP4STR-L CLIP4STR: A Simple Baseline for … 2023-05-23
Scene Text Recognition NRTR+TPS++ TPS++: Attention-Enhanced Thin-Plate Spline for … 2023-05-09
Scene Text Recognition CCD-ViT-Base(ARD_2.8M) Self-supervised Character-to-Character Distillation for Text … 2022-11-01
Scene Text Recognition CCD-ViT-Small(ARD_2.8M) Self-supervised Character-to-Character Distillation for Text … 2022-11-01
Scene Text Recognition CCD-ViT-Tiny(ARD_2.8M) Self-supervised Character-to-Character Distillation for Text … 2022-11-01
Scene Text Recognition MGP-STR Multi-Granularity Prediction for Scene Text … 2022-09-08
Scene Text Recognition PARSeq Scene Text Recognition with Permuted … 2022-07-14
Scene Text Recognition SIGA_T Self-supervised Implicit Glyph Attention for … 2022-03-07
Scene Text Recognition SAFL SAFL: A Self-Attention Scene Text … 2022-01-01
Scene Text Recognition S-GTR Visual Semantics Allow for Textual … 2021-12-24
Scene Text Recognition MATRN Multi-modal Text Recognition Networks: Interactive … 2021-11-30
Scene Text Recognition CDistNet (Ours) CDistNet: Perceiving Multi-Domain Character Distance … 2021-11-22
Scene Text Recognition Yet Another Text Recognizer Why You Should Try the … 2021-07-29

Research Papers

Recent papers with results on this dataset: