ImageNet-Sketch

Name: ImageNet-Sketch
Published: 2019-05-29
License: Unknown

Dataset Information

Modalities

Images

Introduced

2019

License

Unknown

Homepage

Official Website

Contents

Overview
Associated Benchmarks
Recent Benchmark Submissions
Research Papers

Overview

ImageNet-Sketch data set consists of 50,889 images, approximately 50 images for each of the 1000 ImageNet classes. The data set is constructed with Google Image queries "sketch of __", where __ is the standard class name. Only within the "black and white" color scheme is searched. 100 images are initially queried for every class, and the pulled images are cleaned by deleting the irrelevant images and images that are for similar but different classes. For some classes, there are less than 50 images after manually cleaning, and then the data set is augmented by flipping and rotating the images.

Source: ImageNet-Sketch
Image Source: https://github.com/HaohanWang/ImageNet-Sketch

Variants: ImageNet-Sketch

Associated Benchmarks

This dataset is used in 3 benchmarks:

Image Classification - Metrics: Accuracy
Zero-Shot Transfer Image Classification - Metrics: Accuracy (Private)
Domain Generalization - Metrics: Top-1 accuracy

Recent Benchmark Submissions

Task	Model	Paper	Date
Zero-Shot Transfer Image Classification	EVA-CLIP-18B	EVA-CLIP-18B: Scaling CLIP to 18 …	2024-02-06
Zero-Shot Transfer Image Classification	InternVL-C	InternVL: Scaling up Vision Foundation …	2023-12-21
Domain Generalization	Discrete Adversarial Distillation (ViT-B, 224)	Distilling Out-of-Distribution Robustness from Vision-Language …	2023-11-02
Zero-Shot Transfer Image Classification	EVA-CLIP-E/14+	EVA-CLIP: Improved Training Techniques for …	2023-03-27
Domain Generalization	LLE (ViT-H/14, MAE, Edge Aug)	A Whac-A-Mole Dilemma: Shortcuts Come …	2022-12-09
Domain Generalization	CAR-FT (CLIP, ViT-L/14@336px)	Context-Aware Robust Fine-Tuning	2022-11-29
Zero-Shot Transfer Image Classification	AltCLIP	AltCLIP: Altering the Language Encoder …	2022-11-12
Domain Generalization	CAFormer-B36 (IN21K)	MetaFormer Baselines for Vision	2022-10-24
Domain Generalization	ConvFormer-B36	MetaFormer Baselines for Vision	2022-10-24
Domain Generalization	CAFormer-B36	MetaFormer Baselines for Vision	2022-10-24
Domain Generalization	ConvFormer-B36 (IN21K, 384)	MetaFormer Baselines for Vision	2022-10-24
Domain Generalization	ConvFormer-B36 (IN21K)	MetaFormer Baselines for Vision	2022-10-24
Domain Generalization	CAFormer-B36 (IN21K, 384)	MetaFormer Baselines for Vision	2022-10-24
Domain Generalization	GPaCo (ViT-L)	Generalized Parametric Contrastive Learning	2022-09-26
Domain Generalization	MAE+DAT (ViT-H)	Enhance the Visual Representation via …	2022-09-16
Image Classification	µ2Net+ (ViT-L/16)	A Continual Development Methodology for …	2022-09-15
Zero-Shot Transfer Image Classification	CoCa	CoCa: Contrastive Captioners are Image-Text …	2022-05-04
Domain Generalization	Sequencer2D-L	Sequencer: Deep LSTM for Image …	2022-05-04
Domain Generalization	Model soups (ViT-G/14)	Model soups: averaging weights of …	2022-03-10
Domain Generalization	Model soups (BASIC-L)	Model soups: averaging weights of …	2022-03-10

Research Papers

Recent papers with results on this dataset:

External Links:

ImageNet-Sketch

Overview edit

Associated Benchmarks

Recent Benchmark Submissions

Research Papers

Edit Dataset Information

Overview