ImageNet-Sketch

Dataset Information
Modalities
Images
Introduced
2019
License
Unknown
Homepage

Overview

ImageNet-Sketch data set consists of 50,889 images, approximately 50 images for each of the 1000 ImageNet classes. The data set is constructed with Google Image queries "sketch of __", where __ is the standard class name. Only within the "black and white" color scheme is searched. 100 images are initially queried for every class, and the pulled images are cleaned by deleting the irrelevant images and images that are for similar but different classes. For some classes, there are less than 50 images after manually cleaning, and then the data set is augmented by flipping and rotating the images.

Source: ImageNet-Sketch
Image Source: https://github.com/HaohanWang/ImageNet-Sketch

Variants: ImageNet-Sketch

Associated Benchmarks

This dataset is used in 3 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Zero-Shot Transfer Image Classification EVA-CLIP-18B EVA-CLIP-18B: Scaling CLIP to 18 … 2024-02-06
Zero-Shot Transfer Image Classification InternVL-C InternVL: Scaling up Vision Foundation … 2023-12-21
Domain Generalization Discrete Adversarial Distillation (ViT-B, 224) Distilling Out-of-Distribution Robustness from Vision-Language … 2023-11-02
Zero-Shot Transfer Image Classification EVA-CLIP-E/14+ EVA-CLIP: Improved Training Techniques for … 2023-03-27
Domain Generalization LLE (ViT-H/14, MAE, Edge Aug) A Whac-A-Mole Dilemma: Shortcuts Come … 2022-12-09
Domain Generalization CAR-FT (CLIP, ViT-L/14@336px) Context-Aware Robust Fine-Tuning 2022-11-29
Zero-Shot Transfer Image Classification AltCLIP AltCLIP: Altering the Language Encoder … 2022-11-12
Domain Generalization CAFormer-B36 (IN21K) MetaFormer Baselines for Vision 2022-10-24
Domain Generalization ConvFormer-B36 MetaFormer Baselines for Vision 2022-10-24
Domain Generalization CAFormer-B36 MetaFormer Baselines for Vision 2022-10-24
Domain Generalization ConvFormer-B36 (IN21K, 384) MetaFormer Baselines for Vision 2022-10-24
Domain Generalization ConvFormer-B36 (IN21K) MetaFormer Baselines for Vision 2022-10-24
Domain Generalization CAFormer-B36 (IN21K, 384) MetaFormer Baselines for Vision 2022-10-24
Domain Generalization GPaCo (ViT-L) Generalized Parametric Contrastive Learning 2022-09-26
Domain Generalization MAE+DAT (ViT-H) Enhance the Visual Representation via … 2022-09-16
Image Classification µ2Net+ (ViT-L/16) A Continual Development Methodology for … 2022-09-15
Zero-Shot Transfer Image Classification CoCa CoCa: Contrastive Captioners are Image-Text … 2022-05-04
Domain Generalization Sequencer2D-L Sequencer: Deep LSTM for Image … 2022-05-04
Domain Generalization Model soups (ViT-G/14) Model soups: averaging weights of … 2022-03-10
Domain Generalization Model soups (BASIC-L) Model soups: averaging weights of … 2022-03-10

Research Papers

Recent papers with results on this dataset: