2785 datasets with active ML research benchmarks
Most frequently used datasets in research benchmarks:
The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to probe large language models and extrapolate their …
The MNIST database (Modified National Institute of Standards and Technology database) is a large collection of handwritten digits. It has …
The ImageNet dataset contains 14,197,122 images annotated according to the WordNet hierarchy. Since 2010 the dataset has been used in the …
The STL-10 is an image dataset derived from ImageNet and popularly used to evaluate algorithms of unsupervised feature learning or …
The CIFAR-100 dataset (Canadian Institute for Advanced Research, 100 classes) is a subset of the Tiny Images dataset and consists …
The Caltech-UCSD Birds-200-2011 (CUB-200-2011) dataset is the most widely used dataset for fine-grained visual categorization tasks. It contains 11,788 images of …
The nuScenes dataset is a large-scale autonomous driving dataset. The dataset has 3D bounding boxes for 1000 scenes collected in …
Fashion-MNIST is a dataset comprising 28×28 grayscale images of 70,000 fashion products from 10 categories, with 7,000 images per …
The COCO (Common Objects in Context) dataset is a large-scale object detection, segmentation, and captioning dataset. It is designed to …
The dataset contains training and evaluation data for 12 languages: Vietnamese, Romanian, Latvian, Czech, Polish, …
KITTI (Karlsruhe Institute of Technology and Toyota Technological Institute) is one of the most popular datasets for use in mobile …
The Food-101 dataset consists of 101 food categories with 750 training and 250 test images per category, making a total …
The CelebFaces Attributes dataset contains 202,599 face images of size 178×218 from 10,177 celebrities, each annotated with 40 binary labels …
ApolloCar3D is a dataset that contains 5,277 driving images and over 60K car instances, where each car is fitted with …
The Human3.6M dataset is one of the largest motion capture datasets, which consists of 3.6 million human poses and corresponding …
The UCF101 dataset is an extension of UCF50 and consists of 13,320 video clips classified into 101 categories. These …
The Caltech101 dataset contains images from 101 object categories (e.g., “helicopter”, “elephant”, and “chair”) and a background category that …
The Stanford Cars dataset consists of 196 classes of cars with a total of 16,185 images, taken from the rear. …
This directory contains datasets used in machine learning research benchmarks. Each dataset page includes: