MNIST

Dataset Information
Modalities
Images
Languages
English
Introduced
1998
License
Unknown
Homepage

Overview

The MNIST database (Modified National Institute of Standards and Technology database) is a large collection of handwritten digits. It has a training set of 60,000 examples, and a test set of 10,000 examples. It is a subset of a larger NIST Special Database 3 (digits written by employees of the United States Census Bureau) and Special Database 1 (digits written by high school students) which contain monochrome images of handwritten digits. The digits have been size-normalized and centered in a fixed-size image. The original black and white (bilevel) images from NIST were size normalized to fit in a 20x20 pixel box while preserving their aspect ratio. The resulting images contain grey levels as a result of the anti-aliasing technique used by the normalization algorithm. the images were centered in a 28x28 image by computing the center of mass of the pixels, and translating the image so as to position this point at the center of the 28x28 field.

Source: http://yann.lecun.com/exdb/mnist/
Image Source: https://en.wikipedia.org/wiki/MNIST_database#/media/File:MnistExamples.png

Variants: USPS-to-MNIST, MNIST-to-USPS, Rotating MNIST, Noisy MNIST (Motion), Noisy MNIST (Contrast), Noisy MNIST (AWGN), MNIST (Conditional), Indexed Rotating MNIST, Rotated MNIST, Moving MNIST, Sequential MNIST, SVNH-to-MNIST, MNIST-test, MNIST-full, MNIST, 75 Superpixel MNIST

Associated Benchmarks

This dataset is used in 31 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Graph Classification GCN+ Unlocking the Potential of Classic … 2025-02-13
Graph Classification GatedGCN+ Unlocking the Potential of Classic … 2025-02-13
Density Estimation DVP-VAE Hierarchical VAE with a Diffusion-based … 2024-12-02
Image Classification R-ExplaiNet-22 (single model) Learning local discrete features in … 2024-10-31
Image Classification DGMMC-S Performance of Gaussian Mixture Model … 2024-10-17
Image Classification CNN+ Wilson-Cowan model RNN Learning in Wilson-Cowan model for … 2024-06-24
Image Classification Wilson-Cowan model RNN Learning in Wilson-Cowan model for … 2024-06-24
Image Classification rKAN rKAN: Rational Kolmogorov-Arnold Networks 2024-06-20
Image Clustering TURTLE (CLIP + DINOv2) Let Go of Your Labels … 2024-06-11
Image Classification fKAN fKAN: Fractional Kolmogorov-Arnold Networks with … 2024-06-11
Graph Classification NeuralWalker Learning Long Range Dependencies on … 2024-06-05
Image Classification Hypervector Tsetlin Machine Exploring Effects of Hyperdimensional Vectors … 2024-06-04
Graph Classification CKGCN CKGConv: General Graph Convolution with … 2024-04-21
Density Estimation PaddingFlow PaddingFlow: Improving Normalizing Flows with … 2024-03-13
Graph Classification ESA (Edge set attention, no positional encodings, tuned) An end-to-end attention-based approach for … 2024-02-16
Graph Classification ESA (Edge set attention, no positional encodings) An end-to-end attention-based approach for … 2024-02-16
Graph Classification TIGT Topology-Informed Graph Transformer 2024-02-03
Image Classification GECCO A Single Graph Convolution Is … 2024-02-01
Graph Classification EIGENFORMER Graph Transformers without Positional Encodings 2024-01-31
Image Classification Convolutional PMM (Parametric Matrix Model) Parametric Matrix Models 2024-01-22

Research Papers

Recent papers with results on this dataset: