ML Research Wiki / Benchmarks / Image Reconstruction / ImageNet

ImageNet

Image Reconstruction Benchmark

Performance Over Time

📊 Showing 15 results | 📏 Metric: FID

Top Performing Models

Rank Model Paper FID Date Code
1 MGVQ (16x16x8) MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization 0.49 2025-07-14 📦 MKJia/MGVQ
2 MGVQ (16x16x4) MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization 0.64 2025-07-14 📦 MKJia/MGVQ
3 GigaTok-XL-XXL GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation 0.79 2025-04-11 📦 SilentView/GigaTok
4 OptVQ (16x16x8) Preventing Local Pitfalls in Vector Quantization via Optimal Transport 0.91 2024-12-19 📦 zbr17/OptVQ
5 OptVQ (16x16x4) Preventing Local Pitfalls in Vector Quantization via Optimal Transport 1.00 2024-12-19 📦 zbr17/OptVQ
6 IBQ (16x16) Taming Scalable Visual Tokenizer for Autoregressive Image Generation 1.00 2024-12-03 📦 tencentarc/seed-voken 📦 tencentarc/open-magvit2
7 Mo-VQGAN (16x16x4) MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation 1.12 2022-09-19 📦 ai-forever/Kandinsky-2 📦 ai-forever/movqgan
8 Open-Magvit2 (16x16) Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation 1.17 2024-09-06 📦 tencentarc/open-magvit2 📦 tencentarc/seed-voken
9 ViT-VQGAN (16x16) Vector-quantized Image Modeling with Improved VQGAN 1.28 2021-10-09 📦 lucidrains/DALLE2-pytorch 📦 thuanz123/enhancing-transformers 📦 thuangb/enhancing-transformers 📦 ai-forever/movqgan 📦 CuddleSabe/VQGAN
10 MaskBit (16x16) MaskBit: Embedding-free Image Generation via Bit Tokens 1.66 2024-09-24 📦 markweberdev/maskbit

All Papers (15)