iNaturalist

Image Classification Benchmark

Performance Over Time

Showing 18 results | Metric: Top 1 Accuracy

Top Performing Models

| Rank | Model | Paper | Top 1 Accuracy (%) | Date | Code |
|---|---|---|---|---|---|
| 1 | AIMv2-3B (448 res) | Multimodal Autoregressive Pre-training of Large Vision Encoders | 85.90 | 2024-11-21 | apple/ml-aim |
| 2 | Hiera-H (448px) | Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | 83.80 | 2023-06-01 | huggingface/pytorch-image-models, facebookresearch/hiera, leondgarse/keras_cv_attention_models, birder/birder |
| 3 | MAE (ViT-H, 448) | Masked Autoencoders Are Scalable Vision Learners | 83.40 | 2021-11-11 | facebookresearch/mae, lightly-ai/lightly, open-mmlab/mmselfsup |
| 4 | AIMv2-3B | Multimodal Autoregressive Pre-training of Large Vision Encoders | 81.50 | 2024-11-21 | apple/ml-aim |
| 5 | AIMv2-1B | Multimodal Autoregressive Pre-training of Large Vision Encoders | 79.70 | 2024-11-21 | apple/ml-aim |
| 6 | AIMv2-H | Multimodal Autoregressive Pre-training of Large Vision Encoders | 77.90 | 2024-11-21 | apple/ml-aim |
| 7 | AIMv2-L | Multimodal Autoregressive Pre-training of Large Vision Encoders | 76.00 | 2024-11-21 | apple/ml-aim |
| 8 | FixSENet-154 | Fixing the train-test resolution discrepancy | 75.40 | 2019-06-14 | facebookresearch/FixRes, libffcv/ffcv-imagenet, kun-woo-park/Deeplearning_project_STL_10 |
| 9 | b_22DeiT-LT(ours) | DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets | 75.10 | 2024-04-03 | val-iisc/DeiT-LT, pwc-1/Paper-8 |
| 10 | SEB+EfficientNet-B5 | On the Eigenvalues of Global Covariance Pooling for Fine-grained Visual Recognition | 72.30 | 2022-05-26 | KingJamesSong/DifferentiableSVD |

All Papers (18)