LeafNet

LeafNet: A large-scale dataset for training image-text models in leaf disease identification

Dataset Information
Languages
English
Introduced
2025
License
Unknown
Homepage

Overview

The PlantVillage dataset, with over 54,000 images spanning 14 plant species and 26 disease types, has been widely used for leaf disease classification. However, it is limited in both scale and diversity. To address these limitations, we developed LeafNet, a large-scale dataset designed to support foundation models for leaf disease diagnosis. LeafNet comprises over 186,000 images from 22 crop species, covering 43 fungal diseases, 8 bacterial diseases, 2 mould (oomycete) diseases, 6 viral diseases, and 3 mite-induced diseases, categorized into 97 classes. The dataset was meticulously collected and processed to minimize intra-class variations while ensuring clarity by maintaining a consistent imaging distance. The disease symptom descriptions were curated from reputable sources, including UME, NIH, and published studies, providing high-quality annotations to support AI-driven plant pathology research.

Variants: LeafNet

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Image Classification SCOLD A Vision-Language Foundation Model for … 2025-05-11

Research Papers

Recent papers with results on this dataset: