LeafNet: A large-scale dataset for training image-text models in leaf disease identification
The PlantVillage dataset, with over 54,000 images spanning 14 plant species and 26 disease types, has been widely used for leaf disease classification. However, it is limited in both scale and diversity. To address these limitations, we developed LeafNet, a large-scale dataset designed to support foundation models for leaf disease diagnosis. LeafNet comprises over 186,000 images from 22 crop species, covering 43 fungal diseases, 8 bacterial diseases, 2 mould (oomycete) diseases, 6 viral diseases, and 3 mite-induced diseases, categorized into 97 classes. The dataset was meticulously collected and processed to minimize intra-class variations while ensuring clarity by maintaining a consistent imaging distance. The disease symptom descriptions were curated from reputable sources, including UME, NIH, and published studies, providing high-quality annotations to support AI-driven plant pathology research.
Variants: LeafNet
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Image Classification | SCOLD | A Vision-Language Foundation Model for … | 2025-05-11 |
Recent papers with results on this dataset: