VDD

Varied Drone Dataset for Semantic Segmentation

Dataset Information
Introduced
2023
License
Unknown
Homepage

Overview

Semantic segmentation of drone images is critical for various aerial vision tasks as it provides essential seman- tic details to understand scenes on the ground. Ensuring high accuracy of semantic segmentation models for drones requires access to diverse, large-scale, and high-resolution datasets, which are often scarce in the field of aerial image processing. While existing datasets typically focus on urban scenes and are relatively small, our Varied Drone Dataset (VDD) addresses these limitations by offering a large-scale, densely labeled collection of 400 high-resolution images spanning 7 classes. This dataset features various scenes in urban, industrial, rural, and natural areas, captured from different camera angles and under diverse lighting conditions.

Variants: VDD

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Semantic Segmentation Segformer-B0 VDD: Varied Drone Dataset for … 2023-05-23
Semantic Segmentation Segformer-B2 VDD: Varied Drone Dataset for … 2023-05-23
Semantic Segmentation UperNet(Swin-L) VDD: Varied Drone Dataset for … 2023-05-23
Semantic Segmentation UperNet(Swin-T) VDD: Varied Drone Dataset for … 2023-05-23
Semantic Segmentation Mask2Former(ResNet-50) VDD: Varied Drone Dataset for … 2023-05-23
Semantic Segmentation Segformer-B5 VDD: Varied Drone Dataset for … 2023-05-23
Semantic Segmentation Mask2Former(Swin-T) VDD: Varied Drone Dataset for … 2023-05-23

Research Papers

Recent papers with results on this dataset: