SWORD

'Scenes with occluded regions' dataset

Dataset Information
Modalities
Images, Videos, 3D
Introduced
2022
License
Homepage

Overview

The new dataset contains around 1,500 train videos and 290 test videos, with 50 frames per video on average.
The dataset was obtained after processing the manually captured video sequences of static real-life urban scenes.
The main property of the dataset is the abundance of close objects and, consequently, the larger prevalence of occlusions.
According to the introduced heuristic, the mean area of occluded image parts for SWORD is approximately five times larger than for RealEstate10k data (14% vs 3% respectively).
This rationalizes the collection and usage of SWORD and explains that SWORD allows training more powerful models despite being of smaller size.

Variants: SWORD

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Novel View Synthesis StereoLayers (2 layers) Stereo Magnification with Multi-Layer Images 2022-01-13
Novel View Synthesis StereoLayers Stereo Magnification with Multi-Layer Images 2022-01-13
Novel View Synthesis StereoLayers (8 layers) Stereo Magnification with Multi-Layer Images 2022-01-13

Research Papers

Recent papers with results on this dataset: