Large Scale Movie Description Challenge
This dataset contains 118,081 short video clips extracted from 202 movies. Each video has a caption, either extracted from the movie script or from transcribed DVS (descriptive video services) for the visually impaired. The validation set contains 7408 clips and evaluation is performed on a test set of 1000 videos from movies disjoint from the training and val sets.
Source: Use What You Have: Video Retrieval Using Representations From Collaborative Experts
Image Source: https://sites.google.com/site/describingmovies/
Variants: LSMDC
This dataset is used in 3 benchmarks:
Recent papers with results on this dataset: