TV show Retrieval
A new multimodal retrieval dataset. TVR requires systems to understand both videos and their associated subtitle (dialogue) texts, making it more realistic. The dataset contains 109K queries collected on 21.8K videos from 6 TV shows of diverse genres, where each query is associated with a tight temporal window.
Source: TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Variants: TVR
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Video Retrieval | Hero w/ pre-training | HERO: Hierarchical Encoder for Video+Language … | 2020-05-01 |
Video Retrieval | XML (Lei et al., 2020) | TVR: A Large-Scale Dataset for … | 2020-01-24 |
Recent papers with results on this dataset: