MUSES

MUlti-Shot EventS

Dataset Information
Introduced: 2020
License: Unknown

Overview

MUSES is a large-scale dataset for temporal event (action) localization. It targets multi-shot events, that is, events captured across multiple camera shots rather than in a single continuous shot. Such events are common in edited videos such as TV shows and movies.
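To make the notion of a multi-shot event concrete, the sketch below counts how many shots an annotated time segment overlaps. It is only an illustration: the function name `shots_spanned`, the shot boundary times, and the event segment are all hypothetical and do not reflect the dataset's actual annotation format.

```python
from bisect import bisect_left, bisect_right

def shots_spanned(shot_starts, start, end):
    """Count the shots that overlap the half-open segment [start, end).

    shot_starts: sorted times (seconds) at which each shot begins;
                 the first entry is assumed to be <= start.
    """
    if not start < end:
        raise ValueError("segment must have positive duration")
    first = bisect_right(shot_starts, start) - 1  # shot containing `start`
    last = bisect_left(shot_starts, end) - 1      # last shot beginning before `end`
    return last - first + 1

# Hypothetical video with five shots and one annotated event.
shot_starts = [0.0, 4.2, 9.8, 15.0, 21.5]
event_start, event_end = 3.0, 12.0

n_shots = shots_spanned(shot_starts, event_start, event_end)
is_multi_shot = n_shots > 1
print(n_shots, is_multi_shot)
```

Here the event overlaps the first three shots, so it counts as multi-shot. An event lying entirely within one shot would return 1.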

What’s included in MUSES:

  • 3,697 videos of TV and movie dramas
  • 716 hours of video in total
  • 25 event categories
  • 652k shots
  • 31,477 annotated event instances
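The totals above imply some rough per-video averages. The short calculation below derives them; it assumes "652k" means about 652,000 shots, so the shot-related figures are approximate, and the averages say nothing about how the values are actually distributed across videos.

```python
# Totals as listed above ("652k" is taken as 652,000, an approximation).
videos = 3_697
hours = 716
shots = 652_000
instances = 31_477

minutes_per_video = hours * 60 / videos    # average video length in minutes
shots_per_video = shots / videos           # average number of shots per video
instances_per_video = instances / videos   # average annotated events per video
seconds_per_shot = hours * 3600 / shots    # average shot length in seconds

print(f"{minutes_per_video:.1f} min/video")
print(f"{shots_per_video:.0f} shots/video")
print(f"{instances_per_video:.1f} instances/video")
print(f"{seconds_per_shot:.1f} s/shot")
```

This works out to roughly 11.6 minutes, 176 shots, and 8.5 annotated instances per video, with an average shot length of about 4 seconds.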

Variants: MUSES

Associated Benchmarks

This dataset is used in 1 benchmark.

Recent Benchmark Submissions

Task                          Model          Paper                                            Date
Temporal Action Localization  TemporalMaxer  TemporalMaxer: Maximize Temporal Context with …  2023-03-16
Temporal Action Localization  MUSES          Multi-shot Temporal Event Localization: a …      2020-12-17

Research Papers

Recent papers with results on this dataset: