MUlti-Shot EventS
MUSES is a large-scale dataset for temporal event (action) localization. It focuses on the temporal localization of multi-shot events, which are captured with multiple shots. Such events often appear in edited videos, such as TV shows and movies.
What’s included in MUSES:
Variants: MUSES
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Temporal Action Localization | TemporalMaxer | TemporalMaxer: Maximize Temporal Context with … | 2023-03-16 |
Temporal Action Localization | MUSES | Multi-shot Temporal Event Localization: a … | 2020-12-17 |
Recent papers with results on this dataset: