The SumMe dataset is a video summarization dataset consisting of 25 videos, each annotated with at least 15 human summaries (390 in total).
Source: https://gyglim.github.io/me/vsum/index.html
Variants: SumMe
This dataset is used in 1 benchmark:
| Task | Model | Paper | Date |
|---|---|---|---|
| Video Summarization | CSTA | CSTA: CNN-based Spatiotemporal Attention for … | 2024-05-20 |
| Video Summarization | VASNet | Summarizing Videos with Attention | 2018-12-05 |
| Video Summarization | M-AVS | Video Summarization with Attention-Based Encoder-Decoder … | 2017-08-31 |
Recent papers with results on this dataset: