The Sports-1M dataset consists of over a million videos from YouTube. The videos in the dataset can be obtained through the YouTube URL specified by the authors. Approximately 7% (as of 2016) of the videos have been removed by the YouTube uploaders since the dataset was compiled. However, there are still over a million videos in the dataset with 487 sports-related categories with 1,000 to 3,000 videos per category. The videos are automatically labelled with 487 sports classes using the YouTube Topics API by analyzing the text metadata associated with the videos (e.g. tags, descriptions). Approximately 5% of the videos are annotated with more than one class.
Source: Review of Action Recognition and Detection Methods
Image Source: Computer Vision for Sports
Variants: Sports-1M
This dataset is used in 2 benchmarks:
Task | Model | Paper | Date |
---|---|---|---|
Action Recognition In Videos | G-Blend | What Makes Training Multi-Modal Classification … | 2019-05-29 |
Action Recognition | ip-CSN-101 (RGB) | Video Classification with Channel-Separated Convolutional … | 2019-04-04 |
Action Recognition | ip-CSN-152 (RGB) | Video Classification with Channel-Separated Convolutional … | 2019-04-04 |
Action Recognition | R[2+1]D-Flow-32frame | A Closer Look at Spatiotemporal … | 2017-11-30 |
Action Recognition | R[2+1]D-Two-Stream-32frame | A Closer Look at Spatiotemporal … | 2017-11-30 |
Action Recognition | R[2+1]D-RGB-32frame | A Closer Look at Spatiotemporal … | 2017-11-30 |
Action Recognition | P3D | Learning Spatio-Temporal Representation with Pseudo-3D … | 2017-11-28 |
Action Recognition In Videos | LSTM +Pretrained on YT-8M | YouTube-8M: A Large-Scale Video Classification … | 2016-09-27 |
Action Recognition | Conv pooling | Beyond Short Snippets: Deep Networks … | 2015-03-31 |
Action Recognition | C3D | Learning Spatiotemporal Features with 3D … | 2014-12-02 |
Recent papers with results on this dataset: