| UnLoc-L | UnLoc: A Unified Framework for Video Localization… | 72.80 | 2023-08-21 |
| Univl | UniVL: A Unified Video and Language Pre-Training … | 70.00 | 2020-02-15 |
| Norton | Multi-granularity Correspondence Learning from Lo… | 69.80 | 2024-01-30 |
| VideoClip | VideoCLIP: Contrastive Pre-training for Zero-shot… | 68.70 | 2021-09-28 |
| VLM | VLM: Task-agnostic Video-Language Model Pre-train… | 68.40 | 2021-05-20 |
| TACo | TACo: Token-aware Cascade Contrastive Learning fo… | 68.40 | 2021-08-23 |
| MIL-NCE | End-to-End Learning of Visual Representations fro… | 61.00 | 2019-12-13 |
| ActBERT | ActBERT: Learning Global-Local Video-Text Represe… | 57.00 | 2020-11-14 |
| CBT | End-to-End Learning of Visual Representations fro… | 53.90 | 2019-12-13 |