| UnLoc-L | UnLoc: A Unified Framework for Video Localization… | 66.10 | 2023-08-21 |
| UnLoc-B | UnLoc: A Unified Framework for Video Localization… | 64.50 | 2023-08-21 |
| DenoiseLoc | Boundary-Denoising for Video Activity Localization | 59.27 | 2023-04-06 |
| SG-DETR (w/ PT) | Saliency-Guided DETR for Moment Retrieval and Hig… | 58.80 | 2024-10-02 |
| SG-DETR | Saliency-Guided DETR for Moment Retrieval and Hig… | 54.10 | 2024-10-02 |
| LLaVA-MR | LLaVA-MR: Large Language-and-Vision Assistant for… | 52.73 | 2024-11-21 |
| FlashVTG | FlashVTG: Feature Layering and Adaptive Score Han… | 52.00 | 2024-12-18 |
| InternVideo2-6B | InternVideo2: Scaling Foundation Models for Multi… | 49.24 | 2024-03-22 |
| CG-DETR (w/ PT) | Correlation-Guided Query-Dependency Calibration f… | 47.97 | 2023-11-15 |
| VideoLights-B-pt | VideoLights: Feature Refinement and Cross-Task Al… | 47.94 | 2024-12-02 |
| LA-DETR | Length-Aware DETR for Robust Moment Retrieval | 47.93 | 2024-12-30 |
| BAM-DETR (w/ audio) | BAM-DETR: Boundary-Aligned Moment Detection Trans… | 46.91 | 2023-11-30 |
| BAM-DETR (w/ PT ASR Captions) | BAM-DETR: Boundary-Aligned Moment Detection Trans… | 46.67 | 2023-11-30 |
| LD-DETR | LD-DETR: Loop Decoder DEtection TRansformer for V… | 46.41 | 2025-01-18 |
| R^2-Tuning | $R^2$-Tuning: Efficient Image-to-Video Transfer L… | 46.17 | 2024-03-31 |
| BAM-DETR | BAM-DETR: Boundary-Aligned Moment Detection Trans… | 45.36 | 2023-11-30 |
| video-mamba-suite | Video Mamba Suite: State Space Model as a Versati… | 45.18 | 2024-03-14 |
| LLMEPET | Prior Knowledge Integration via LLM Encoding and … | 44.05 | 2024-07-21 |
| UVCOM (w/ PT ASR Captions) | Bridging the Gap: A Unified Video Comprehension F… | 43.80 | 2023-11-28 |
| UniVTG (w/ PT) | UniVTG: Towards Unified Video-Language Temporal G… | 43.63 | 2023-07-31 |
| UVCOM | Bridging the Gap: A Unified Video Comprehension F… | 43.18 | 2023-11-28 |
| CG-DETR | Correlation-Guided Query-Dependency Calibration f… | 42.86 | 2023-11-15 |
| QD-DETR (w/ PT) | Query-Dependent Video Representation for Moment R… | 40.62 | 2023-03-24 |
| QD-DETR (w/ audio) | Query-Dependent Video Representation for Moment R… | 40.19 | 2023-03-24 |
| BM-DETR | Background-aware Moment Detection for Video Momen… | 40.08 | 2023-06-05 |
| QD-DETR (only Video w/ PT ASR Captions) | Query-Dependent Video Representation for Moment R… | 40.00 | 2023-03-24 |
| QD-DETR (only Video) | Query-Dependent Video Representation for Moment R… | 39.86 | 2023-03-24 |
| UMT (w/ audio + PT ASR Captions) | UMT: Unified Multi-modal Transformers for Joint V… | 38.08 | 2022-03-23 |
| Moment-DETR (w/ PT ASR Captions) | QVHighlights: Detecting Moments and Highlights in… | 36.14 | 2021-07-20 |
| UMT | UMT: Unified Multi-modal Transformers for Joint V… | 36.12 | 2022-03-23 |
| UniVTG | UniVTG: Towards Unified Video-Language Temporal G… | 35.47 | 2023-07-31 |