QD-DETR (only Video w/ PT)
|
Query-Dependent Video Representation for Moment R…
|
61.91
|
2023-03-24
|
|
SG-DETR (w/ PT)
|
Saliency-Guided DETR for Moment Retrieval and Hig…
|
44.70
|
2024-10-02
|
|
FlashVTG
|
FlashVTG: Feature Layering and Adaptive Score Han…
|
44.09
|
2024-12-18
|
|
SG-DETR
|
Saliency-Guided DETR for Moment Retrieval and Hig…
|
43.76
|
2024-10-02
|
|
VideoLights-B-pt
|
VideoLights: Feature Refinement and Cross-Task Al…
|
42.84
|
2024-12-02
|
|
HL-CLIP
|
Unleash the Potential of CLIP for Video Highlight…
|
41.94
|
2024-04-02
|
|
R^2-Tuning
|
$R^2$-Tuning: Efficient Image-to-Video Transfer L…
|
40.75
|
2024-03-31
|
|
CG-DETR (w/ PT)
|
Correlation-Guided Query-Dependency Calibration f…
|
40.71
|
2023-11-15
|
|
NumPro
|
Number it: Temporal Grounding Videos like Flippin…
|
40.54
|
2024-11-15
|
|
UniVTG (w/ PT)
|
UniVTG: Towards Unified Video-Language Temporal G…
|
40.54
|
2023-07-31
|
|
CG-DETR
|
Correlation-Guided Query-Dependency Calibration f…
|
40.33
|
2023-11-15
|
|
LLMEPET
|
Prior Knowledge Integration via LLM Encoding and …
|
40.33
|
2024-07-21
|
|
UMT (w. PT)
|
UMT: Unified Multi-modal Transformers for Joint V…
|
39.12
|
2022-03-23
|
|
QD-DETR
|
Query-Dependent Video Representation for Moment R…
|
39.04
|
2023-03-24
|
|
QD-DETR (only Video)
|
Query-Dependent Video Representation for Moment R…
|
38.94
|
2023-03-24
|
|
QD-DETR (w/ PT)
|
Query-Dependent Video Representation for Moment R…
|
38.52
|
2023-03-24
|
|
UniVTG
|
UniVTG: Towards Unified Video-Language Temporal G…
|
38.20
|
2023-07-31
|
|
UMT
|
UMT: Unified Multi-modal Transformers for Joint V…
|
38.18
|
2022-03-23
|
|
Moment-DETR w/ PT
|
QVHighlights: Detecting Moments and Highlights in…
|
37.43
|
2021-07-20
|
|
VideoChat-T (FT)
|
TimeSuite: Improving MLLMs for Long Video Underst…
|
27.00
|
2024-10-25
|
|