TC-CLIP
|
Leveraging Temporal Contextualization for Video A…
|
78.10
|
2024-04-15
|
|
IMP-MoE-L
|
Alternating Gradient Descent and Mixture-of-Exper…
|
76.80
|
2023-05-10
|
|
OST
|
OST: Refining Text Knowledge with Optimal Spatio-…
|
75.10
|
2023-11-30
|
|
MAXI
|
MAtch, eXpand and Improve: Unsupervised Finetunin…
|
71.60
|
2023-03-15
|
|
OTI(ViT-L/14)
|
Orthogonal Temporal Interpolation for Zero-Shot V…
|
70.60
|
2023-08-14
|
|
VideoCoCa
|
VideoCoCa: Video-Text Modeling with Zero-Shot Tra…
|
70.10
|
2022-12-09
|
|
Text4Vis
|
Revisiting Classifier: Transferring Vision-Langua…
|
68.90
|
2022-07-04
|
|
BIKE
|
Bidirectional Cross-Modal Knowledge Exploration f…
|
68.50
|
2022-12-31
|
|
X-CLIP
|
Expanding Language-Image Pretrained Models for Ge…
|
65.20
|
2022-08-04
|
|
LanguageBind
|
LanguageBind: Extending Video-Language Pretrainin…
|
64.10
|
2023-10-03
|
|
ER-ZSAR (ST+Obj)
|
Elaborative Rehearsal for Zero-shot Action Recogn…
|
42.10
|
2021-08-05
|
|
ER-ZSAR (ST)
|
Elaborative Rehearsal for Zero-shot Action Recogn…
|
37.10
|
2021-08-05
|
|
DEM
|
Learning a Deep Embedding Model for Zero-Shot Lea…
|
23.60
|
2016-11-15
|
|
ALE
|
Label-Embedding for Image Classification
|
23.40
|
2015-03-30
|
|
GCN
|
All About Knowledge Graphs for Actions
|
22.30
|
2020-08-28
|
|
SJE(Word Embedding)
|
Evaluation of Output Embeddings for Fine-Grained …
|
22.30
|
2014-09-30
|
|