Oscar
|
Oscar: Object-Semantics Aligned Pre-training for …
|
98.30
|
2020-04-13
|
|
BLIP-2 ViT-G (fine-tuned)
|
BLIP-2: Bootstrapping Language-Image Pre-training…
|
68.30
|
2023-01-30
|
|
VisualSparta
|
VisualSparta: An Embarrassingly Simple Approach t…
|
68.20
|
2021-01-01
|
|
BLIP-2 ViT-L (fine-tuned)
|
BLIP-2: Bootstrapping Language-Image Pre-training…
|
66.30
|
2023-01-30
|
|
FLAVA (zero-shot)
|
FLAVA: A Foundational Language And Vision Alignme…
|
38.38
|
2021-12-08
|
|
CLIP (zero-shot)
|
FLAVA: A Foundational Language And Vision Alignme…
|
33.29
|
2021-12-08
|
|