VLIS (Lynx)
|
VLIS: Unimodal Language Models Guide Multimodal L…
|
80.00
|
2023-10-15
|
|
VLIS (LLaVA)
|
VLIS: Unimodal Language Models Guide Multimodal L…
|
73.00
|
2023-10-15
|
|
Ground-truth Caption -> GPT3 (Oracle)
|
Breaking Common Sense: WHOOPS! A Vision-and-Langu…
|
68.00
|
2023-03-13
|
|
Predicted Caption -> GPT3
|
Breaking Common Sense: WHOOPS! A Vision-and-Langu…
|
33.00
|
2023-03-13
|
|
BLIP2 FlanT5-XXL (Fine-tuned)
|
Breaking Common Sense: WHOOPS! A Vision-and-Langu…
|
27.00
|
2023-03-13
|
|
BLIP2 FlanT5-XL (Fine-tuned)
|
Breaking Common Sense: WHOOPS! A Vision-and-Langu…
|
15.00
|
2023-03-13
|
|
BLIP2 FlanT5-XXL (Zero-shot)
|
Breaking Common Sense: WHOOPS! A Vision-and-Langu…
|
0.00
|
2023-03-13
|
|