RA-VQAv2 w/ PreFLMR
|
PreFLMR: Scaling Up Fine-Grained Late-Interaction…
|
30.65
|
2024-02-13
|
|
PaLI-X
|
PaLI-X: On Scaling up a Multilingual Vision and L…
|
24.00
|
2023-05-29
|
|
CLIP + FiD
|
Can Pre-trained Vision and Language Models Answer…
|
20.90
|
2023-02-23
|
|
CLIP + PaLM (540B)
|
Can Pre-trained Vision and Language Models Answer…
|
20.40
|
2023-02-23
|
|
PaLI
|
Can Pre-trained Vision and Language Models Answer…
|
19.70
|
2023-02-23
|
|
BLIP2
|
BLIP-2: Bootstrapping Language-Image Pre-training…
|
14.60
|
2023-01-30
|
|