MARVAL
|
A New Path: Scaling Vision-and-Language Navigatio…
|
66.76
|
2022-10-06
|
|
EnvEdit-PT
|
EnvEdit: Environment Editing for Vision-and-Langu…
|
64.61
|
2022-03-29
|
|
HAMT
|
History Aware Multimodal Transformer for Vision-a…
|
59.94
|
2021-10-25
|
|
CLEAR-CLIP
|
How Much Can CLIP Benefit Vision-and-Language Tas…
|
53.69
|
2021-07-13
|
|
Monolingual Baseline
|
Room-Across-Room: Multilingual Vision-and-Languag…
|
41.05
|
2020-10-15
|
|
Multilingual Baseline
|
Room-Across-Room: Multilingual Vision-and-Languag…
|
36.81
|
2020-10-15
|
|