Multi30K is a large-scale multilingual multimodal dataset for interdisciplinary machine learning research. It extends the Flickr30K dataset with German translations created by professional translators over a subset of the English descriptions, and descriptions crowdsourced independently of the original English descriptions. The dataset was introduced to stimulate multilingual multimodal research.
Variants: multi30k_test_2018_flickr eng-deu, multi30k_test_2018_flickr deu-eng, multi30k_test_2018_flickr, multi30k_test_2017_mscoco, multi30k_test_2017_flickr, multi30k_test_2016_flickr, multi30k_task2_test_2016, Multi30K
This dataset is used in 1 benchmark:
Recent papers with results on this dataset: