Multi30K

Dataset Information
Introduced
2016
License
Unknown
Homepage

Overview

Multi30K is a large-scale multilingual multimodal dataset for interdisciplinary machine learning research. It extends the Flickr30K dataset with German translations created by professional translators over a subset of the English descriptions, and descriptions crowdsourced independently of the original English descriptions. The dataset was introduced to stimulate multilingual multimodal research.

Variants: multi30k_test_2018_flickr eng-deu, multi30k_test_2018_flickr deu-eng, multi30k_test_2018_flickr, multi30k_test_2017_mscoco, multi30k_test_2017_flickr, multi30k_test_2016_flickr, multi30k_task2_test_2016, Multi30K

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Multimodal Machine Translation ERNIE-UniX2 ERNIE-UniX2: A Unified Cross-lingual Cross-modal … 2022-11-09
Multimodal Machine Translation IKD-MMT Distill the Image to Nowhere: … 2022-10-10
Multimodal Machine Translation Gumbel-Attention MMT Gumbel-Attention for Multi-modal Machine Translation 2021-03-16
Multimodal Machine Translation ImagiT Generative Imagination Elevates Machine Translation 2020-09-21
Multimodal Machine Translation DCCN Dynamic Context-guided Capsule Network for … 2020-09-04
Multimodal Machine Translation PS-KD Self-Knowledge Distillation with Progressive Refinement … 2020-06-22
Multimodal Machine Translation Caglayan Multimodal Machine Translation through Visuals … 2019-11-28
Multimodal Machine Translation del+obj Distilling Translations with Visual Awareness 2019-06-18
Multimodal Machine Translation del Distilling Translations with Visual Awareness 2019-06-18
Multimodal Machine Translation VMMTF Latent Variable Model for Multi-modal … 2018-11-01
Multimodal Machine Translation VAG-NMT A Visual Attention Grounding Neural … 2018-08-24
Multimodal Machine Translation Transformer Attention Is All You Need 2017-06-12
Multimodal Machine Translation NMTSRC+IMG Doubly-Attentive Decoder for Multi-modal Neural … 2017-02-04
Multimodal Machine Translation IMGD Incorporating Global Visual Features into … 2017-01-23

Research Papers

Recent papers with results on this dataset: