RecipeQA

Dataset Information
Modalities
Images, Texts
Introduced
2018
License
Homepage

Overview

RecipeQA is a dataset for multimodal comprehension of cooking recipes. It consists of over 36K question-answer pairs automatically generated from approximately 20K unique recipes with step-by-step instructions and images. Each question in RecipeQA involves multiple modalities such as titles, descriptions or images, and working towards an answer requires (i) joint understanding of images and text, (ii) capturing the temporal flow of events, and (iii) making sense of procedural knowledge.

Source: RecipeQA

Variants: RecipeQA

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Question Answering multimodal+LXMERT+ConstrainedMaxPooling Latent Alignment of Procedural Concepts … 2021-01-12

Research Papers

Recent papers with results on this dataset: