Multi-Genre Natural Language Inference
The Multi-Genre Natural Language Inference (MultiNLI) dataset has 433K sentence pairs. Its size and mode of collection are modeled closely like SNLI. MultiNLI offers ten distinct genres (Face-to-face, Telephone, 9/11, Travel, Letters, Oxford University Press, Slate, Verbatim, Goverment and Fiction) of written and spoken English data. There are matched dev/test sets which are derived from the same sources as those in the training set, and mismatched sets which do not closely resemble any seen at training time.
Source: Semantic Sentence Matching with Densely-connectedRecurrent and Co-attentive Information
Variants: MultiNLI MisMatched dev, MultiNLI Matched dev, MultiNLI Dev MisMatched, MultiNLI Dev Matched, MNLI-mm, mnli_mismatched, MNLI-m, MNLI, MultiNLI-mismatched, MultiNLI-matched, multi_nli, MultiNLI Dev, MultiNLI
This dataset is used in 1 benchmark:
Recent papers with results on this dataset: