MultiNLI

Multi-Genre Natural Language Inference

Dataset Information
Modalities
Texts
Languages
Vietnamese
Introduced
2018
Homepage

Overview

The Multi-Genre Natural Language Inference (MultiNLI) dataset has 433K sentence pairs. Its size and mode of collection are modeled closely like SNLI. MultiNLI offers ten distinct genres (Face-to-face, Telephone, 9/11, Travel, Letters, Oxford University Press, Slate, Verbatim, Goverment and Fiction) of written and spoken English data. There are matched dev/test sets which are derived from the same sources as those in the training set, and mismatched sets which do not closely resemble any seen at training time.

Source: Semantic Sentence Matching with Densely-connectedRecurrent and Co-attentive Information

Variants: MultiNLI MisMatched dev, MultiNLI Matched dev, MultiNLI Dev MisMatched, MultiNLI Dev Matched, MNLI-mm, mnli_mismatched, MNLI-m, MNLI, MultiNLI-mismatched, MultiNLI-matched, multi_nli, MultiNLI Dev, MultiNLI

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Natural Language Inference UnitedSynT5 (3B) First Train to Generate, then … 2024-12-12
Natural Language Inference UnitedSynT5 (335M) First Train to Generate, then … 2024-12-12
Natural Language Inference GPST(unsupervised generative syntactic LM) Generative Pretrained Structured Transformers: Unsupervised … 2024-03-13
Natural Language Inference ELC-BERT-small 24M Not all layers are equally … 2023-11-03
Natural Language Inference LTG-BERT-base 98M Not all layers are equally … 2023-11-03
Natural Language Inference ELC-BERT-base 98M (zero init) Not all layers are equally … 2023-11-03
Natural Language Inference LTG-BERT-small 24M Not all layers are equally … 2023-11-03
Natural Language Inference LM-CPPF RoBERTa-base LM-CPPF: Paraphrasing-Guided Data Augmentation for … 2023-05-29
Natural Language Inference LaMini-F-T5 783M LaMini-LM: A Diverse Herd of … 2023-04-27
Natural Language Inference LaMini-GPT 1.5B LaMini-LM: A Diverse Herd of … 2023-04-27
Natural Language Inference LaMini-T5 738M LaMini-LM: A Diverse Herd of … 2023-04-27
Natural Language Inference T5-Large 738M LaMini-LM: A Diverse Herd of … 2023-04-27
Natural Language Inference GPT-2-XL 1.5B LaMini-LM: A Diverse Herd of … 2023-04-27
Natural Language Inference RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned) LLM.int8(): 8-bit Matrix Multiplication for … 2022-08-15
Natural Language Inference ASA + BERT-base Adversarial Self-Attention for Language Understanding 2022-06-25
Natural Language Inference ASA + RoBERTa Adversarial Self-Attention for Language Understanding 2022-06-25
Natural Language Inference Charformer-Tall Charformer: Fast Character Transformers via … 2021-06-23
Natural Language Inference gMLP-large Pay Attention to MLPs 2021-05-17
Natural Language Inference FNet-Large FNet: Mixing Tokens with Fourier … 2021-05-09
Natural Language Inference BERT-Large FNet: Mixing Tokens with Fourier … 2021-05-09

Research Papers

Recent papers with results on this dataset: