MultiNLI

Name: MultiNLI
Published: 2018-01-01
License: Custom (multiple, see the paper)

Multi-Genre Natural Language Inference

Dataset Information

Modalities

Texts

Languages

Vietnamese

Introduced

2018

License

Custom (multiple, see the paper)

Homepage

Official Website

Contents

Overview
Associated Benchmarks
Recent Benchmark Submissions
Research Papers

Overview

The Multi-Genre Natural Language Inference (MultiNLI) dataset has 433K sentence pairs. Its size and mode of collection are modeled closely like SNLI. MultiNLI offers ten distinct genres (Face-to-face, Telephone, 9/11, Travel, Letters, Oxford University Press, Slate, Verbatim, Goverment and Fiction) of written and spoken English data. There are matched dev/test sets which are derived from the same sources as those in the training set, and mismatched sets which do not closely resemble any seen at training time.

Source: Semantic Sentence Matching with Densely-connectedRecurrent and Co-attentive Information

Variants: MultiNLI MisMatched dev, MultiNLI Matched dev, MultiNLI Dev MisMatched, MultiNLI Dev Matched, MNLI-mm, mnli_mismatched, MNLI-m, MNLI, MultiNLI-mismatched, MultiNLI-matched, multi_nli, MultiNLI Dev, MultiNLI

Associated Benchmarks

This dataset is used in 1 benchmark:

Natural Language Inference - Metrics: Matched, Mismatched, Accuracy, Dev Matched, Dev Mismatched

Recent Benchmark Submissions

Task	Model	Paper	Date
Natural Language Inference	UnitedSynT5 (3B)	First Train to Generate, then …	2024-12-12
Natural Language Inference	UnitedSynT5 (335M)	First Train to Generate, then …	2024-12-12
Natural Language Inference	GPST(unsupervised generative syntactic LM)	Generative Pretrained Structured Transformers: Unsupervised …	2024-03-13
Natural Language Inference	ELC-BERT-small 24M	Not all layers are equally …	2023-11-03
Natural Language Inference	LTG-BERT-base 98M	Not all layers are equally …	2023-11-03
Natural Language Inference	ELC-BERT-base 98M (zero init)	Not all layers are equally …	2023-11-03
Natural Language Inference	LTG-BERT-small 24M	Not all layers are equally …	2023-11-03
Natural Language Inference	LM-CPPF RoBERTa-base	LM-CPPF: Paraphrasing-Guided Data Augmentation for …	2023-05-29
Natural Language Inference	LaMini-F-T5 783M	LaMini-LM: A Diverse Herd of …	2023-04-27
Natural Language Inference	LaMini-GPT 1.5B	LaMini-LM: A Diverse Herd of …	2023-04-27
Natural Language Inference	LaMini-T5 738M	LaMini-LM: A Diverse Herd of …	2023-04-27
Natural Language Inference	T5-Large 738M	LaMini-LM: A Diverse Herd of …	2023-04-27
Natural Language Inference	GPT-2-XL 1.5B	LaMini-LM: A Diverse Herd of …	2023-04-27
Natural Language Inference	RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned)	LLM.int8(): 8-bit Matrix Multiplication for …	2022-08-15
Natural Language Inference	ASA + BERT-base	Adversarial Self-Attention for Language Understanding	2022-06-25
Natural Language Inference	ASA + RoBERTa	Adversarial Self-Attention for Language Understanding	2022-06-25
Natural Language Inference	Charformer-Tall	Charformer: Fast Character Transformers via …	2021-06-23
Natural Language Inference	gMLP-large	Pay Attention to MLPs	2021-05-17
Natural Language Inference	FNet-Large	FNet: Mixing Tokens with Fourier …	2021-05-09
Natural Language Inference	BERT-Large	FNet: Mixing Tokens with Fourier …	2021-05-09

Research Papers

Recent papers with results on this dataset:

External Links:

MultiNLI

Overview edit

Associated Benchmarks

Recent Benchmark Submissions

Research Papers

Edit Dataset Information

Overview