ViNLI

Name: ViNLI
Published: 2022-10-01
License: Unknown

Vietnamese Natural Language Inference Dataset

Dataset Information

Languages

Vietnamese

Introduced

2022

License

Unknown

Homepage

Official Website

Contents

Overview
Associated Benchmarks
Recent Benchmark Submissions
Research Papers

Overview

A large-scale and high-quality corpus is necessary for studies on NLI for Vietnamese, which can be considered a low-resource language. In this paper, we introduce ViNLI (Vietnamese Natural Language Inference), an open-domain and high-quality corpus for evaluating Vietnamese NLI models, which is created and evaluated with a strict process of quality control. ViNLI comprises over 30,000 human-annotated premise-hypothesis sentence pairs extracted from more than 800 online news articles on 13 distinct topics.

Variants: ViNLI

Associated Benchmarks

This dataset is used in 1 benchmark:

Vietnamese Natural Language Inference - Metrics: 3-class test accuracy, 4-class test accuracy

Recent Benchmark Submissions

Task	Model	Paper	Date
Vietnamese Natural Language Inference	CafeBERT	VLUE: A New Benchmark and …	2024-03-23

Research Papers

Recent papers with results on this dataset:

VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding (2024) -

External Links:

ViNLI

Overview edit

Associated Benchmarks

Recent Benchmark Submissions

Research Papers

Edit Dataset Information

Overview