Korean Language Understanding Evaluation
The Korean Language Understanding Evaluation (KLUE) benchmark is a collection of datasets for evaluating the natural language understanding capabilities of Korean language models. KLUE consists of 8 diverse and representative tasks, all accessible to anyone without restrictions. With ethical considerations in mind, the annotation guidelines were deliberately designed to yield unambiguous annotations for every dataset. Furthermore, an evaluation system was built and evaluation metrics were carefully chosen for each task, establishing a fair comparison across Korean language models.
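Several KLUE classification tasks are scored with macro-averaged F1, which weights every class equally regardless of its frequency. As a minimal sketch of how such a metric works (a plain-Python illustration, not KLUE's official evaluation code):

```python
def macro_f1(y_true, y_pred):
    """Macro-averaged F1: compute F1 per class, then average with equal class weight."""
    classes = sorted(set(y_true) | set(y_pred))
    f1_scores = []
    for c in classes:
        # Count true positives, false positives, and false negatives for class c.
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != c and p == c)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == c and p != c)
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
        f1_scores.append(f1)
    return sum(f1_scores) / len(f1_scores)

# Example: three classes with one misclassified instance of class 0.
score = macro_f1([0, 0, 1, 1, 2], [0, 1, 1, 1, 2])
```

Because each class contributes equally to the average, macro F1 penalizes models that ignore rare classes, which is why it is a common choice for imbalanced classification benchmarks.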
The KLUE benchmark is composed of 8 tasks:

- Topic Classification (TC)
- Semantic Textual Similarity (STS)
- Natural Language Inference (NLI)
- Named Entity Recognition (NER)
- Relation Extraction (RE)
- Dependency Parsing (DP)
- Machine Reading Comprehension (MRC)
- Dialogue State Tracking (DST)