TextComplexityDE

Name: TextComplexityDE
Published: 2019-04-16
License: Unknown

Dataset Information

Modalities

Texts

Languages

German

Introduced

2019

License

Unknown

Homepage

Official Website

Contents

Overview
Associated Benchmarks
Recent Benchmark Submissions
Research Papers

Overview

TextComplexityDE is a dataset consisting of 1000 sentences in German language taken from 23 Wikipedia articles in 3 different article-genres to be used for developing text-complexity predictor models and automatic text simplification in German language. The dataset includes subjective assessment of different text-complexity aspects provided by German learners in level A and B. In addition, it contains manual simplification of 250 of those sentences provided by native speakers and subjective assessment of the simplified sentences by participants from the target group. The subjective ratings were collected using both laboratory studies and crowdsourcing approach.

Source: Subjective Assessment of Text Complexity: A Dataset for German Language

Variants: TextComplexityDE

Associated Benchmarks

This dataset is used in 1 benchmark:

Text Complexity Assessment (GermEval 2022) - Metrics: RMSE

Recent Benchmark Submissions

Task	Model	Paper	Date
Text Complexity Assessment (GermEval 2022)	Transformer Ensemble	Automatic Readability Assessment of German …	2022-09-09

Research Papers

Recent papers with results on this dataset:

Automatic Readability Assessment of German Sentences with Transformer Ensembles (2022) -

External Links:

TextComplexityDE

Overview edit

Associated Benchmarks

Recent Benchmark Submissions

Research Papers

Edit Dataset Information

Overview