TurkCorpus is a dataset of 2,359 original sentences from English Wikipedia, each paired with 8 manually written reference simplifications.
It is divided into two subsets for sentence simplification models: 2,000 sentences for validation and 359 for testing.
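The layout described above can be sketched as a small data structure. This is a minimal illustration, not an official loader: the names `SimplificationExample`, `NUM_REFERENCES`, `SPLIT_SIZES`, and `total_sentences` are hypothetical, and only the counts (8 references, 2,000 validation, 359 test) come from the description.

```python
from dataclasses import dataclass
from typing import Dict, List

# Counts taken from the dataset description; the names are illustrative.
NUM_REFERENCES = 8
SPLIT_SIZES: Dict[str, int] = {"validation": 2000, "test": 359}


@dataclass
class SimplificationExample:
    """One original sentence with its manual reference simplifications."""

    original: str
    simplifications: List[str]

    def __post_init__(self) -> None:
        # Every original sentence is expected to carry exactly 8 references.
        if len(self.simplifications) != NUM_REFERENCES:
            raise ValueError(
                f"expected {NUM_REFERENCES} references, "
                f"got {len(self.simplifications)}"
            )


def total_sentences(split_sizes: Dict[str, int] = SPLIT_SIZES) -> int:
    """Sum the split sizes to get the number of original sentences."""
    return sum(split_sizes.values())
```

Summing the two splits gives the 2,359 original sentences stated above, and constructing an example with fewer or more than 8 references raises a `ValueError`.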