TURL

Twitter News URL Corpus

Dataset Information
Modalities
Texts
License
Unknown
Homepage

Overview

Twitter News URL Corpus is a human-labeled paraphrase corpus to date of 51,524 sentence pairs and the first cross-domain benchmarking for automatic paraphrase identification.

Source: A Continuously Growing Dataset of Sentential Paraphrases
Image Source: https://arxiv.org/pdf/1708.00391v1.pdf

Variants: TURL

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Paraphrase Identification TSDAE TSDAE: Using Transformer-based Sequential Denoising … 2021-04-14

Research Papers

Recent papers with results on this dataset: