TURL

Name: TURL
License: Unknown

Twitter News URL Corpus

Dataset Information

Modalities

Texts

License

Unknown

Homepage

Official Website

Contents

Overview
Associated Benchmarks
Recent Benchmark Submissions
Research Papers

Overview

Twitter News URL Corpus (TURL) is described by its authors as the largest human-labeled paraphrase corpus to date, with 51,524 sentence pairs, and as the first cross-domain benchmark for automatic paraphrase identification.

Source: A Continuously Growing Dataset of Sentential Paraphrases
Image Source: https://arxiv.org/pdf/1708.00391v1.pdf

Variants: TURL

Associated Benchmarks

This dataset is used in 1 benchmark:

Paraphrase Identification - Metrics: AP
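
As a rough illustration of the metric listed above, the sketch below assumes AP stands for average precision computed over sentence pairs ranked by a model's similarity score. The function name, the 0/1 label encoding, and the toy data are hypothetical and are not taken from the benchmark itself.

```python
def average_precision(labels, scores):
    """Average precision over pairs ranked by descending score.

    labels: 1 for a paraphrase pair, 0 otherwise (assumed encoding).
    scores: model similarity scores, higher means more likely a paraphrase.
    """
    # Rank pairs from most to least confident.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            # Precision at the rank of each true paraphrase.
            precision_sum += hits / rank
    # With no positive pairs, AP is undefined; return 0.0 by convention here.
    return precision_sum / hits if hits else 0.0


# Toy data, purely illustrative.
toy_labels = [1, 0, 1, 0]
toy_scores = [0.9, 0.8, 0.7, 0.1]
toy_ap = average_precision(toy_labels, toy_scores)
print(f"AP = {toy_ap:.4f}")
```

A model that ranks every paraphrase pair above every non-paraphrase pair scores 1.0 under this definition; published results may use a library implementation with different tie handling.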

Recent Benchmark Submissions

Task	Model	Paper	Date
Paraphrase Identification	TSDAE	TSDAE: Using Transformer-based Sequential Denoising …	2021-04-14

Research Papers

Recent papers with results on this dataset:

TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning (2021)
