PIT

Paraphrase and Semantic Similarity in Twitter

Dataset Information
Modalities
Texts
Languages
English
License
Unknown
Homepage

Overview

Paraphrase and Semantic Similarity in Twitter (PIT) presents a constructed Twitter Paraphrase Corpus that contains 18,762 sentence pairs.

Source: SemEval-2015 Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)

Variants: PIT

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Paraphrase Identification TSDAE TSDAE: Using Transformer-based Sequential Denoising … 2021-04-14

Research Papers

Recent papers with results on this dataset: