USR-PersonaChat

Dataset Information
Modalities: Texts
Languages: English
Introduced: 2020
License: Unknown

Overview

This dataset was collected to assess dialog evaluation metrics. In the paper "USR: An Unsupervised and Reference Free Evaluation Metric for Dialog" (Mehri and Eskenazi, 2020), the authors use it to measure the quality of several existing word-overlap and embedding-based metrics, as well as their newly proposed USR metric.
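One common way to measure the quality of a dialog evaluation metric is to check how well its scores correlate with human ratings of the same responses. The sketch below computes a Spearman rank correlation in plain Python. It is only an illustration under stated assumptions: the function names and the example numbers are hypothetical, and nothing here reflects the dataset's actual file format or the authors' evaluation code.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to cover the run of values tied with position i.
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the average ranks."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    return pearson(average_ranks(xs), average_ranks(ys))


# Hypothetical values for illustration only, not taken from the dataset.
human_ratings = [1.0, 2.0, 3.0, 4.0, 5.0]
metric_scores = [0.10, 0.35, 0.30, 0.70, 0.90]

rho = spearman(metric_scores, human_ratings)
print(round(rho, 3))  # 0.9
```

A value near 1 means the metric ranks responses much as the human annotators do; a value near 0 means the metric's ranking carries little information about human judgment.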

Variants: USR-PersonaChat

Associated Benchmarks

This dataset is used in 1 benchmark.

Recent Benchmark Submissions

Task                 Model             Paper                                 Date
Dialogue Evaluation  USR - DR (x = c)  USR: An Unsupervised and Reference …  2020-05-01
Dialogue Evaluation  USR               USR: An Unsupervised and Reference …  2020-05-01
Dialogue Evaluation  USR - MLM         USR: An Unsupervised and Reference …  2020-05-01
Dialogue Evaluation  USR - DR (x = f)  USR: An Unsupervised and Reference …  2020-05-01

Research Papers

Recent papers with results on this dataset: