USR-PersonaChat

Name: USR-PersonaChat
Published: 2020-05-01
License: Unknown

Dataset Information

Modalities

Texts

Languages

English

Introduced

2020

License

Unknown

Homepage

Official Website

Contents

Overview
Associated Benchmarks
Recent Benchmark Submissions
Research Papers

Overview

This dataset was collected with the goal of assessing dialog evaluation metrics. In the paper, USR: An Unsupervised and Reference Free Evaluation Metric for Dialog (Mehri and Eskenazi, 2020), the authors collect this data to measure the quality of several existing word-overlap and embedding-based metrics, as well as their newly proposed USR metric.

Variants: USR-PersonaChat

Associated Benchmarks

This dataset is used in 1 benchmark:

Dialogue Evaluation - Metrics: Spearman Correlation, Pearson Correlation

Recent Benchmark Submissions

Task	Model	Paper	Date
Dialogue Evaluation	USR - DR (x = c)	USR: An Unsupervised and Reference …	2020-05-01
Dialogue Evaluation	USR	USR: An Unsupervised and Reference …	2020-05-01
Dialogue Evaluation	USR - MLM	USR: An Unsupervised and Reference …	2020-05-01
Dialogue Evaluation	USR - DR (x = f)	USR: An Unsupervised and Reference …	2020-05-01

Research Papers

Recent papers with results on this dataset:

USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation (2020) -

External Links:

USR-PersonaChat

Overview edit

Associated Benchmarks

Recent Benchmark Submissions

Research Papers

Edit Dataset Information

Overview