CAIS

Name: CAIS
Published: 2019-09-16
License: Unknown

Chinese Artificial Intelligence Speakers

Dataset Information

Modalities

Texts

Languages

Chinese

Introduced

2019

License

Unknown

Homepage

Official Website

Contents

Overview
Associated Benchmarks
Recent Benchmark Submissions
Research Papers

Overview

We collect utterances from the Chinese Artificial Intelligence Speakers (CAIS), and annotate them with slot tags and intent labels. The training, validation and test sets are split by the distribution of intents, where detailed statistics are provided in the supplementary material. Since the utterances are collected from speaker systems in the real world, intent labels are partial to the PlayMusic option. We adopt the BIOES tagging scheme for slots instead of the BIO2 used in the ATIS, since previous studies have highlighted meaningful improvements with this scheme (Ratinov and Roth, 2009) in the sequence labeling field

Variants: CAIS

Associated Benchmarks

This dataset is used in 2 benchmarks:

Slot Filling - Metrics: F1
Intent Detection - Metrics: Acc

Recent Benchmark Submissions

Task	Model	Paper	Date
Slot Filling	CM-Net	CM-Net: A Novel Collaborative Memory …	2019-09-16
Intent Detection	CM-Net	CM-Net: A Novel Collaborative Memory …	2019-09-16

Research Papers

Recent papers with results on this dataset:

CM-Net: A Novel Collaborative Memory Network for Spoken Language Understanding (2019) -

External Links:

CAIS

Overview edit

Associated Benchmarks

Recent Benchmark Submissions

Research Papers

Edit Dataset Information

Overview