Chinese Polyphones with Pinyin
A benchmark dataset of more than 99,000 sentences for Chinese polyphone disambiguation.
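To make the task concrete: a polyphone is a character with more than one reading, and the correct reading depends on context. For example, 行 is read "háng" in 银行 (bank) and "xíng" in 行走 (to walk). The sketch below is a toy, rule-based illustration of the task only. The lexicon, the default-reading table, and the `disambiguate` helper are all hypothetical and hand-written for this example; they are not taken from the dataset and are not how the models listed below work.

```python
# Toy illustration only: a hand-written lexicon, NOT data from the dataset.
# Readings are written as numbered-tone pinyin (e.g. "hang2" for háng).
TOY_LEXICON = {
    "银行": ["yin2", "hang2"],  # "bank"
    "行走": ["xing2", "zou3"],  # "to walk"
}

# Assumed fallback reading when no lexicon word covers the character.
DEFAULT_READING = {"行": "xing2"}


def disambiguate(sentence: str, index: int) -> str:
    """Return a reading for the polyphone at `index`, using word context."""
    char = sentence[index]
    for word, readings in TOY_LEXICON.items():
        # Try every position inside `word` that could align with `index`.
        for offset, word_char in enumerate(word):
            start = index - offset
            if word_char == char and start >= 0 and sentence.startswith(word, start):
                return readings[offset]
    # Only characters listed in DEFAULT_READING are supported by this toy.
    return DEFAULT_READING[char]


print(disambiguate("我去银行取钱", 3))  # hang2
print(disambiguate("他在路上行走", 4))  # xing2
```

A lookup like this fails whenever the surrounding word is not in the lexicon, which is why the task is usually treated as a context-dependent classification problem.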
Variants: CPP
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Polyphone disambiguation | g2pW | g2pW: A Conditional Weighted Softmax … | 2022-03-20 |
Polyphone disambiguation | g2pM (BERT) | g2pM: A Neural Grapheme-to-Phoneme Conversion … | 2020-04-07 |
Polyphone disambiguation | g2pM (BiLSTM) | g2pM: A Neural Grapheme-to-Phoneme Conversion … | 2020-04-07 |