CMeIE

Chinese Medical Information Extraction Dataset

Dataset Information
Modalities
Texts, Medical
Languages
Chinese
Introduced
2021
License
@book{2020CMeIE, title={CMeIE: Construction and Evaluation of Chinese Medical Information Extraction Dataset}, author={ Guan, T. and Zan, H. and Zhou, X. and Xu, H. and K Zhang}, publisher={Natural Language Processing and Chinese Computing, 9th CCF International Conference, NLPCC 2020, Zhengzhou, China, October 14–18, 2020, Proceedings, Part I}, year={2020}, }
Homepage

Overview

Chinese Medical Information Extraction, a dataset that is also released in CHIP2020, is used for CMeIE task. The task is aimed at identifying both entities and relations in a sentence following the schema constraints. There are 53 relations defined in the dataset, including 10 synonymous sub-relationships and 43 other sub-relationships.

Variants: CMeIE

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Medical Relation Extraction RoBERTa-wwm-ext-large CBLUE: A Chinese Biomedical Language … 2021-06-15

Research Papers

Recent papers with results on this dataset: