BC4CHEMD

BioCreative IV Chemical compound and drug name recognition

Dataset Information
Modalities: Texts
Languages: English
License: Unknown

Overview

Introduced by Krallinger et al. in "The CHEMDNER corpus of chemicals and drugs and its annotation principles".

BC4CHEMD is a collection of 10,000 PubMed abstracts containing 84,355 chemical entity mentions, each manually annotated by expert chemistry literature curators.
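To make the notion of an "entity mention" concrete, here is a minimal sketch of how mentions could be counted in token-level annotations. It assumes a BIO tagging scheme and a label named "Chemical"; both are assumptions for illustration, since the description does not state the corpus's distribution format. The toy sentence is hypothetical and not taken from the corpus. The last lines derive the average from the two figures quoted above, which comes to roughly 8.4 mentions per abstract.

```python
def count_mentions(tags):
    """Count entity mentions in a BIO tag sequence (one mention per B- tag)."""
    return sum(1 for tag in tags if tag.startswith("B-"))


# Hypothetical toy sentence, not taken from the corpus.
# The BIO scheme and the "Chemical" label are assumptions.
tokens = ["Aspirin", "is", "also", "called", "acetylsalicylic", "acid", "."]
tags = ["B-Chemical", "O", "O", "O", "B-Chemical", "I-Chemical", "O"]

# Two mentions: "Aspirin" and the multi-token "acetylsalicylic acid".
n_mentions = count_mentions(tags)

# Corpus-level figures quoted in the dataset description.
N_ABSTRACTS = 10_000
N_MENTIONS = 84_355
mean_mentions_per_abstract = N_MENTIONS / N_ABSTRACTS

print(n_mentions)
print(mean_mentions_per_abstract)
```

Counting only the B- tags treats a multi-token name as a single mention, which matches the usual entity-level view in NER evaluation.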

Variants: BC4CHEMD

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task | Model | Paper | Date
Named Entity Recognition (NER) | UniNER-7B | UniversalNER: Targeted Distillation from Large … | 2023-08-07
Named Entity Recognition (NER) | BERN2 | BERN2: an advanced neural biomedical … | 2022-01-06
Named Entity Recognition (NER) | BLSTM-CNN-Char (SparkNLP) | Biomedical Named Entity Recognition at … | 2020-11-12

Research Papers

Recent papers with results on this dataset: