HiNER: A Large Hindi Named Entity Recognition Dataset
This dataset releases a significantly sized standard-abiding Hindi NER dataset containing 109,146 sentences and 2,220,856 tokens, annotated with 3 collapsed tags (PER, LOC, ORG).
Variants: HiNER-collapsed
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Named Entity Recognition (NER) | cfilt/HiNER-collapsed-xlm-roberta-large | HiNER: A Large Hindi Named … | 2022-04-28 |
Named Entity Recognition (NER) | cfilt/HiNER-collapsed-muril-base-cased | HiNER: A Large Hindi Named … | 2022-04-28 |
Recent papers with results on this dataset: