HiNER-collapsed

HiNER: A Large Hindi Named Entity Recognition Dataset

Dataset Information
Modalities: Texts
Languages: Hindi
Introduced: 2022

Overview

HiNER is a large, standard-abiding Hindi named entity recognition (NER) dataset containing 109,146 sentences and 2,220,856 tokens. The HiNER-collapsed variant is annotated with 3 collapsed tags (PER, LOC, ORG).
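
To illustrate what token-level annotation with the three collapsed tags can look like in practice, here is a minimal sketch that groups per-token tags into entity spans. It assumes a BIO-style scheme (B-PER, I-PER, O, and so on) built on the tag names given above; the exact label strings in the released files may differ. The function name bio_to_spans and the example sentence are hypothetical and are not taken from the dataset.

```python
from typing import List, Tuple

# The three collapsed entity types named in the overview.
COLLAPSED_TYPES = {"PER", "LOC", "ORG"}


def bio_to_spans(tags: List[str]) -> List[Tuple[str, int, int]]:
    """Group BIO tags into (entity_type, start, end) spans; end is exclusive.

    Assumption: tags look like "B-PER", "I-PER" or "O". A stray "I-" tag
    with no open span of the same type is treated leniently as a span start.
    """
    spans: List[Tuple[str, int, int]] = []
    current_type = None
    start = 0
    for i, tag in enumerate(tags):
        prefix, _, etype = tag.partition("-")
        # An I- tag of the same type simply extends the open span.
        if prefix == "I" and etype == current_type:
            continue
        # Any other tag closes the span that is currently open, if any.
        if current_type is not None:
            spans.append((current_type, start, i))
            current_type = None
        # A B- tag (or a stray I- tag) of a known type opens a new span.
        if prefix in ("B", "I") and etype in COLLAPSED_TYPES:
            current_type, start = etype, i
    # Close a span that runs to the end of the sentence.
    if current_type is not None:
        spans.append((current_type, start, len(tags)))
    return spans


# Made-up example sentence, not drawn from HiNER.
tokens = ["सचिन", "तेंदुलकर", "मुंबई", "में", "रहते", "हैं"]
tags = ["B-PER", "I-PER", "B-LOC", "O", "O", "O"]

for etype, begin, end in bio_to_spans(tags):
    print(etype, " ".join(tokens[begin:end]))
```

Span-level grouping of this kind is what entity-level NER metrics are usually computed over, so a helper like this is a common first step when inspecting predictions from a tagger.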

Variants: HiNER-collapsed

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task                            Model                                     Paper                         Date
Named Entity Recognition (NER)  cfilt/HiNER-collapsed-xlm-roberta-large   HiNER: A Large Hindi Named …  2022-04-28
Named Entity Recognition (NER)  cfilt/HiNER-collapsed-muril-base-cased    HiNER: A Large Hindi Named …  2022-04-28

Research Papers

Recent papers with results on this dataset: