HiNER-original

HiNER: A Large Hindi Named Entity Recognition Dataset

Dataset Information
Modalities
Texts
Languages
Hindi
Introduced
2022
License
Homepage

Overview

This dataset releases a significantly sized standard-abiding Hindi NER dataset containing 109,146 sentences and 2,220,856 tokens, annotated with 11 tags.

Variants: HiNER-original

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Named Entity Recognition (NER) cfilt/HiNER-original-xlm-roberta-large HiNER: A Large Hindi Named … 2022-04-28
Named Entity Recognition (NER) cfilt/HiNER-original-muril-base-cased HiNER: A Large Hindi Named … 2022-04-28

Research Papers

Recent papers with results on this dataset: