HiNER-collapsed

Name: HiNER-collapsed
Published: 2022-04-28
License: CC-BY-SA 4.0

HiNER: A Large Hindi Named Entity Recognition Dataset

Dataset Information

Modalities

Texts

Languages

Hindi

Introduced

2022

License

CC-BY-SA 4.0

Homepage

Official Website

Contents

Overview
Associated Benchmarks
Recent Benchmark Submissions
Research Papers

Overview

This dataset releases a significantly sized standard-abiding Hindi NER dataset containing 109,146 sentences and 2,220,856 tokens, annotated with 3 collapsed tags (PER, LOC, ORG).

Variants: HiNER-collapsed

Associated Benchmarks

This dataset is used in 1 benchmark:

Named Entity Recognition (NER) - Metrics: F1-score (Weighted)

Recent Benchmark Submissions

Task	Model	Paper	Date
Named Entity Recognition (NER)	cfilt/HiNER-collapsed-xlm-roberta-large	HiNER: A Large Hindi Named …	2022-04-28
Named Entity Recognition (NER)	cfilt/HiNER-collapsed-muril-base-cased	HiNER: A Large Hindi Named …	2022-04-28

Research Papers

Recent papers with results on this dataset:

HiNER: A Large Hindi Named Entity Recognition Dataset (2022) -

External Links:

HiNER-collapsed

Overview edit

Associated Benchmarks

Recent Benchmark Submissions

Research Papers

Edit Dataset Information

Overview