Spoken Hate in the Albanian Jargon
This is an abusive/offensive language detection dataset for Albanian. The data is formatted following the OffensEval convention. Data is from Instagram and YouTube comments.
Variants: SHAJ
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Hate Speech Detection | Baseline BERT (task A) | Detecting Abusive Albanian | 2021-07-28 |
Recent papers with results on this dataset: