BN-AuthProf

Bangla Author Profiling Dataset

Dataset Information
Modalities: Texts
Languages: Bengali
Introduced: 2024
License: MIT

Overview

Although research on author profiling has advanced considerably for high-resource languages, it is still in its infancy for low-resource languages such as Bengali. This repository contains our Bangla Author Profiling Dataset (BN-AuthProf). Its primary objective is to introduce and benchmark machine learning approaches to age and gender classification from people's social media status updates.

The BN-AuthProf dataset covers 300 anonymized authors and contains 30,131 manually curated Facebook posts in Bangla, tagged with age and gender information.

Variants: BN-AuthProf

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task: Age And Gender Classification
Model: Multinomial Naive Bayes (MNB)
Paper: BN-AuthProf: Benchmarking Machine Learning for …
Date: 2024-12-03
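
The submission listed above names Multinomial Naive Bayes (MNB) as a model for the classification task. As a rough illustration of how that family of classifier works on short posts, the following pure-Python sketch counts whitespace-separated tokens per class and applies add-one smoothing. It is not the authors' pipeline: the class name TinyMultinomialNB, the toy posts, and the labels group_a and group_b are all hypothetical placeholders, and real Bangla posts would likely need proper tokenization and feature weighting.

```python
import math
from collections import Counter, defaultdict


class TinyMultinomialNB:
    """Hypothetical minimal Multinomial Naive Bayes over whitespace tokens."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # add-alpha (Laplace) smoothing strength

    def fit(self, texts, labels):
        # Count documents per class and token occurrences per class.
        self.class_counts = Counter(labels)
        self.token_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.token_counts[label].update(text.split())
        self.vocab = {tok for c in self.token_counts.values() for tok in c}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.token_counts[label]
            denom = sum(counts.values()) + self.alpha * len(self.vocab)
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n_docs / total_docs)
            for tok in text.split():
                if tok in self.vocab:  # skip tokens never seen in training
                    score += math.log((counts[tok] + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Made-up toy posts with placeholder labels; not drawn from BN-AuthProf.
train_texts = [
    "exam tomorrow campus library",
    "campus friends exam results",
    "office meeting deadline today",
    "office commute meeting schedule",
]
train_labels = ["group_a", "group_a", "group_b", "group_b"]

model = TinyMultinomialNB().fit(train_texts, train_labels)
prediction = model.predict("exam at campus")
print(prediction)  # group_a
```

Working in log space avoids numeric underflow when many token probabilities are multiplied, and the smoothing term keeps a token seen in only one class from driving the other class's probability to zero.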
