20NewsGroups

Dataset Information
License
Unknown
Homepage

Overview

The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups.

Variants: 20NewsGroups

Associated Benchmarks

This dataset is used in 2 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Topic Models vONTSS vONTSS: vMF based semi-supervised neural … 2023-07-03
Topic Models NSTM Neural Topic Model via Optimal … 2020-08-12
Topic Models ETM Topic Modeling in Embedding Spaces 2019-07-08
Topic Models vNVDM Spherical Latent Spaces for Stable … 2018-08-31
Intrusion Detection intrusion detection A Neural Network Architecture Combining … 2017-09-10
Topic Models GSM Discovering Discrete Latent Topics with … 2017-06-01
Topic Models ProdLDA Autoencoding Variational Inference For Topic … 2017-03-04

Research Papers

Recent papers with results on this dataset: