DBLP

Citation Network Dataset

Dataset Information
Modalities
Graphs
Introduced
2008
License
Unknown
Homepage

Overview

The DBLP is a citation network dataset. The citation data is extracted from DBLP, ACM, MAG (Microsoft Academic Graph), and other sources. The first version contains 629,814 papers and 632,752 citations. Each paper is associated with abstract, authors, year, venue, and title.
The data set can be used for clustering with network and side information, studying influence in the citation network, finding the most influential papers, topic modeling analysis, etc.

Source: https://www.aminer.org/citation

Variants: DBLP (PACT) 14k, DBLP

Associated Benchmarks

This dataset is used in 3 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Node Classification FIT-GNN FIT-GNN: Faster Inference Time for … 2024-10-19
Node Classification RR-GCN-PPV R-GCN: The R Could Stand … 2022-03-04
Node Classification R-GCN R-GCN: The R Could Stand … 2022-03-04
Node Classification PairE Graph Representation Learning Beyond Node … 2022-03-03
Node Classification GRACE Deep Graph Contrastive Representation Learning 2020-06-07
Node Classification DAOR Bridging the Gap between Community … 2019-12-17
Link Prediction GLACE Gaussian Embedding of Large-scale Attributed … 2019-12-02
Link Prediction HSRL (DW) Learning Topological Representation for Networks … 2019-02-15
Link Prediction Event2vec Representation Learning for Heterogeneous Information … 2019-01-29
Community Detection CommunityGAN CommunityGAN: Community Detection with Generative … 2019-01-20

Research Papers

Recent papers with results on this dataset: