Decagon

Bio-decagon

Dataset Information
Modalities
Graphs
Introduced
2018
License
Unknown
Homepage

Overview

Bio-decagon is a dataset for polypharmacy side effect identification problem framed as a multirelational link prediction problem in a two-layer multimodal graph/network of two node types: drugs and proteins. Protein-protein interaction
network describes relationships between proteins. Drug-drug interaction network contains 964 different types of edges (one for each side effect type) and describes which drug pairs lead to which side effects. Lastly,
drug-protein links describe the proteins targeted by a given drug.

The final network after linking entity vocabularies used by different databases has 645 drug and 19,085 protein nodes connected by 715,612 protein-protein, 4,651,131 drug-drug, and 18,596 drug-protein edges.

Source: Modeling polypharmacy side effects with graph convolutional networks
Image Source: Modeling polypharmacy side effects with graph convolutional networks

Variants: Decagon

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Link Prediction TIP Tri-graph Information Propagation for Polypharmacy … 2020-01-28
Link Prediction Decagon Modeling polypharmacy side effects with … 2018-02-02

Research Papers

Recent papers with results on this dataset: