Awesome Graph Self-Supervised Learning

August 15, 2024 · View on GitHub

PRs WelcomeAwesomeGitHub stars GitHub forks

A curated list for awesome self-supervised graph representation learning resources. Inspired by awesome-deep-vision, awesome-adversarial-machine-learning, awesome-deep-learning-papers, awesome-architecture-search, and awesome-self-supervised-learning.

Why Self-Supervised?

Self-Supervised Learning has become an exciting direction in AI community.

  • Jitendra Malik: "Supervision is the opium of the AI researcher"
  • Alyosha Efros: "The AI revolution will not be supervised"
  • Yann LeCun: "self-supervised learning is the cake, supervised learning is the icing on the cake, reinforcement learning is the cherry on the cake"

Table of Contents

Overview

We extend the concept of self-supervised learning, which first emerged in the fields of computer vision and natural language processing, to present a timely and comprehensive review of the existing SSL techniques for graph data. Specifically, we divide existing graph SSL methods into three categories: contrastive, generative, and predictive as shown below.

  • Contrastive Learning: it contrasts the views generated by different data augmentation methods. The information about the differences and sameness between data-data pairs (inter-data) is used as self-supervision signals.
  • Generative Learning: it focuses on the (intra-data) information embedded in the data, generally based on prtext tasks such as reconstruction, which exploit the attributes and structure of the data itself as self-supervision signals.
  • Predictive Learning: it generally self-generates labels from graph data through some simple statistical analysis, or expert knowledge, and designs prediction-based pretext tasks based on the self-generated labels to handle the data-label relationship.

Training Strategy

Considering the relationship among bottleneck encoders, self-supervised pretext tasks, and downstream tasks, the training strategies can be divided into three categories: Pre-training and Fine-tuning (P&F), Joint Learning (JL), and Unsupervised Representation Learning (URL), with their detailed workflow shown below.

  • Pre-train&Fine-tune (P&F): it first pre-trains the encoder with unlabeled nodes by the self-supervised pretext tasks. The pre-trained encoder’s parameters are then used as the initialization of the encoder used in supervised fine-tuning for downstream tasks.
  • Joint Learning (JL): an auxiliary pretext task with self-supervision is included to help learn the supervised downstream task. The encoder is trained through both the pretext task and the downstream task simultaneously.
  • Unsupervised Representation Learning (URL): it first pre-trains the encoder with unlabeled nodes by the self-supervised pretext tasks. The pre-trained encoder’s parameters are then frozen and used in the supervised downstream task with additional labels.

Contrastive Learning

A general framework for contrastive learning is shown below. The two contrasting components may be local, contextual, or global, corresponding to node-level (marked in red), subgraph-level (marked in green), or graph-level (marked in yellow) information in the graph. The contrastive learning can thus contrast two views (at the same or different scales), which leads to two categories of algorithm: (1) same-scale contrasting, including Local-Local (L-L) contrasting, Context-Context (C-C) contrasting, and Global-Global (G-G) contrasting; and (2) cross-scale contrasting, including Local-Context (L-C) contrasting, Local-Global (L-G) contrasting, and Context-Global (C-G) contrasting.

Global-Global Contrasting

  • GraphCL: Graph Contrastive Learning with Augmentations.
    • Y. You, T. Chen, Y. Sui, T. Chen, Z. Wang, and Y. Shen. NIPS 2020. [pdf] [code]
  • IGSD: Iterative Graph Self-Distillation.
    • H. Zhang, S. Lin, W. Liu, P. Zhou, J. Tang, X. Arxiv 2020. [pdf]
  • DACL: Towards Domain-Agnostic Contrastive Learning.
    • V. Verma, M.-T. Luong, K. Kawaguchi, H. Pham, andQ. V. Le. Arxiv 2020. [pdf]
  • LCC: Label Contrastive Coding Based Graph Neural Network for Graph Classification.
    • Y. Ren, J. Bai, and J. Zhang. Arxiv 2021. [pdf] [code]
  • CCGL: Contrastive Cascade Graph Learning.
    • X. Xu, F. Zhou, K. Zhang, and S. Liu. TKDE 2022. [pdf] [code]
  • CSSL: Contrastive Self-Supervised Learning for Graph Classification.
    • J. Zeng and P. Xie. Arxiv 2020. [pdf]

Context-Context Contrasting

  • GCC: Graph Contrastive Coding for Graph Neural Network Pre-training.
    • J. Qiu, Q. Chen, Y. Dong, J. Zhang, H. Yang, M. Ding, K. Wang, and J. Tang. KDD 2020. [pdf] [code]

Local-Local Contrasting

  • CDNMF: Contrastive Deep Nonnegative Matrix Factorization for Community Detection.
    • Y. Li, J. Chen, C. Chen, L. Yang, Z. Zheng. ICASSP 2024. [pdf] [code]
  • GRACE: Deep Graph Contrastive Representation Learning.
    • Y. Zhu, Y. Xu, F. Yu, Q. Liu, S. Wu, and L. Wang. Arxiv 2020. [pdf] [code]
  • GCA: Graph Contrastive Learning with Adaptive Augmentation.
    • Y. Zhu, Y. Xu, F. Yu, Q. Liu, S. Wu, and L. Wang. Arxiv 2020. [pdf] [code]
  • GROC: Towards Robust Graph Contrastive Learning.
    • N. Jovanovi´c, Z. Meng, L. Faber, and R. Wattenhofer. Arxiv 2021. [pdf]
  • SEPT: Socially-Aware Self-Supervised Tri-Training for Recommendation.
    • J. Yu, H. Yin, M. Gao, X. Xia, X. Zhang, and N. Q. V.Hung. Arxiv 2021. [pdf] [code]
  • STDGI: Spatio-Temporal Deep Graph Infomax.
    • F. L. Opolka, A. Solomon, C. Cangea, P. Veliˇckovi´c, P. Li` o, and R. D. Hjelm. Arxiv 2019. [pdf]
  • GMI: Graph Representation Learning via Graphical Mutual Information Maximization.
    • L. Yu, S. Pei, C. Zhang, L. Ding, J. Zhou, L. Li, and X. Zhang. WWW 2020. [pdf] [code]
  • KS2L: Self-Supervised Smoothing Graph Neural Networks.
    • L. Yu, S. Pei, C. Zhang, L. Ding, J. Zhou, L. Li, and X. Zhang. Arxiv 2020. [pdf]
  • CG3: Contrastive and Generative Graph Convolutional Networks for Graph-based Semi-Supervised Learning.
    • S. Wan, S. Pan, J. Yang, and C. Gong. Arxiv 2020. [pdf]
  • BGRL: Bootstrapped Representation Learning on Graphs.
    • S. Thakoor, C. Tallec, M. G. Azar, R. Munos, P. Veliˇckovi´c, and M. Valko. Arxiv 2021. [pdf][code]
  • SelfGNN: Self-supervised Graph Neural Networks without Explicit Negative Sampling.
    • Z. T. Kefato and S. Girdzijauskas. Arxiv 2021. [pdf] [code]
  • HeCo: Self-supervised Heterogeneous Graph Neural Network with Co-contrastive Learning.
    • X. Wang, N. Liu, H. Han, and C. Shi. Arxiv 2021. [pdf] [code]
  • PT-DGNN: Pre-training on Dynamic Graph Neural Networks.
    • J. Zhang, K. Chen, and Y. Wang. Arxiv 2021. [pdf] [code]
  • COAD: Coad: Contrastive Pretraining with Adversarial Fine-tuning for Zero-shot Expert Linking.
    • B. Chen, J. Zhang, X. Zhang, X. Tang, L. Cai, H. Chen, C. Li, P. Zhang, and J. Tang. Arxiv 2020. [pdf] [code]
  • Contrast-Reg: Improving Graph Representation Learning by Contrastive Regularization.
    • K. Ma, H. Yang, H. Yang, T. Jin, P. Chen, Y. Chen, B. F. Kamhoua, and J. Cheng. Arxiv 2021. [pdf]
  • C-SWM: Contrastive Learning of Structured World Models.
    • T. Kipf, E. van der Pol, and M. Welling. *Arxiv 2019. [pdf] [code]

Local-Global Contrasting

  • DGI: Deep Graph Infomax.
    • P. Velickovic, W. Fedus, W. L. Hamilton, P. Li` o, Y. Bengio, and R. D. Hjelm. ICLR 2019. [pdf] [code]
  • HDMI: Hdmi: High-order Deep Multiplex Infomax.
    • B. Jing, C. Park, and H. Tong. Arxiv 2021. [pdf]
  • DMGI: Unsupervised Attributed Multiplex Network Embedding.
    • C. Park, D. Kim, J. Han, and H. Yu. AAAI 2020. [pdf] [code]
  • MVGRL: Contrastive Multi-View Representation Learning on Graphs.
    • K. Hassani and A. H. K. Ahmadi. ICML 2020. [pdf] [code]
  • HDGI: Heterogeneous Deep Graph Infomax.
    • Y. Ren, B. Liu, C. Huang, P. Dai, L. Bo, and J. Zhang. Arxiv 2019. [pdf] [code]

Local-Context Contrasting

  • CDNMF: Contrastive Deep Nonnegative Matrix Factorization for Community Detection.
    • Y. Li, J. Chen, C. Chen, L. Yang, Z. Zheng. ICASSP 2024. [pdf] [code]
  • Subg-Con: Sub-graph Contrast for Scalable Self-Supervised Graph Representation Learning.
    • Y. Jiao, Y. Xiong, J. Zhang, Y. Zhang, T. Zhang, and Y. Zhu. Arxiv 2020. [pdf] [code]
  • Cotext Prediction: Strategies for Pre-training Graph Neural Networks.
    • W. Hu, B. Liu, J. Gomes, M. Zitnik, P. Liang, V. S. Pande, and J. Leskovec. ICLR 2020. [pdf] [code]
  • GIC: Leveraging Cluster-level Node Information for Unsupervised Graph Representation Learning.
    • C. Mavromatis and G. Karypis. Arxiv 2020. [pdf] [code]
  • GraphLoG: Self-Supervised Graph-level Representation Learning with Local and Global Structure.
    • M. Xu, H. Wang, B. Ni, H. Guo, and J. Tang. OpenReview 2021. [pdf] [code]
  • MHCN: Self-Supervised Multi-channel Hypergraph Convolutional Network for Social Recommendation.
    • J. Yu, H. Yin, J. Li, Q. Wang, N. Q. V. Hung, and X. Zhang. Arxiv 2021. [pdf] [code]
  • EGI: Transfer Learning of Graph Neural Networks with Ego-graph Information Maximization.
    • Q. Zhu, Y. Xu, H.Wang, C. Zhang, J. Han, and C. Yang. Arxiv 2020. [pdf] [code]

Context-Global Contrasting

  • MICRO-Graph: Motif-Driven Contrastive Learning of Graph Representations.
    • S. Zhang, Z. Hu, A. Subramonian, and Y. Sun. Arxiv 2020. [pdf] [code]
  • InfoGraph: Unsupervised and Semi-Supervised Graph-level Representation Learning via Mutual Information Maximization.
    • F. Sun, J. Hoffmann, V. Verma, and J. Tang. ICLR 2020. [pdf] [code]
  • SUGAR: Subgraph Neural Network with Reinforcement Pooling and Self-Supervised Mutual Information Mechanism.
    • Q. Sun, H. Peng, J. Li, J. Wu, Y. Ning, P. S. Yu, and L. He. Arxiv 2021. [pdf] [code]
  • BiGI: Bipartite Graph Embedding via Mutual Information Maximization.
    • J. Cao, X. Lin, S. Guo, L. Liu, T. Liu, and B. Wang. WSDM 2021. [pdf] [code]
  • HTC: Graph Representation Learning by Ensemble Aggregating Subgraphs via Mutual Information Maximization.
    • C. Wang and Z. Liu. Arxiv 2021. [pdf]
  • DITNet: Drug Target Prediction using Graph Representation Learning via Substructures Contrast.
    • S. Cheng, L. Zhang, B. Jin, Q. Zhang, and X. Lu. Preprints 2021. [pdf] [code]

Generative Learning

Graph Autoencoding

  • CDNMF: Contrastive Deep Nonnegative Matrix Factorization for Community Detection.
    • Y. Li, J. Chen, C. Chen, L. Yang, Z. Zheng. ICASSP 2024. [pdf] [code]
  • GraphMAE: Self-supervised Masked Graph Autoencoders
    • Z. Hou, X. Liu, Y. Cen, Y. Dong, H. Yang, C. Wang, and J. Tang. KDD 2022 [pdf] [code]
  • Graph Completion: When Does Self-Supervision Help Graph Convolutional Networks?
    • Y. You, T. Chen, Z. Wang, and Y. Shen. PMLR 2020. [pdf] [code]
  • Node Attribute Masking: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
    • W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]
  • Edge Attribute Masking: Strategies for Pre-training Graph Neural Networks.
    • W. Hu, B. Liu, J. Gomes, M. Zitnik, P. Liang, V. S. Pande, and J. Leskovec. ICLR 2020. [pdf] [code]
  • Node Attribute and Embedding Denoising: Graph-based Neural Network Models with Multiple Self-Supervised Auxiliary Tasks.
    • F. Manessi and A. Rozza. Arxiv 2020. [pdf]
  • Adjacency Matrix Reconstruction: Self-Supervised Training of Graph Convolutional Networks.
    • Q. Zhu, B. Du, and P. Yan. Arxiv 2020. [pdf]
  • Graph Bert: Only Attention is Needed for Learning Graph Representations.
    • J. Zhang, H. Zhang, C. Xia, and L. Sun. Arxiv 2020. [pdf] [code]
  • Pretrain-Recsys: Pretraining Graph Neural Networks for Cold-start Users and Items Representation.
    • B. Hao, J. Zhang, H. Yin, C. Li, and H. Chen. WSDM 2021. [pdf] [code]
  • SLAPS: Self-Supervision Improves Structure Learning for Graph Neural Networks.
    • B. Fatemi, L. E. Asri, and S. M. Kazemi. Arxiv 2021. [pdf] [code]
  • G-BERT: Pre-Training of Graph Augmented Transformers for Medication Recommendation.
    • J. Shang, T. Ma, C. Xiao, and J. Sun. Arxiv 2019. [pdf] [code]

Graph Autoregression

  • GPT-GNN: Generative Pre-training of Graph Neural Networks.
    • Z. Hu, Y. Dong, K. Wang, K. Chang, and Y. Sun. KDD 2020. [pdf] [code]

Predictive Learning

A comparison of the predictive learning is shown below. The predictive method generally self-generates labels from graph data and then designs prediction-based pretext tasks based on the self-generated labels. Categorized by how the labels areobtained, we summarize predictive learning methods forgraph data into four categories:

  • Node Property Prediction: it pre-calculates the node properties, such as node degree and used them as self-supervised labels.
  • Context-based Prediction: the local or global contextual information in the graph, such as the shortest path length between nodes can be extracted as labels to help with self-supervised learning.
  • Self-Training: it applies algorithms such as unsupervised clustering to obtain pseudo-labels and then updates the pseudo-label set of the previous stage based on the prediction results or losses.
  • Domain Knowledge-based Prediction: the domain knowledge, such as expert knowledge or specialized tools, can be used in advance to obtain informative labels.

Node Property Prediction

  • Node Property Prediction: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
    • W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]

Context-based Prediction

  • S2GRL: Self-Supervised Graph Representation Learning via Global Context Prediction.
    • Z. Peng, Y. Dong, M. Luo, X.-M. Wu, and Q. Zheng. Arxiv 2020. [pdf]
  • PairwiseDistance: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
    • W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]
  • PairwiseAttsim: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
    • W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]
  • Distance2Cluster: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
    • W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]
  • EdgeMask: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
    • W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]
  • TopoTER: Unsupervised Learning of Topology Transformation Equivariant Representations.
    • X. Gao, W. Hu, and G.-J. Qi. OpenReview 2021. [pdf]
  • Centrality Score Ranking: Pretraining Graph Neural Networks for Generic Structural Feature Extraction.
    • Z. Hu, C. Fan, T. Chen, K.-W. Chang, and Y. Sun. Arxiv 2019. [pdf]
  • Meta-path prediction: Self-supervised Auxiliary Learning with Meta-paths for Heterogeneous Graphs.
    • D. Hwang, J. Park, S. Kwon, K. Kim, J. Ha, and H. J. Kim. NIPS 2020. [pdf] [code]
  • SLiCE: Self-Supervised Learning of Contextual Embeddings for Link Prediction in Heterogeneous Networks.
    • P. Wang, K. Agarwal, C. Ham, S. Choudhury, and C. K. Reddy. Arxiv 2020. [pdf] [code]
  • Distance2Labeled: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
    • W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]
  • Distance2Labeled: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
    • W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]
  • HTM: Hop-count based Self-Supervised Anomaly Detection on Attributed Networks.
    • T. Huang, Y. Pei, V. Menkovski, and M. Pechenizkiy. Arxiv 2021. [pdf]

Self-Training

  • Multi-stage Self-training: Deeper insights into Graph Convolutional Networks for Semi-Supervised Learning.
  • Node Clustering and Partitioning: When Does Self-Supervision Help Graph Convolutional Networks.
    • Y. You, T. Chen, Z. Wang, and Y. Shen. PMLR 2020. [pdf] [code]
  • CAGAN: Cluster-Aware Graph Neural Networks for Unsupervised Graph Representation Learning.
    • Y. Zhu, Y. Xu, F. Yu, S. Wu, and L. Wang. Arxiv 2020. [pdf]
  • M3S: Multi-stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labeled Nodes.
  • Cluster Preserving: Pretraining Graph Neural Networks for Generic Structural Feature Extraction.
    • Z. Hu, C. Fan, T. Chen, K.-W. Chang, and Y. Sun. Arxiv 2019. [pdf]
  • SEF: Self-Supervised Edge Features for Improved Graph Neural Network Training.
    • A. Sehanobish, N. G. Ravindra, and D. van Dijk. Arxiv 2020. [pdf][code]

Domain Knowledge-based Prediction

  • Contextual Molecular Property Prediction: Self-Supervised Graph Transformer on Large-Scale Molecular Data.
    • Y. Rong, Y. Bian, T. Xu, W. Xie, Y. Wei, W. Huang, and J. Huang. NIPS 2020. [pdf] [code]
  • Graph-level Motif Prediction: Self-Supervised Graph Transformer on Large-scale Molecular Data.
    • Y. Rong, Y. Bian, T. Xu, W. Xie, Y. Wei, W. Huang, and J. Huang. NIPS 2020. [pdf] [code]
  • DrRepair: Graph-based, Self-Supervised Program Repair from Diagnostic Feedback.

A summary of all the surveyed works is presented below.

A Summary of Methodology Details

About Graph Property, Pretext Task, Data Augmentation, Objective Function, Training Strategy, and Year of publication.

MethodsGraph PropertyPretext-TaskData AugmentationObjective FunctionTraining StrategyYear
CDNMFAttributedContrastive/L-C + Generative/AENoneInfoNCE + AEURL2024
Graph CompletionAttributedGenerative/AEAttribute MaskingMAEP&F/JL2020
Node Attribute MaskingAttributedGenerative/AEAttribute MaskingMAEP&F/JL2020
Edge Attribute MaskingAttributedGenerative/AEAttribute MaskingMAEP&F2019
Node Attribute and
Embedding Denoising
AttributedGenerative/AEAttribute MaskingMAEJL2020
Adjacency Matrix
Reconstruction
AttributedGenerative/AEAttribute Masking
Edge Perturbation
MAEJL2020
Graph BertAttributedGenerative/AEAttribute Masking
Edge Perturbation
MAEP&F2020
Pretrain-RecsysAttributedGenerative/AEEdge PerturbationMAEP&F2021
GPT-GNNHeterogeneousGenerative/ARAttribute Masking
Edge Perturbation
MAE/InfoNCEP&F2020
GraphCLAttributedContrastive/G-GAttribute Masking
Edge Perturbation
Random Walk Sampling
InfoNCEURL2020
IGSDAttributedContrastive/G-GEdge Perturbation
Edge Doffisopm
InfoNCEJL/URL2020
DACLAttributedContrastive/G-GMixupInfoNCEURL2020
LCCAttributedContrastive/G-GNoneInfoNCEJL2021
CCGLAttributedContrastive/G-GInformation Re-DiffusionInfoNCEP&F2021
CSSLAttributedContrastive/G-GNodeInsertion
Edge Perturbation
Uniform Sampling
InfoNCEP&F/JL/URL2020
GCCUnattributedContrastive/C-CRandom Walk SamplingInfoNCEP&F/URL2020
GRACEAttributedContrastive/L-LAttribute Masking
Edge Perturbation
InfoNCEURL2020
GCAAttributedContrastive/L-LAttention-basedInfoNCEURL2020
GROCAttributedContrastive/L-LGradient-basedInfoNCEURL2021
SEPTAttributedContrastive/L-LEdge PerturbationInfoNCEJL2021
STDGISpatial-TemporalContrastive/L-LAttribute ShufflingJS EstimatorURL2019
GMIAttributedContrastive/L-LNoneSP EstimatorURL2020
KS2LAttributedContrastive/L-LNoneInfoNCEURL2020
CG3AttributedContrastive/L-LNoneInfoNCEJL2020
BGRLAttributedContrastive/L-LAttribute Masking
Edge Perturbation
Inner ProductURL2021
SelfGNNAttributedContrastive/L-LAttribute Masking
Edge Diffusion
MSEURL2021
HeCoHeterogeneousContrastive/L-LNoneInfoNCEURL2021
PT-DGNNDynamicContrastive/L-LAttribute Masking
Edge Perturbation
InforNCEP&F2021
COADAttributedContrastive/L-LNoneTriplet Margin LossP&F2020
Contrst-RegAttributedContrastive/L-LAttribute ShufflingInfoNCEJL2021
DGIAttributedContrastive/L-GArbitraryJS EstimatorURL2019
HDMIAttributedContrastive/L-GAttribute ShufflingJS EstimatorURL2021
DMGIHeterogeneousContrastive/L-GAttribute ShufflingJS Estimator/MAEURL2020
MVGRLAttributedContrastive/L-GAttribute Masking
Edge Perturbation
Edge Diffusion
Random Walk Sampling
DV Estimator
JS Estimator
NT-Xent
InfoNCE
URL2020
HDGIHeterogeneousContrastive/L-GAttribute ShufflingJS EstimatorURL2019
Subg-ConAttributedContrastive/L-CImportance SamplingTriplet Margin LossURL2020
Cotext PredictionAttributedContrastive/L-CEgo-nets SamplingCross EntropyP&F2019
GICAttributedContrastive/L-CArbitraryJS EstimatorURL2020
GraphLoGAttributedContrastive/L-CAttribute MaskingInfoNCEURL2021
MHCNHeterogeneousContrastive/L-CAttribute ShufflingInfoNCEJL2021
EGIAttributedContrastive/L-CEgo-nets SamplingSP EstimatorP&F2020
MICRO-GraphAttributedContrastive/C-GKnowledge SamplingInfoNCEURL2020
InfoGraphAttributedContrastive/C-GNoneSP EstimatorURL2019
SUGARAttributedContrastive/C-GBFS SamplingJS EstimatorJL2021
BiGIHeterogeneousContrastive/C-GEdge Perturbation
Ego-nets Sampling
JS EstimatorJL2021
HTCAttributedContrastive/C-GAttribute ShufflingSP Estimator
DV Estimator
URL2021
Node Property PredictionAttributedPredictive/Node PropertyNoneMAEP&F/JL2020
S2GRLAttributedPredictive/Context-basedNoneCross EntropyURL2020
PairwiseDistanceAttributedPredictive/Context-basedNoneCross EntropyP&F/JL2020
PairwiseAttrSimAttributedPredictive/Context-basedNoneMAEP&F/JL2020
Distance2ClusterAttributedPredictive/Context-basedNoneMAEP&F/JL2020
EdgeMaskAttributedPredictive/Context-basedNoneCross EntropyP&F/JL2020
TopoTERAttributedPredictive/Context-basedEdge PerturbationCross EntropyURL2021
Centrality Score RankingAttributedPredictive/Context-basedNoneCross EntropyP&F2019
Meta-path predictionHeterogeneousPredictive/Context-basedNoneCross EntropyJL2020
SLiCEHeterogeneousPredictive/Context-basedNoneCross EntropyP&F2020
Distance2LabeledAttributedPredictive/Context-basedNoneMAEP&F/JL2020
ContextLabelAttributedPredictive/Context-basedNoneMAEP&F/JL2020
HCMAttributedPredictive/Context-basedEdge PerturbationBayesian InferenceURL2021
Contextual Molecular
Property Prediction
AttributedPredictive/Domain-basedNoneCross EntropyP&F2020
Graph-level Motif PredictionAttributedPredictive/Domain-basedNoneCross EntropyP&F2020
Multi-stage Self-trainingAttributedPredictive/Self-trainingNoneNoneJL2018
Node ClusteringAttributedPredictive/Self-trainingNoneClusteringP&F/JL2020
Graph PartitioningAttributedPredictive/Self-trainingNoneGraph PartitioningP&F/JL2020
CAGANAttributedPredictive/Self-trainingNoneClusteringURL2020
M3SAttributedPredictive/Self-trainingNoneClusteringJL2020
Cluster PreservingAttributedPredictive/Self-trainingNoneCross EntropyP&F2019

A Summary of Implementation Details

About Task Level, Evaluation Metric, and Evaluation Datasets.

MethodsTask LevelEvaluation MetricDataset
CDNMFNodeNode Clustering (Acc, NMI)Cora, Citeseer, Pubmed
Graph CompletionNodeNode Classification (Acc)Cora, Citeseer, Pubmed
Node Attribute MaskingNodeNode Classification (Acc)Cora, Citeseer, Pubmed, Reddit
Edge Attribute MaskingGraphGraph Classification (ROC-AUC)MUTAG, PTC, PPI, BBBP, Tox21, ToxCast, ClinTox, MUV, HIV, SIDER, BACE
Node Attribute and
Embedding Denoising
NodeNode Classification (Acc)Cora, Citeseer, Pubmed
Adjacency Matrix
Reconstruction
NodeNode Classification (Acc)Cora, Citeseer, Pubmed
Graph BertNodeNode Classification (Acc)
Node Clustering (NMI)
Cora, Citeseer, Pubmed
Pretrain-RecsysNode/Link-ML-1M, MOOCs and Last-FM
GPT-GNNNode/LinkNode Classification (F1-score)
Link Prediction (ROC-AUC)
OAG, Amazon, Reddit
GraphCLGraphGraph Classification (Acc, ROC-AUC)NCI1, PROTEINS, D&D, COLLAB, RDT-B, RDT-M5K, GITHUB, MNIST, CIFAR10, MUTAG, IMDB-B, BBBP, Tox21, ToxCast, SIDER, ClinTox, MUV, HIV, BACE, PPI
IGSDGraphGraph Classification (Acc)MUTAG, PTC_MR, NCI1, IMDB-B, QM9, COLLAB, IMDB-M
DACLGraphGraph Classification (Acc)MUTAG, PTC_MR, IMDB-B, IMDB-M, RDT-B, RDT-M5K
LCCGraphGraph Classification (Acc)IMDB-B, IMDB-M, COLLAB, MUTAG, PROTEINS, PTC, NCI1, D&D
CCGLGraphCascade Graph Prediction (MSLE)Weibo, Twitter, ACM, APS, DBLP
CSSLGraphGraph Classification (Acc)PROTEINS, D&D, NCI1, NCI109, Mutagenicity
GCCNode/GraphNode Classification (Acc)
Graph Classification (Acc)
US-Airport, H-index, COLLAB, IMDB-B, IMDB-M, RDT-B, RDT-M5K
GRACENodeNode Classification (Acc, Micro-F1)Cora, Citeseer, Pubmed, DBLP, Reddit, PPI
GCANodeNode Classification (Acc)Wiki-CS, Amazon-Computers, Amazon-Photo, Coauthor-CS, Coauthor-Physics
GROCNodeNode Classification (Acc)Cora, Citeseer, Pubmed, Amazon-Photo, Wiki-CS
SEPTNode/Link-Last-FM, Douban, Yelp
STDGINodeNode Regression (MAE, RMSE, MAPE)METR-LA
GMINode/LinkNode Classification (Acc, Micro-F1)
Link Prediction (ROC-AUC)
Cora, Citeseer, PubMed, Reddit, PPI, BlogCatalog, Flickr
KS2LNode/LinkNode Classification (Acc)
Link Prediction (ROC-AUC)
Cora, Citeseer, Pubmed, Amazon-Computers, Amazon-Photo, Coauthor-CS
CG3NodeNode Classification (Acc)Cora, Citeseer, Pubmed, Amazon-Computers, Amazon-Photo, Coauthor-CS
BGRLNodeNode Classification (Acc, Micro-F1)Wiki-CS, Amazon-Computers, Amazon-Photo, PPI, Coauthor-CS, Coauthor-Physics, ogbn-arxiv
SelfGNNNodeNode Classification (Acc)Cora, Citeseer, Pubmed, Amazon-Computers, Amazon-Photo, Coauthor-CS, Coauthor-Physics
HeCoNodeNode Classification
(ROC-AUC, Micro-F1, Macro-F1)
Node Clustering (NMI, ARI)
ACM, DBLP, Freebase, AMiner
PT-DGNNLinkLink Prediction (ROC-AUC)HepPh, Math Overflow, Super User
COADNode/LinkNode Clustering
(Precision, Recall, F1-score)
Link Prediction (HitRatio@K, MRR)
AMiner, News, LinkedIn
Contrast-RegNode/LinkNode Classification (Acc)
Node Clustering
(NMI, Acc, Macro-F1)
Link Prediction (ROC-AUC)
Cora, Citeseer, Pubmed, Reddit, ogbn-arxiv, Wikipedia, ogbn-products, Amazo-Computers, Amazo-Photo
DGINodeNode Classification (Acc, Micro-F1)Cora, Citeseer, Pubmed, Reddit, PPI
HDMINodeNode Classification
(Micro-F1, Macro-F1)
Node Clustering (NMI)
ACM, IMDB, DBLP, Amazon
DMGINodeNode Clustering (NMI)
Node Classification (Acc)
ACM, IMDB, DBLP, Amazon
MVGRLNode/GraphNode Classification (Acc)
Node Clustering (NMI, ARI)
Graph Classification (Acc)
Cora, Citeseer, Pubmed, MUTAG, PTC_MR, IMDB-B, IMDB-M, RDT-B
HDGINodeNode Classification
(Micro-F1, Macro-F1)
Node Clustering (NMI, ARI)
ACM, DBLP, IMDB
Subg-ConNodeNode Classification (Acc, Micro-F1)Cora, Citeseer, Pubmed, PPI, Flickr, Reddit
Cotext PredictionGraphGraph Classification (ROC-AUC)MUTAG, PTC, PPI, BBBP, Tox21, ToxCast, ClinTox, MUV, HIV, SIDER, BACE
GICNode/LinkNode Classification (Acc)
Node Clustering (Acc, NMI, ARI)
Link Prediction (ROC-AUC, ROC-AP)
Cora, Citeseer, Pubmed, Amazon-Computers, Amazon-Photo, Coauthor-CS, Coauthor-Physics
GraphLoGGraphGraph Classification (ROC-AUC)BBBP, Tox21, ToxCast, ClinTox, MUV, HIV, SIDER, BACE
MHCNNode/Link-Last-FM, Douban, Yelp
EGINode/LinkNode Classification (Acc)
Link Prediction (ROC-AUC, MRR)
YAGO, Airport
MICRO-GraphGraphGraph Classification (ROC-AUC)BBBP, Tox21, ToxCast, ClinTox, HIV, SIDER, BACE
InfoGraphGraphGraph Classification (Acc)MUTAG, PTC_MR, RDT-B, RDT-M5K, IMDB-B, QM9, IMDB-M
SUGARGraphGraph Classification (Acc)MUTAG, PTC, PROTEINS, D&D, NCI1, NCI109
BiGILinkLink Prediction (AUC-ROC, AUC-PR)DBLP, ML-100K, ML-1M, Wikipedia
HTCGraphGraph Classification (Acc)MUTAG, PTC_MR, IMDB-B, IMDB-M, RDT-B, QM9, RDT-M5K
Node Property PredictionNodeNode Classification (Acc)Cora, Citeseer, Pubmed, Reddit
S2GRLNode/LinkNode Classification (Acc, Micro-F1)
Node Clustering (NMI)
Link Prediction (ROC-AUC)
Cora, Citeseer, Pubmed, PPI, Flickr, BlogCatalog, Reddit
PairwiseDistanceNodeNode Classification (Acc)Cora, Citeseer, Pubmed, Reddit
PairwiseAttrSimNodeNode Classification (Acc)Cora, Citeseer, Pubmed, Reddit
Distance2ClusterNodeNode Classification (Acc)Cora, Citeseer, Pubmed, Reddit
EdgeMaskNodeNode Classification (Acc)Cora, Citeseer, Pubmed, Reddit
TopoTERNode/GraphNode Classification (Acc)
Graph Classification (Acc)
Cora, Citeseer, Pubmed, MUTAG, PTC-MR, RDT-B, RDT-M5K, IMDB-B, IMDB-M
Centrality Score RankingNode/Link/GraphNode Classification (Micro-F1)
Link Prediction (Micro-F1)
Graph Classification (Micro-F1)
Cora, Pubmed, ML-100K, ML-1M, IMDB-M, IMDB-B
Meta-path predictionNode/LinkNode Classification (F1-score)
Link Prediction (ROC-AUC)
ACM, IMDB, Last-FM, Book-Crossing
SLiCELinkLink Prediction (ROC-AUC, Micro-F1)Amazon, DBLP, Freebase, Twitter, Healthcare
Distance2LabeledNodeNode Classification (Acc)Cora, Citeseer, Pubmed, Reddit
ContextLabelNodeNode Classification (Acc)Cora, Citeseer, Pubmed, Reddit
HCMNodeNode Classification (ROC-AUC)ACM, Amazon, Enron, BlogCatalog, Flickr
Contextual Molecular
Property Prediction
GraphGraph Classification (Acc)
Graph Regression (MAE)
BBBP, SIDER, ClinTox, BACE, Tox21, ToxCast, ESOL, FreeSolv, Lipo, QM7, QM8
Graph-level Motif PredictionGraphGraph Classification (Acc)
Graph Regression (MAE)
BBBP, SIDER, ClinTox, BACE, Tox21, ToxCast, ESOL, FreeSolv, Lipo, QM7, QM8
Multi-stage Self-trainingNodeNode Classification (Acc)Cora, Citeseer, Pubmed
Node ClusteringNodeNode Classification (Acc)Cora, Citeseer, Pubmed
Graph PartitioningNodeNode Classification (Acc)Cora, Citeseer, Pubmed
CAGANNodeNode Classfication
(Micro-F1, Macro-F1)
Node Clustering
(Micro-F1, Macro-F1, NMI)
Cora, Citeseer, Pubmed
M3SNodeNode Classification (Acc)Cora, Citeseer, Pubmed
Cluster PreservingNode/Link/GraphNode Classification (Micro-F1)
Link Prediction (Micro-F1)
Graph Classification (Micro-F1)
Cora, Pubmed, ML-100K, ML-1M, IMDB-M, IMDB-B

A Summary of Common Graph Datasets

About category, graph number, node number per graph, edge number per graph, dimensionality of node attributes, class number, and citation papers.

DatasetCategory#Graph#Node (Avg.)#Edge (Avg.)#Feature#Class
CoraCitation Network12708542914337
CiteseerCitation Network13327473237036
PubmedCitation Network119717443385003
Wiki-CSCitation Network11170121612330010
Coauthor-CSCitation Network11833381894680515
Coauthor-PhysicsCitation Network13449324796284155
DBLP (v12)Citation Network1489408145564149--
ogbn-arxivCitation Network1169343116624312840
RedditSocial Network12329651160691960241
BlogCatalogSocial Network1519617174381896
FlickrSocial Network17575239738120479
COLLABSocial Networks500074.492457.78-2
RDT-BSocial Networks2000429.63497.75-2
RDT-M5KSocial Networks4999508.52594.87-5
IMDB-BSocial Networks100019.7796.53-2
IMDB-MSocial Networks150013.0065.94-3
ML-100KSocial Networks12625100000-5
ML-1MSocial Networks199401000209-5
PPIProtein Networks245694481871650121
D&DProtein Networks1178284.32715.65822
PROTEINSProtein Networks111339.0672.8142
NCI1Molecule Graphs411029.8732.30372
MUTAGMolecule Graphs18817.9319.7972
QM9 (QM7, QM8)Molecule Graphs133885----
BBBPMolecule Graphs203924.0525.94-2
Tox21Molecule Graphs783118.5125.94-12
ToxCastMolecule Graphs857518.7819.26-167
ClinToxMolecule Graphs147826.1327.86-2
MUVMolecule Graphs9308724.2326.28-17
HIVMolecule Graphs4112725.5327.48-2
SIDERMolecule Graphs142733.6435.36-27
BACEMolecule Graphs151334.1236.89-2
PTCMolecule Graphs34414.2914.69192
NCI109Molecule Graphs412729.6832.13-2
MutagenicityMolecule Graphs433730.3230.77-2
MNISTOthers (Image)-70000-78410
CIFAR10Others (Image)-60000-102410
METR-LAOthers (Traffic)120715152-
Amazon-ComputersOthers (Purchase)11375224586176710
Amazon-PhotoOthers (Purchase)176501190817458
ogbn-productsOthers (Purchase)124490296185914010047

A Summary of Open-source Codes

MethodsGithub
CDNMFhttps://github.com/6lyc/CDNMF
Graph Completionhttps://github.com/Shen-Lab/SS-GCNs
Node Attribute Maskinghttps://github.com/ChandlerBang/SelfTask-GNN
Edge Attribute Maskinghttp://snap.stanford.edu/gnn-pretrain
Attribute and Embedding DenoisingN.A.
Adjacency Matrix ReconstructionN.A.
Graph Berthttps://github.com/anonymous-sourcecode/Graph-Bert
Pretrain-Recsyshttps://github.com/jerryhao66/Pretrain-Recsys
SLAPShttps://github.com/BorealisAI/SLAPS-GNN
G-BERThttps://github.com/jshang123/G-Bert
GPT-GNNhttps://github.com/acbull/GPT-GNN
GraphCLhttps://github.com/Shen-Lab/GraphCL
IGSDN.A.
DACLN.A.
LCChttps://github.com/YuxiangRen
CCGLhttps://github.com/Xovee/ccgl
CSSLN.A.
GCChttps://github.com/THUDM/GCC
GRACEhttps://github.com/CRIPAC-DIG/GRACE
GCAhttps://github.com/CRIPAC-DIG/GCA
GROCN.A.
SEPThttps://github.com/Coder-Yu/QRec
STDGIN.A.
GMIhttps://github.com/zpeng27/GMI
KS2LN.A.
CG3N.A.
BGRLN.A.
SelfGNNhttps://github.com/zekarias-tilahun/SelfGNN
HeCohttps://github.com/liun-online/HeCo
PT-DGNNhttps://github.com/Mobzhang/PT-DGNN
COADhttps://github.com/allanchen95/Expert-Linking
Contrast-RegN.A.
C-SWMhttps://github.com/tkipf/c-swm
DGIhttps://github.com/PetarV-/DGI
HDMIN.A.
DMGIhttps://github.com/pcy1302/DMGI
MVGRLhttps://github.com/kavehhassani/mvgrl
HDGIhttps://github.com/YuxiangRen/Heterogeneous-Deep-Graph-Infomax
Subg-Conhttps://github.com/yzjiao/Subg-Con
Cotext Predictionhttp://snap.stanford.edu/gnn-pretrain
GIChttps://github.com/cmavro/Graph-InfoClust-GIC
GraphLoGhttps://openreview.net/forum?id=DAaaaqPv9-q
MHCNhttps://github.com/Coder-Yu/RecQ
EGIhttps://openreview.net/forum?id=J_pvI6ap5Mn
MICRO-Graphhttps://drive.google.com/file/d/1b751rpnV-SDmUJvKZZI-AvpfEa9eHxo9/
InfoGraphhttps://github.com/fanyun-sun/InfoGraph
SUGARhttps://github.com/RingBDStack/SUGAR
BiGIhttps://github.com/clhchtcjj/BiNE
HTCN.A.
DITNEThttps://github.com/FangpingWan/NeoDTI
Node Property Predictionhttps://github.com/ChandlerBang/SelfTask-GNN
S2GRLN.A.
PairwiseDistancehttps://github.com/ChandlerBang/SelfTask-GNN
PairwiseAttrSimhttps://github.com/ChandlerBang/SelfTask-GNN
Distance2Clusterhttps://github.com/ChandlerBang/SelfTask-GNN
EdgeMaskhttps://github.com/ChandlerBang/SelfTask-GNN
TopoTERN.A.
Centrality Score RankingN.A.
Meta-path predictionhttps://github.com/mlvlab/SELAR
SLiCEhttps://github.com/pnnl/SLICE
Distance2Labeledhttps://github.com/ChandlerBang/SelfTask-GNN
ContextLabelhttps://github.com/ChandlerBang/SelfTask-GNN
HCMN.A.
Contextual Molecular Property Predictionhttps://github.com/tencent-ailab/grover
Graph-level Motif Predictionhttps://github.com/tencent-ailab/grover
DrRepairhttps://github.com/michiyasunaga/DrRepair
Multi-stage Self-traininghttps://github.com/Davidham3/deeper_insights_into_GCNs
Node Clusteringhttps://github.com/Shen-Lab/SS-GCNs
Graph Partitioninghttps://github.com/Shen-Lab/SS-GCNs
CAGANN.A.
M3Shttps://github.com/datake/M3S
Cluster PreservingN.A.
SEFhttps://github.com/nealgravindra/self-supervsed_edge_feats

Contribute

If you would like to help contribute this list, please feel free to contact me or add pull request with the following Markdown format:

- Paper Name. 
  - Author List. *Conference Year*. [[pdf]](link) [[code]](link)

This is a Github Summary of our Survey. If you find this file useful in your research, please consider citing:

@article{wu2021self,
  title={Self-supervised Learning on Graphs: Contrastive, Generative, or Predictive},
  author={Wu, Lirong and Lin, Haitao and Tan, Cheng and Gao, Zhangyang and Li, Stan Z},
  journal={IEEE Transactions on Knowledge and Data Engineering},
  year={2021},
  publisher={IEEE}
}

Feedback

If you have any issue about this work, please feel free to contact me by email: