Awesome Graph Self-Supervised Learning

August 15, 2024 · View on GitHub

GitHub stars GitHub forks

A curated list for awesome self-supervised graph representation learning resources. Inspired by awesome-deep-vision, awesome-adversarial-machine-learning, awesome-deep-learning-papers, awesome-architecture-search, and awesome-self-supervised-learning.

Why Self-Supervised?

Self-Supervised Learning has become an exciting direction in AI community.

Jitendra Malik: "Supervision is the opium of the AI researcher"
Alyosha Efros: "The AI revolution will not be supervised"
Yann LeCun: "self-supervised learning is the cake, supervised learning is the icing on the cake, reinforcement learning is the cherry on the cake"

Overview
Training Strategy
Contrastive Learning
- Same-Scale Contrasting
- Corss-Scale Contrasting
Generative Learning
- Graph Autoencoding
- Graph Autoregression
Predictive Learning
A Summary of Methodology Details
A Summary of Implementation Details
A Summary of Common Graph Datasets
A Summary of Open-source Codes

We extend the concept of self-supervised learning, which first emerged in the fields of computer vision and natural language processing, to present a timely and comprehensive review of the existing SSL techniques for graph data. Specifically, we divide existing graph SSL methods into three categories: contrastive, generative, and predictive as shown below.

Contrastive Learning: it contrasts the views generated by different data augmentation methods. The information about the differences and sameness between data-data pairs (inter-data) is used as self-supervision signals.
Generative Learning: it focuses on the (intra-data) information embedded in the data, generally based on prtext tasks such as reconstruction, which exploit the attributes and structure of the data itself as self-supervision signals.
Predictive Learning: it generally self-generates labels from graph data through some simple statistical analysis, or expert knowledge, and designs prediction-based pretext tasks based on the self-generated labels to handle the data-label relationship.

Training Strategy

Considering the relationship among bottleneck encoders, self-supervised pretext tasks, and downstream tasks, the training strategies can be divided into three categories: Pre-training and Fine-tuning (P&F), Joint Learning (JL), and Unsupervised Representation Learning (URL), with their detailed workflow shown below.

Pre-train&Fine-tune (P&F): it first pre-trains the encoder with unlabeled nodes by the self-supervised pretext tasks. The pre-trained encoder’s parameters are then used as the initialization of the encoder used in supervised fine-tuning for downstream tasks.
Joint Learning (JL): an auxiliary pretext task with self-supervision is included to help learn the supervised downstream task. The encoder is trained through both the pretext task and the downstream task simultaneously.
Unsupervised Representation Learning (URL): it first pre-trains the encoder with unlabeled nodes by the self-supervised pretext tasks. The pre-trained encoder’s parameters are then frozen and used in the supervised downstream task with additional labels.

Contrastive Learning

A general framework for contrastive learning is shown below. The two contrasting components may be local, contextual, or global, corresponding to node-level (marked in red), subgraph-level (marked in green), or graph-level (marked in yellow) information in the graph. The contrastive learning can thus contrast two views (at the same or different scales), which leads to two categories of algorithm: (1) same-scale contrasting, including Local-Local (L-L) contrasting, Context-Context (C-C) contrasting, and Global-Global (G-G) contrasting; and (2) cross-scale contrasting, including Local-Context (L-C) contrasting, Local-Global (L-G) contrasting, and Context-Global (C-G) contrasting.

Global-Global Contrasting

GraphCL: Graph Contrastive Learning with Augmentations.
- Y. You, T. Chen, Y. Sui, T. Chen, Z. Wang, and Y. Shen. NIPS 2020. [pdf] [code]
IGSD: Iterative Graph Self-Distillation.
- H. Zhang, S. Lin, W. Liu, P. Zhou, J. Tang, X. Arxiv 2020. [pdf]
DACL: Towards Domain-Agnostic Contrastive Learning.
- V. Verma, M.-T. Luong, K. Kawaguchi, H. Pham, andQ. V. Le. Arxiv 2020. [pdf]
LCC: Label Contrastive Coding Based Graph Neural Network for Graph Classification.
- Y. Ren, J. Bai, and J. Zhang. Arxiv 2021. [pdf] [code]
CCGL: Contrastive Cascade Graph Learning.
- X. Xu, F. Zhou, K. Zhang, and S. Liu. TKDE 2022. [pdf] [code]
CSSL: Contrastive Self-Supervised Learning for Graph Classification.
- J. Zeng and P. Xie. Arxiv 2020. [pdf]

Context-Context Contrasting

GCC: Graph Contrastive Coding for Graph Neural Network Pre-training.
- J. Qiu, Q. Chen, Y. Dong, J. Zhang, H. Yang, M. Ding, K. Wang, and J. Tang. KDD 2020. [pdf] [code]

Local-Local Contrasting

CDNMF: Contrastive Deep Nonnegative Matrix Factorization for Community Detection.
- Y. Li, J. Chen, C. Chen, L. Yang, Z. Zheng. ICASSP 2024. [pdf] [code]
GRACE: Deep Graph Contrastive Representation Learning.
- Y. Zhu, Y. Xu, F. Yu, Q. Liu, S. Wu, and L. Wang. Arxiv 2020. [pdf] [code]
GCA: Graph Contrastive Learning with Adaptive Augmentation.
- Y. Zhu, Y. Xu, F. Yu, Q. Liu, S. Wu, and L. Wang. Arxiv 2020. [pdf] [code]
GROC: Towards Robust Graph Contrastive Learning.
- N. Jovanovi´c, Z. Meng, L. Faber, and R. Wattenhofer. Arxiv 2021. [pdf]
SEPT: Socially-Aware Self-Supervised Tri-Training for Recommendation.
- J. Yu, H. Yin, M. Gao, X. Xia, X. Zhang, and N. Q. V.Hung. Arxiv 2021. [pdf] [code]
STDGI: Spatio-Temporal Deep Graph Infomax.
- F. L. Opolka, A. Solomon, C. Cangea, P. Veliˇckovi´c, P. Li` o, and R. D. Hjelm. Arxiv 2019. [pdf]
GMI: Graph Representation Learning via Graphical Mutual Information Maximization.
- L. Yu, S. Pei, C. Zhang, L. Ding, J. Zhou, L. Li, and X. Zhang. WWW 2020. [pdf] [code]
KS2L: Self-Supervised Smoothing Graph Neural Networks.
- L. Yu, S. Pei, C. Zhang, L. Ding, J. Zhou, L. Li, and X. Zhang. Arxiv 2020. [pdf]
CG3: Contrastive and Generative Graph Convolutional Networks for Graph-based Semi-Supervised Learning.
- S. Wan, S. Pan, J. Yang, and C. Gong. Arxiv 2020. [pdf]
BGRL: Bootstrapped Representation Learning on Graphs.
- S. Thakoor, C. Tallec, M. G. Azar, R. Munos, P. Veliˇckovi´c, and M. Valko. Arxiv 2021. [pdf][code]
SelfGNN: Self-supervised Graph Neural Networks without Explicit Negative Sampling.
- Z. T. Kefato and S. Girdzijauskas. Arxiv 2021. [pdf] [code]
HeCo: Self-supervised Heterogeneous Graph Neural Network with Co-contrastive Learning.
- X. Wang, N. Liu, H. Han, and C. Shi. Arxiv 2021. [pdf] [code]
PT-DGNN: Pre-training on Dynamic Graph Neural Networks.
- J. Zhang, K. Chen, and Y. Wang. Arxiv 2021. [pdf] [code]
COAD: Coad: Contrastive Pretraining with Adversarial Fine-tuning for Zero-shot Expert Linking.
- B. Chen, J. Zhang, X. Zhang, X. Tang, L. Cai, H. Chen, C. Li, P. Zhang, and J. Tang. Arxiv 2020. [pdf] [code]
Contrast-Reg: Improving Graph Representation Learning by Contrastive Regularization.
- K. Ma, H. Yang, H. Yang, T. Jin, P. Chen, Y. Chen, B. F. Kamhoua, and J. Cheng. Arxiv 2021. [pdf]
C-SWM: Contrastive Learning of Structured World Models.
- T. Kipf, E. van der Pol, and M. Welling. *Arxiv 2019. [pdf] [code]

Local-Global Contrasting

DGI: Deep Graph Infomax.
- P. Velickovic, W. Fedus, W. L. Hamilton, P. Li` o, Y. Bengio, and R. D. Hjelm. ICLR 2019. [pdf] [code]
HDMI: Hdmi: High-order Deep Multiplex Infomax.
- B. Jing, C. Park, and H. Tong. Arxiv 2021. [pdf]
DMGI: Unsupervised Attributed Multiplex Network Embedding.
- C. Park, D. Kim, J. Han, and H. Yu. AAAI 2020. [pdf] [code]
MVGRL: Contrastive Multi-View Representation Learning on Graphs.
- K. Hassani and A. H. K. Ahmadi. ICML 2020. [pdf] [code]
HDGI: Heterogeneous Deep Graph Infomax.
- Y. Ren, B. Liu, C. Huang, P. Dai, L. Bo, and J. Zhang. Arxiv 2019. [pdf] [code]

Local-Context Contrasting

CDNMF: Contrastive Deep Nonnegative Matrix Factorization for Community Detection.
- Y. Li, J. Chen, C. Chen, L. Yang, Z. Zheng. ICASSP 2024. [pdf] [code]
Subg-Con: Sub-graph Contrast for Scalable Self-Supervised Graph Representation Learning.
- Y. Jiao, Y. Xiong, J. Zhang, Y. Zhang, T. Zhang, and Y. Zhu. Arxiv 2020. [pdf] [code]
Cotext Prediction: Strategies for Pre-training Graph Neural Networks.
- W. Hu, B. Liu, J. Gomes, M. Zitnik, P. Liang, V. S. Pande, and J. Leskovec. ICLR 2020. [pdf] [code]
GIC: Leveraging Cluster-level Node Information for Unsupervised Graph Representation Learning.
- C. Mavromatis and G. Karypis. Arxiv 2020. [pdf] [code]
GraphLoG: Self-Supervised Graph-level Representation Learning with Local and Global Structure.
- M. Xu, H. Wang, B. Ni, H. Guo, and J. Tang. OpenReview 2021. [pdf] [code]
MHCN: Self-Supervised Multi-channel Hypergraph Convolutional Network for Social Recommendation.
- J. Yu, H. Yin, J. Li, Q. Wang, N. Q. V. Hung, and X. Zhang. Arxiv 2021. [pdf] [code]
EGI: Transfer Learning of Graph Neural Networks with Ego-graph Information Maximization.
- Q. Zhu, Y. Xu, H.Wang, C. Zhang, J. Han, and C. Yang. Arxiv 2020. [pdf] [code]

Context-Global Contrasting

MICRO-Graph: Motif-Driven Contrastive Learning of Graph Representations.
- S. Zhang, Z. Hu, A. Subramonian, and Y. Sun. Arxiv 2020. [pdf] [code]
InfoGraph: Unsupervised and Semi-Supervised Graph-level Representation Learning via Mutual Information Maximization.
- F. Sun, J. Hoffmann, V. Verma, and J. Tang. ICLR 2020. [pdf] [code]
SUGAR: Subgraph Neural Network with Reinforcement Pooling and Self-Supervised Mutual Information Mechanism.
- Q. Sun, H. Peng, J. Li, J. Wu, Y. Ning, P. S. Yu, and L. He. Arxiv 2021. [pdf] [code]
BiGI: Bipartite Graph Embedding via Mutual Information Maximization.
- J. Cao, X. Lin, S. Guo, L. Liu, T. Liu, and B. Wang. WSDM 2021. [pdf] [code]
HTC: Graph Representation Learning by Ensemble Aggregating Subgraphs via Mutual Information Maximization.
- C. Wang and Z. Liu. Arxiv 2021. [pdf]
DITNet: Drug Target Prediction using Graph Representation Learning via Substructures Contrast.
- S. Cheng, L. Zhang, B. Jin, Q. Zhang, and X. Lu. Preprints 2021. [pdf] [code]

Generative Learning

Graph Autoencoding

CDNMF: Contrastive Deep Nonnegative Matrix Factorization for Community Detection.
- Y. Li, J. Chen, C. Chen, L. Yang, Z. Zheng. ICASSP 2024. [pdf] [code]
GraphMAE: Self-supervised Masked Graph Autoencoders
- Z. Hou, X. Liu, Y. Cen, Y. Dong, H. Yang, C. Wang, and J. Tang. KDD 2022 [pdf] [code]
Graph Completion: When Does Self-Supervision Help Graph Convolutional Networks?
- Y. You, T. Chen, Z. Wang, and Y. Shen. PMLR 2020. [pdf] [code]
Node Attribute Masking: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
- W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]
Edge Attribute Masking: Strategies for Pre-training Graph Neural Networks.
- W. Hu, B. Liu, J. Gomes, M. Zitnik, P. Liang, V. S. Pande, and J. Leskovec. ICLR 2020. [pdf] [code]
Node Attribute and Embedding Denoising: Graph-based Neural Network Models with Multiple Self-Supervised Auxiliary Tasks.
- F. Manessi and A. Rozza. Arxiv 2020. [pdf]
Adjacency Matrix Reconstruction: Self-Supervised Training of Graph Convolutional Networks.
- Q. Zhu, B. Du, and P. Yan. Arxiv 2020. [pdf]
Graph Bert: Only Attention is Needed for Learning Graph Representations.
- J. Zhang, H. Zhang, C. Xia, and L. Sun. Arxiv 2020. [pdf] [code]
Pretrain-Recsys: Pretraining Graph Neural Networks for Cold-start Users and Items Representation.
- B. Hao, J. Zhang, H. Yin, C. Li, and H. Chen. WSDM 2021. [pdf] [code]
SLAPS: Self-Supervision Improves Structure Learning for Graph Neural Networks.
- B. Fatemi, L. E. Asri, and S. M. Kazemi. Arxiv 2021. [pdf] [code]
G-BERT: Pre-Training of Graph Augmented Transformers for Medication Recommendation.
- J. Shang, T. Ma, C. Xiao, and J. Sun. Arxiv 2019. [pdf] [code]

Graph Autoregression

GPT-GNN: Generative Pre-training of Graph Neural Networks.
- Z. Hu, Y. Dong, K. Wang, K. Chang, and Y. Sun. KDD 2020. [pdf] [code]

Predictive Learning

A comparison of the predictive learning is shown below. The predictive method generally self-generates labels from graph data and then designs prediction-based pretext tasks based on the self-generated labels. Categorized by how the labels areobtained, we summarize predictive learning methods forgraph data into four categories:

Node Property Prediction: it pre-calculates the node properties, such as node degree and used them as self-supervised labels.
Context-based Prediction: the local or global contextual information in the graph, such as the shortest path length between nodes can be extracted as labels to help with self-supervised learning.
Self-Training: it applies algorithms such as unsupervised clustering to obtain pseudo-labels and then updates the pseudo-label set of the previous stage based on the prediction results or losses.
Domain Knowledge-based Prediction: the domain knowledge, such as expert knowledge or specialized tools, can be used in advance to obtain informative labels.

Node Property Prediction

Node Property Prediction: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
- W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]

Context-based Prediction

S2GRL: Self-Supervised Graph Representation Learning via Global Context Prediction.
- Z. Peng, Y. Dong, M. Luo, X.-M. Wu, and Q. Zheng. Arxiv 2020. [pdf]
PairwiseDistance: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
- W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]
PairwiseAttsim: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
- W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]
Distance2Cluster: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
- W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]
EdgeMask: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
- W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]
TopoTER: Unsupervised Learning of Topology Transformation Equivariant Representations.
- X. Gao, W. Hu, and G.-J. Qi. OpenReview 2021. [pdf]
Centrality Score Ranking: Pretraining Graph Neural Networks for Generic Structural Feature Extraction.
- Z. Hu, C. Fan, T. Chen, K.-W. Chang, and Y. Sun. Arxiv 2019. [pdf]
Meta-path prediction: Self-supervised Auxiliary Learning with Meta-paths for Heterogeneous Graphs.
- D. Hwang, J. Park, S. Kwon, K. Kim, J. Ha, and H. J. Kim. NIPS 2020. [pdf] [code]
SLiCE: Self-Supervised Learning of Contextual Embeddings for Link Prediction in Heterogeneous Networks.
- P. Wang, K. Agarwal, C. Ham, S. Choudhury, and C. K. Reddy. Arxiv 2020. [pdf] [code]
Distance2Labeled: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
- W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]
Distance2Labeled: Self-Supervised Learning on Graphs: Deep Insights and New Direction.
- W. Jin, T. Derr, H. Liu, Y. Wang, S. Wang, Z. Liu, and J. Tang. Arxiv 2020. [pdf] [code]
HTM: Hop-count based Self-Supervised Anomaly Detection on Attributed Networks.
- T. Huang, Y. Pei, V. Menkovski, and M. Pechenizkiy. Arxiv 2021. [pdf]

Self-Training

Multi-stage Self-training: Deeper insights into Graph Convolutional Networks for Semi-Supervised Learning.
- Q. Li, Z. Han, and X. Wu. AAAI 2018. [pdf] [code]
Node Clustering and Partitioning: When Does Self-Supervision Help Graph Convolutional Networks.
- Y. You, T. Chen, Z. Wang, and Y. Shen. PMLR 2020. [pdf] [code]
CAGAN: Cluster-Aware Graph Neural Networks for Unsupervised Graph Representation Learning.
- Y. Zhu, Y. Xu, F. Yu, S. Wu, and L. Wang. Arxiv 2020. [pdf]
M3S: Multi-stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labeled Nodes.
- K. Sun, Z. Lin, and Z. Zhu. AAAI 2020. [pdf] [code]
Cluster Preserving: Pretraining Graph Neural Networks for Generic Structural Feature Extraction.
- Z. Hu, C. Fan, T. Chen, K.-W. Chang, and Y. Sun. Arxiv 2019. [pdf]
SEF: Self-Supervised Edge Features for Improved Graph Neural Network Training.
- A. Sehanobish, N. G. Ravindra, and D. van Dijk. Arxiv 2020. [pdf][code]

Domain Knowledge-based Prediction

Contextual Molecular Property Prediction: Self-Supervised Graph Transformer on Large-Scale Molecular Data.
- Y. Rong, Y. Bian, T. Xu, W. Xie, Y. Wei, W. Huang, and J. Huang. NIPS 2020. [pdf] [code]
Graph-level Motif Prediction: Self-Supervised Graph Transformer on Large-scale Molecular Data.
- Y. Rong, Y. Bian, T. Xu, W. Xie, Y. Wei, W. Huang, and J. Huang. NIPS 2020. [pdf] [code]
DrRepair: Graph-based, Self-Supervised Program Repair from Diagnostic Feedback.
- M. Yasunaga and P. Liang. PMLR 2020. [pdf] [code]

A summary of all the surveyed works is presented below.

A Summary of Methodology Details

About Graph Property, Pretext Task, Data Augmentation, Objective Function, Training Strategy, and Year of publication.

Methods	Graph Property	Pretext-Task	Data Augmentation	Objective Function	Training Strategy	Year
CDNMF	Attributed	Contrastive/L-C + Generative/AE	None	InfoNCE + AE	URL	2024
Graph Completion	Attributed	Generative/AE	Attribute Masking	MAE	P&F/JL	2020
Node Attribute Masking	Attributed	Generative/AE	Attribute Masking	MAE	P&F/JL	2020
Edge Attribute Masking	Attributed	Generative/AE	Attribute Masking	MAE	P&F	2019
Node Attribute and Embedding Denoising	Attributed	Generative/AE	Attribute Masking	MAE	JL	2020
Adjacency Matrix Reconstruction	Attributed	Generative/AE	Attribute Masking Edge Perturbation	MAE	JL	2020
Graph Bert	Attributed	Generative/AE	Attribute Masking Edge Perturbation	MAE	P&F	2020
Pretrain-Recsys	Attributed	Generative/AE	Edge Perturbation	MAE	P&F	2021
GPT-GNN	Heterogeneous	Generative/AR	Attribute Masking Edge Perturbation	MAE/InfoNCE	P&F	2020
GraphCL	Attributed	Contrastive/G-G	Attribute Masking Edge Perturbation Random Walk Sampling	InfoNCE	URL	2020
IGSD	Attributed	Contrastive/G-G	Edge Perturbation Edge Doffisopm	InfoNCE	JL/URL	2020
DACL	Attributed	Contrastive/G-G	Mixup	InfoNCE	URL	2020
LCC	Attributed	Contrastive/G-G	None	InfoNCE	JL	2021
CCGL	Attributed	Contrastive/G-G	Information Re-Diffusion	InfoNCE	P&F	2021
CSSL	Attributed	Contrastive/G-G	NodeInsertion Edge Perturbation Uniform Sampling	InfoNCE	P&F/JL/URL	2020
GCC	Unattributed	Contrastive/C-C	Random Walk Sampling	InfoNCE	P&F/URL	2020
GRACE	Attributed	Contrastive/L-L	Attribute Masking Edge Perturbation	InfoNCE	URL	2020
GCA	Attributed	Contrastive/L-L	Attention-based	InfoNCE	URL	2020
GROC	Attributed	Contrastive/L-L	Gradient-based	InfoNCE	URL	2021
SEPT	Attributed	Contrastive/L-L	Edge Perturbation	InfoNCE	JL	2021
STDGI	Spatial-Temporal	Contrastive/L-L	Attribute Shuffling	JS Estimator	URL	2019
GMI	Attributed	Contrastive/L-L	None	SP Estimator	URL	2020
KS2L	Attributed	Contrastive/L-L	None	InfoNCE	URL	2020
CG3	Attributed	Contrastive/L-L	None	InfoNCE	JL	2020
BGRL	Attributed	Contrastive/L-L	Attribute Masking Edge Perturbation	Inner Product	URL	2021
SelfGNN	Attributed	Contrastive/L-L	Attribute Masking Edge Diffusion	MSE	URL	2021
HeCo	Heterogeneous	Contrastive/L-L	None	InfoNCE	URL	2021
PT-DGNN	Dynamic	Contrastive/L-L	Attribute Masking Edge Perturbation	InforNCE	P&F	2021
COAD	Attributed	Contrastive/L-L	None	Triplet Margin Loss	P&F	2020
Contrst-Reg	Attributed	Contrastive/L-L	Attribute Shuffling	InfoNCE	JL	2021
DGI	Attributed	Contrastive/L-G	Arbitrary	JS Estimator	URL	2019
HDMI	Attributed	Contrastive/L-G	Attribute Shuffling	JS Estimator	URL	2021
DMGI	Heterogeneous	Contrastive/L-G	Attribute Shuffling	JS Estimator/MAE	URL	2020
MVGRL	Attributed	Contrastive/L-G	Attribute Masking Edge Perturbation Edge Diffusion Random Walk Sampling	DV Estimator JS Estimator NT-Xent InfoNCE	URL	2020
HDGI	Heterogeneous	Contrastive/L-G	Attribute Shuffling	JS Estimator	URL	2019
Subg-Con	Attributed	Contrastive/L-C	Importance Sampling	Triplet Margin Loss	URL	2020
Cotext Prediction	Attributed	Contrastive/L-C	Ego-nets Sampling	Cross Entropy	P&F	2019
GIC	Attributed	Contrastive/L-C	Arbitrary	JS Estimator	URL	2020
GraphLoG	Attributed	Contrastive/L-C	Attribute Masking	InfoNCE	URL	2021
MHCN	Heterogeneous	Contrastive/L-C	Attribute Shuffling	InfoNCE	JL	2021
EGI	Attributed	Contrastive/L-C	Ego-nets Sampling	SP Estimator	P&F	2020
MICRO-Graph	Attributed	Contrastive/C-G	Knowledge Sampling	InfoNCE	URL	2020
InfoGraph	Attributed	Contrastive/C-G	None	SP Estimator	URL	2019
SUGAR	Attributed	Contrastive/C-G	BFS Sampling	JS Estimator	JL	2021
BiGI	Heterogeneous	Contrastive/C-G	Edge Perturbation Ego-nets Sampling	JS Estimator	JL	2021
HTC	Attributed	Contrastive/C-G	Attribute Shuffling	SP Estimator DV Estimator	URL	2021
Node Property Prediction	Attributed	Predictive/Node Property	None	MAE	P&F/JL	2020
S2GRL	Attributed	Predictive/Context-based	None	Cross Entropy	URL	2020
PairwiseDistance	Attributed	Predictive/Context-based	None	Cross Entropy	P&F/JL	2020
PairwiseAttrSim	Attributed	Predictive/Context-based	None	MAE	P&F/JL	2020
Distance2Cluster	Attributed	Predictive/Context-based	None	MAE	P&F/JL	2020
EdgeMask	Attributed	Predictive/Context-based	None	Cross Entropy	P&F/JL	2020
TopoTER	Attributed	Predictive/Context-based	Edge Perturbation	Cross Entropy	URL	2021
Centrality Score Ranking	Attributed	Predictive/Context-based	None	Cross Entropy	P&F	2019
Meta-path prediction	Heterogeneous	Predictive/Context-based	None	Cross Entropy	JL	2020
SLiCE	Heterogeneous	Predictive/Context-based	None	Cross Entropy	P&F	2020
Distance2Labeled	Attributed	Predictive/Context-based	None	MAE	P&F/JL	2020
ContextLabel	Attributed	Predictive/Context-based	None	MAE	P&F/JL	2020
HCM	Attributed	Predictive/Context-based	Edge Perturbation	Bayesian Inference	URL	2021
Contextual Molecular Property Prediction	Attributed	Predictive/Domain-based	None	Cross Entropy	P&F	2020
Graph-level Motif Prediction	Attributed	Predictive/Domain-based	None	Cross Entropy	P&F	2020
Multi-stage Self-training	Attributed	Predictive/Self-training	None	None	JL	2018
Node Clustering	Attributed	Predictive/Self-training	None	Clustering	P&F/JL	2020
Graph Partitioning	Attributed	Predictive/Self-training	None	Graph Partitioning	P&F/JL	2020
CAGAN	Attributed	Predictive/Self-training	None	Clustering	URL	2020
M3S	Attributed	Predictive/Self-training	None	Clustering	JL	2020
Cluster Preserving	Attributed	Predictive/Self-training	None	Cross Entropy	P&F	2019

A Summary of Implementation Details

About Task Level, Evaluation Metric, and Evaluation Datasets.

Methods	Task Level	Evaluation Metric	Dataset
CDNMF	Node	Node Clustering (Acc, NMI)	Cora, Citeseer, Pubmed
Graph Completion	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed
Node Attribute Masking	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed, Reddit
Edge Attribute Masking	Graph	Graph Classification (ROC-AUC)	MUTAG, PTC, PPI, BBBP, Tox21, ToxCast, ClinTox, MUV, HIV, SIDER, BACE
Node Attribute and Embedding Denoising	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed
Adjacency Matrix Reconstruction	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed
Graph Bert	Node	Node Classification (Acc) Node Clustering (NMI)	Cora, Citeseer, Pubmed
Pretrain-Recsys	Node/Link	-	ML-1M, MOOCs and Last-FM
GPT-GNN	Node/Link	Node Classification (F1-score) Link Prediction (ROC-AUC)	OAG, Amazon, Reddit
GraphCL	Graph	Graph Classification (Acc, ROC-AUC)	NCI1, PROTEINS, D&D, COLLAB, RDT-B, RDT-M5K, GITHUB, MNIST, CIFAR10, MUTAG, IMDB-B, BBBP, Tox21, ToxCast, SIDER, ClinTox, MUV, HIV, BACE, PPI
IGSD	Graph	Graph Classification (Acc)	MUTAG, PTC_MR, NCI1, IMDB-B, QM9, COLLAB, IMDB-M
DACL	Graph	Graph Classification (Acc)	MUTAG, PTC_MR, IMDB-B, IMDB-M, RDT-B, RDT-M5K
LCC	Graph	Graph Classification (Acc)	IMDB-B, IMDB-M, COLLAB, MUTAG, PROTEINS, PTC, NCI1, D&D
CCGL	Graph	Cascade Graph Prediction (MSLE)	Weibo, Twitter, ACM, APS, DBLP
CSSL	Graph	Graph Classification (Acc)	PROTEINS, D&D, NCI1, NCI109, Mutagenicity
GCC	Node/Graph	Node Classification (Acc) Graph Classification (Acc)	US-Airport, H-index, COLLAB, IMDB-B, IMDB-M, RDT-B, RDT-M5K
GRACE	Node	Node Classification (Acc, Micro-F1)	Cora, Citeseer, Pubmed, DBLP, Reddit, PPI
GCA	Node	Node Classification (Acc)	Wiki-CS, Amazon-Computers, Amazon-Photo, Coauthor-CS, Coauthor-Physics
GROC	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed, Amazon-Photo, Wiki-CS
SEPT	Node/Link	-	Last-FM, Douban, Yelp
STDGI	Node	Node Regression (MAE, RMSE, MAPE)	METR-LA
GMI	Node/Link	Node Classification (Acc, Micro-F1) Link Prediction (ROC-AUC)	Cora, Citeseer, PubMed, Reddit, PPI, BlogCatalog, Flickr
KS2L	Node/Link	Node Classification (Acc) Link Prediction (ROC-AUC)	Cora, Citeseer, Pubmed, Amazon-Computers, Amazon-Photo, Coauthor-CS
CG3	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed, Amazon-Computers, Amazon-Photo, Coauthor-CS
BGRL	Node	Node Classification (Acc, Micro-F1)	Wiki-CS, Amazon-Computers, Amazon-Photo, PPI, Coauthor-CS, Coauthor-Physics, ogbn-arxiv
SelfGNN	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed, Amazon-Computers, Amazon-Photo, Coauthor-CS, Coauthor-Physics
HeCo	Node	Node Classification (ROC-AUC, Micro-F1, Macro-F1) Node Clustering (NMI, ARI)	ACM, DBLP, Freebase, AMiner
PT-DGNN	Link	Link Prediction (ROC-AUC)	HepPh, Math Overflow, Super User
COAD	Node/Link	Node Clustering (Precision, Recall, F1-score) Link Prediction (HitRatio@K, MRR)	AMiner, News, LinkedIn
Contrast-Reg	Node/Link	Node Classification (Acc) Node Clustering (NMI, Acc, Macro-F1) Link Prediction (ROC-AUC)	Cora, Citeseer, Pubmed, Reddit, ogbn-arxiv, Wikipedia, ogbn-products, Amazo-Computers, Amazo-Photo
DGI	Node	Node Classification (Acc, Micro-F1)	Cora, Citeseer, Pubmed, Reddit, PPI
HDMI	Node	Node Classification (Micro-F1, Macro-F1) Node Clustering (NMI)	ACM, IMDB, DBLP, Amazon
DMGI	Node	Node Clustering (NMI) Node Classification (Acc)	ACM, IMDB, DBLP, Amazon
MVGRL	Node/Graph	Node Classification (Acc) Node Clustering (NMI, ARI) Graph Classification (Acc)	Cora, Citeseer, Pubmed, MUTAG, PTC_MR, IMDB-B, IMDB-M, RDT-B
HDGI	Node	Node Classification (Micro-F1, Macro-F1) Node Clustering (NMI, ARI)	ACM, DBLP, IMDB
Subg-Con	Node	Node Classification (Acc, Micro-F1)	Cora, Citeseer, Pubmed, PPI, Flickr, Reddit
Cotext Prediction	Graph	Graph Classification (ROC-AUC)	MUTAG, PTC, PPI, BBBP, Tox21, ToxCast, ClinTox, MUV, HIV, SIDER, BACE
GIC	Node/Link	Node Classification (Acc) Node Clustering (Acc, NMI, ARI) Link Prediction (ROC-AUC, ROC-AP)	Cora, Citeseer, Pubmed, Amazon-Computers, Amazon-Photo, Coauthor-CS, Coauthor-Physics
GraphLoG	Graph	Graph Classification (ROC-AUC)	BBBP, Tox21, ToxCast, ClinTox, MUV, HIV, SIDER, BACE
MHCN	Node/Link	-	Last-FM, Douban, Yelp
EGI	Node/Link	Node Classification (Acc) Link Prediction (ROC-AUC, MRR)	YAGO, Airport
MICRO-Graph	Graph	Graph Classification (ROC-AUC)	BBBP, Tox21, ToxCast, ClinTox, HIV, SIDER, BACE
InfoGraph	Graph	Graph Classification (Acc)	MUTAG, PTC_MR, RDT-B, RDT-M5K, IMDB-B, QM9, IMDB-M
SUGAR	Graph	Graph Classification (Acc)	MUTAG, PTC, PROTEINS, D&D, NCI1, NCI109
BiGI	Link	Link Prediction (AUC-ROC, AUC-PR)	DBLP, ML-100K, ML-1M, Wikipedia
HTC	Graph	Graph Classification (Acc)	MUTAG, PTC_MR, IMDB-B, IMDB-M, RDT-B, QM9, RDT-M5K
Node Property Prediction	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed, Reddit
S2GRL	Node/Link	Node Classification (Acc, Micro-F1) Node Clustering (NMI) Link Prediction (ROC-AUC)	Cora, Citeseer, Pubmed, PPI, Flickr, BlogCatalog, Reddit
PairwiseDistance	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed, Reddit
PairwiseAttrSim	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed, Reddit
Distance2Cluster	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed, Reddit
EdgeMask	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed, Reddit
TopoTER	Node/Graph	Node Classification (Acc) Graph Classification (Acc)	Cora, Citeseer, Pubmed, MUTAG, PTC-MR, RDT-B, RDT-M5K, IMDB-B, IMDB-M
Centrality Score Ranking	Node/Link/Graph	Node Classification (Micro-F1) Link Prediction (Micro-F1) Graph Classification (Micro-F1)	Cora, Pubmed, ML-100K, ML-1M, IMDB-M, IMDB-B
Meta-path prediction	Node/Link	Node Classification (F1-score) Link Prediction (ROC-AUC)	ACM, IMDB, Last-FM, Book-Crossing
SLiCE	Link	Link Prediction (ROC-AUC, Micro-F1)	Amazon, DBLP, Freebase, Twitter, Healthcare
Distance2Labeled	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed, Reddit
ContextLabel	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed, Reddit
HCM	Node	Node Classification (ROC-AUC)	ACM, Amazon, Enron, BlogCatalog, Flickr
Contextual Molecular Property Prediction	Graph	Graph Classification (Acc) Graph Regression (MAE)	BBBP, SIDER, ClinTox, BACE, Tox21, ToxCast, ESOL, FreeSolv, Lipo, QM7, QM8
Graph-level Motif Prediction	Graph	Graph Classification (Acc) Graph Regression (MAE)	BBBP, SIDER, ClinTox, BACE, Tox21, ToxCast, ESOL, FreeSolv, Lipo, QM7, QM8
Multi-stage Self-training	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed
Node Clustering	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed
Graph Partitioning	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed
CAGAN	Node	Node Classfication (Micro-F1, Macro-F1) Node Clustering (Micro-F1, Macro-F1, NMI)	Cora, Citeseer, Pubmed
M3S	Node	Node Classification (Acc)	Cora, Citeseer, Pubmed
Cluster Preserving	Node/Link/Graph	Node Classification (Micro-F1) Link Prediction (Micro-F1) Graph Classification (Micro-F1)	Cora, Pubmed, ML-100K, ML-1M, IMDB-M, IMDB-B

A Summary of Common Graph Datasets

About category, graph number, node number per graph, edge number per graph, dimensionality of node attributes, class number, and citation papers.

Dataset	Category	#Graph	#Node (Avg.)	#Edge (Avg.)	#Feature	#Class
Cora	Citation Network	1	2708	5429	1433	7
Citeseer	Citation Network	1	3327	4732	3703	6
Pubmed	Citation Network	1	19717	44338	500	3
Wiki-CS	Citation Network	1	11701	216123	300	10
Coauthor-CS	Citation Network	1	18333	81894	6805	15
Coauthor-Physics	Citation Network	1	34493	247962	8415	5
DBLP (v12)	Citation Network	1	4894081	45564149	-	-
ogbn-arxiv	Citation Network	1	169343	1166243	128	40
Reddit	Social Network	1	232965	11606919	602	41
BlogCatalog	Social Network	1	5196	171743	8189	6
Flickr	Social Network	1	7575	239738	12047	9
COLLAB	Social Networks	5000	74.49	2457.78	-	2
RDT-B	Social Networks	2000	429.63	497.75	-	2
RDT-M5K	Social Networks	4999	508.52	594.87	-	5
IMDB-B	Social Networks	1000	19.77	96.53	-	2
IMDB-M	Social Networks	1500	13.00	65.94	-	3
ML-100K	Social Networks	1	2625	100000	-	5
ML-1M	Social Networks	1	9940	1000209	-	5
PPI	Protein Networks	24	56944	818716	50	121
D&D	Protein Networks	1178	284.32	715.65	82	2
PROTEINS	Protein Networks	1113	39.06	72.81	4	2
NCI1	Molecule Graphs	4110	29.87	32.30	37	2
MUTAG	Molecule Graphs	188	17.93	19.79	7	2
QM9 (QM7, QM8)	Molecule Graphs	133885	-	-	-	-
BBBP	Molecule Graphs	2039	24.05	25.94	-	2
Tox21	Molecule Graphs	7831	18.51	25.94	-	12
ToxCast	Molecule Graphs	8575	18.78	19.26	-	167
ClinTox	Molecule Graphs	1478	26.13	27.86	-	2
MUV	Molecule Graphs	93087	24.23	26.28	-	17
HIV	Molecule Graphs	41127	25.53	27.48	-	2
SIDER	Molecule Graphs	1427	33.64	35.36	-	27
BACE	Molecule Graphs	1513	34.12	36.89	-	2
PTC	Molecule Graphs	344	14.29	14.69	19	2
NCI109	Molecule Graphs	4127	29.68	32.13	-	2
Mutagenicity	Molecule Graphs	4337	30.32	30.77	-	2
MNIST	Others (Image)	-	70000	-	784	10
CIFAR10	Others (Image)	-	60000	-	1024	10
METR-LA	Others (Traffic)	1	207	1515	2	-
Amazon-Computers	Others (Purchase)	1	13752	245861	767	10
Amazon-Photo	Others (Purchase)	1	7650	119081	745	8
ogbn-products	Others (Purchase)	1	2449029	61859140	100	47

A Summary of Open-source Codes

Methods	Github
CDNMF	https://github.com/6lyc/CDNMF
Graph Completion	https://github.com/Shen-Lab/SS-GCNs
Node Attribute Masking	https://github.com/ChandlerBang/SelfTask-GNN
Edge Attribute Masking	http://snap.stanford.edu/gnn-pretrain
Attribute and Embedding Denoising	N.A.
Adjacency Matrix Reconstruction	N.A.
Graph Bert	https://github.com/anonymous-sourcecode/Graph-Bert
Pretrain-Recsys	https://github.com/jerryhao66/Pretrain-Recsys
SLAPS	https://github.com/BorealisAI/SLAPS-GNN
G-BERT	https://github.com/jshang123/G-Bert
GPT-GNN	https://github.com/acbull/GPT-GNN
GraphCL	https://github.com/Shen-Lab/GraphCL
IGSD	N.A.
DACL	N.A.
LCC	https://github.com/YuxiangRen
CCGL	https://github.com/Xovee/ccgl
CSSL	N.A.
GCC	https://github.com/THUDM/GCC
GRACE	https://github.com/CRIPAC-DIG/GRACE
GCA	https://github.com/CRIPAC-DIG/GCA
GROC	N.A.
SEPT	https://github.com/Coder-Yu/QRec
STDGI	N.A.
GMI	https://github.com/zpeng27/GMI
KS2L	N.A.
CG3	N.A.
BGRL	N.A.
SelfGNN	https://github.com/zekarias-tilahun/SelfGNN
HeCo	https://github.com/liun-online/HeCo
PT-DGNN	https://github.com/Mobzhang/PT-DGNN
COAD	https://github.com/allanchen95/Expert-Linking
Contrast-Reg	N.A.
C-SWM	https://github.com/tkipf/c-swm
DGI	https://github.com/PetarV-/DGI
HDMI	N.A.
DMGI	https://github.com/pcy1302/DMGI
MVGRL	https://github.com/kavehhassani/mvgrl
HDGI	https://github.com/YuxiangRen/Heterogeneous-Deep-Graph-Infomax
Subg-Con	https://github.com/yzjiao/Subg-Con
Cotext Prediction	http://snap.stanford.edu/gnn-pretrain
GIC	https://github.com/cmavro/Graph-InfoClust-GIC
GraphLoG	https://openreview.net/forum?id=DAaaaqPv9-q
MHCN	https://github.com/Coder-Yu/RecQ
EGI	https://openreview.net/forum?id=J_pvI6ap5Mn
MICRO-Graph	https://drive.google.com/file/d/1b751rpnV-SDmUJvKZZI-AvpfEa9eHxo9/
InfoGraph	https://github.com/fanyun-sun/InfoGraph
SUGAR	https://github.com/RingBDStack/SUGAR
BiGI	https://github.com/clhchtcjj/BiNE
HTC	N.A.
DITNET	https://github.com/FangpingWan/NeoDTI
Node Property Prediction	https://github.com/ChandlerBang/SelfTask-GNN
S2GRL	N.A.
PairwiseDistance	https://github.com/ChandlerBang/SelfTask-GNN
PairwiseAttrSim	https://github.com/ChandlerBang/SelfTask-GNN
Distance2Cluster	https://github.com/ChandlerBang/SelfTask-GNN
EdgeMask	https://github.com/ChandlerBang/SelfTask-GNN
TopoTER	N.A.
Centrality Score Ranking	N.A.
Meta-path prediction	https://github.com/mlvlab/SELAR
SLiCE	https://github.com/pnnl/SLICE
Distance2Labeled	https://github.com/ChandlerBang/SelfTask-GNN
ContextLabel	https://github.com/ChandlerBang/SelfTask-GNN
HCM	N.A.
Contextual Molecular Property Prediction	https://github.com/tencent-ailab/grover
Graph-level Motif Prediction	https://github.com/tencent-ailab/grover
DrRepair	https://github.com/michiyasunaga/DrRepair
Multi-stage Self-training	https://github.com/Davidham3/deeper_insights_into_GCNs
Node Clustering	https://github.com/Shen-Lab/SS-GCNs
Graph Partitioning	https://github.com/Shen-Lab/SS-GCNs
CAGAN	N.A.
M3S	https://github.com/datake/M3S
Cluster Preserving	N.A.
SEF	https://github.com/nealgravindra/self-supervsed_edge_feats

Contribute

If you would like to help contribute this list, please feel free to contact me or add pull request with the following Markdown format:

- Paper Name. 
  - Author List. *Conference Year*. [[pdf]](link) [[code]](link)

This is a Github Summary of our Survey. If you find this file useful in your research, please consider citing:

@article{wu2021self,
  title={Self-supervised Learning on Graphs: Contrastive, Generative, or Predictive},
  author={Wu, Lirong and Lin, Haitao and Tan, Cheng and Gao, Zhangyang and Li, Stan Z},
  journal={IEEE Transactions on Knowledge and Data Engineering},
  year={2021},
  publisher={IEEE}
}

Feedback

If you have any issue about this work, please feel free to contact me by email:

Lirong Wu: wulirong@westlake.edu.cn
Haitao Lin: linhaitao@westlake.edu.cn