ClustGDD
July 5, 2025 ยท View on GitHub
The code for paper "Simple yet Effective Graph Distillation via Clustering" published at KDD 2025
Brief Introduction
This paper proposes a simple yet effective approach ClustGDD for graph data distillation. ClustGDD achieves superior performance in condensation effectiveness and efficiency over previous GDD solutions on real datasets through two major contributions: a simple clustering method minimizing the WCSS and a lightweight module augmenting synthetic attributes with class-relevant features.
environment
numpy 1.26.0
matplotlib 3.8.2
ogb 1.3.6
python 3.11.5
scikit-learn 1.3.2
torch 2.0.1
torch-cluster 1.6.1+pt20cu118
torch-geometric 2.6.1
torch-scatter 2.1.2+pt20cu118
torch-sparse 0.6.17+pt20cu118
torch-spline-conv 1.2.2+pt20cu118
PS: deep_robust_data.py and deep_robust_uitls.py are two scource code deeprobust.graph.data and deeprobust.graph.utils from repo deeprobust, you can install it by
pip install deeprobust
and see the code.
Datasets
See the datasets used in https://github.com/ChandlerBang/GCond
Runs
sh main_induct.sh
sh main_transduct.sh
Reference
https://github.com/ChandlerBang/GCond
https://github.com/Amanda-Zheng/SFGC