C$^2$MIL: Dual-Causal Graph-Based MIL for Survival Analysis

November 2, 2025

Official PyTorch implementation of C$^2$MIL, a dual-causal graph-based multiple instance learning (MIL) model for robust and interpretable survival analysis on whole slide images (WSIs). The arXiv version of C$^2$MIL contains minor revisions that make the work more rigorous; see the arXiv version and the code in this repository for the details.

🔍 Overview

Graph-based MIL is widely used in computational pathology but faces two key challenges:

Semantic Confounding Bias:
Variations in staining, sectioning, and scanning introduce irrelevant features that harm generalization.
Topological Noise:
Not all subgraphs in WSIs are causally relevant to survival outcomes, leading to biased representations.

To tackle these, we propose C2MIL, which synchronizes semantic and topological causalities via a dual structural causal model.
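
For readers unfamiliar with the semantic side of this design, the backdoor adjustment named under Key Features has the following textbook form. The symbols are assumptions for illustration: $X$ stands for the WSI representation, $Y$ for the survival outcome, and $C$ for a confounder such as staining or scanner variation. How C2MIL parameterizes and learns $C$ is defined by the paper and code, not by this generic identity.

$$
P\big(Y \mid do(X)\big) = \sum_{c} P\big(Y \mid X, C = c\big)\, P(C = c)
$$

Averaging over the confounder strata with weights $P(C = c)$, rather than $P(C = c \mid X)$, is what removes the spurious path from $C$ to $Y$ through $X$.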

✨ Key Features

Cross-Scale Adaptive Feature Disentangling (CAFD):
Removes trivial semantic confounders via backdoor adjustment, adaptively learning confounders without prior knowledge.
Bernoulli Differentiable Subgraph Sampling:
Identifies causal subgraphs within WSIs using a straight-through estimator for robust topology learning.
Joint Optimization:
Combines semantic supervision and topological contrastive learning under causal invariance.
Generalizable & Interpretable:
Achieves state-of-the-art survival prediction while providing interpretable attention heatmaps and adaptive clustering.
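
The Bernoulli sampling with a straight-through estimator mentioned above can be sketched in PyTorch as follows. This is a minimal illustration under stated assumptions, not the repository's implementation: the function name `bernoulli_ste_mask`, the per-node logits, and the toy node features are all hypothetical.

```python
import torch


def bernoulli_ste_mask(logits: torch.Tensor) -> torch.Tensor:
    """Sample a hard 0/1 mask whose gradient flows through the keep-probabilities.

    Hypothetical helper for illustration; not taken from the C2MIL code base.
    """
    probs = torch.sigmoid(logits)
    # Hard Bernoulli sample; detached, so it carries no gradient of its own.
    hard = torch.bernoulli(probs.detach())
    # Straight-through trick: the forward value equals `hard`,
    # while the backward pass sees only the gradient of `probs`.
    return hard + probs - probs.detach()


torch.manual_seed(0)
node_logits = torch.zeros(6, requires_grad=True)  # assumed: one logit per graph node
node_feats = torch.randn(6, 4)                    # assumed: toy node features

mask = bernoulli_ste_mask(node_logits)
# Keep only the sampled nodes when pooling the graph representation.
pooled = (mask.unsqueeze(1) * node_feats).sum(dim=0)
pooled.sum().backward()  # gradients reach node_logits despite the discrete sample
```

The mask stays binary in the forward pass, so downstream layers see a real subgraph, while the logits still receive a gradient signal and can be trained end to end.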

📊 Performance

C2MIL achieves the highest C-index in every column of the table below, which covers three TCGA cohorts (KIRC, ESCA, BLCA) under both cross-validation (CV) and out-of-distribution (OOD) evaluation.

Model	Graph	Causal	KIRC (CV)	ESCA (CV)	BLCA (CV)	KIRC (OOD)	ESCA (OOD)	BLCA (OOD)
ABMIL	✗	✗	0.679	0.639	0.577	0.597	0.614	0.673
TransMIL	✗	✗	0.666	0.565	0.568	0.610	0.539	0.676
RRTMIL	✗	✗	0.678	0.620	0.566	0.584	0.589	0.679
DeepGraphConv	✓	✗	0.667	0.612	0.572	0.509	0.598	0.613
PatchGCN	✓	✗	0.686	0.652	0.576	0.606	0.568	0.697
ProtoSurv	✓	✗	0.698	0.619	0.593	0.610	0.598	0.695
IBMIL	✗	✓	0.697	0.589	0.553	0.616	0.571	0.654
C2MIL (Ours)	✓	✓	0.708	0.690	0.608	0.628	0.650	0.702
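
For readers unfamiliar with the metric in the table, the sketch below computes Harrell's concordance index in plain Python. It is an assumption that this variant matches the one used for the reported numbers; the function name `concordance_index` and the toy data are hypothetical, and the repository's evaluation code may treat ties differently.

```python
def concordance_index(times, events, risks):
    """Harrell's C-index: fraction of comparable pairs ranked correctly by risk.

    A pair (i, j) is comparable when subject i had an observed event and
    a strictly shorter time than subject j. Tied risks count as half.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored subject cannot anchor a comparable pair
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0
                elif risks[i] == risks[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy cohort (hypothetical): survival time, event indicator, predicted risk.
times = [5, 10, 15, 20]
events = [1, 1, 0, 1]
risks = [0.9, 0.4, 0.6, 0.1]
c_index = concordance_index(times, events, risks)  # 4 of 5 comparable pairs agree
```

A value of 0.5 corresponds to random ranking and 1.0 to a perfect ordering of risks against survival times.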

⚙️ Installation

git clone https://github.com/mimic0127/C2MIL.git
cd C2MIL
conda create -n c2mil python=3.9
conda activate c2mil
pip install -r requirements.txt

Dependencies: see requirements.txt.

1. Feature Extraction

Patch features:

python patch_fea_sample.py

Thumbnail features:

python thumbnail_svs.py
python thumbnail_fea_pocess.py

2. Graph Construction

Construct patch-level graphs with KNN based on patch coordinates:

python to_Graph.py
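
As a rough picture of what KNN graph construction on patch coordinates involves, here is a small pure-Python sketch. It is not the content of to_Graph.py: the function name `knn_edges`, the value of `k`, the Euclidean metric, and the toy coordinates are assumptions, since this README does not state them.

```python
import math


def knn_edges(coords, k):
    """Return directed edges (i, j) linking each patch i to its k nearest patches j."""
    edges = []
    for i, (xi, yi) in enumerate(coords):
        # Distance from patch i to every other patch, paired with the patch index.
        dists = [
            (math.hypot(xi - xj, yi - yj), j)
            for j, (xj, yj) in enumerate(coords)
            if j != i
        ]
        dists.sort()  # nearest first; ties are broken by the smaller index
        edges.extend((i, j) for _, j in dists[:k])
    return edges


# Toy patch coordinates in pixels (hypothetical): a 2x2 block plus one distant patch.
coords = [(0, 0), (256, 0), (0, 256), (256, 256), (1024, 1024)]
edges = knn_edges(coords, k=2)
```

This brute-force version is quadratic in the number of patches; for a real WSI with thousands of patches, a spatial index such as a KD-tree would be the usual choice.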

3. Training

Split the dataset into training, validation, and test subsets:

python fold.py
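
The kind of split this step produces can be sketched as below. This is an illustration only, not the logic of fold.py: the five-fold setup, the rotating validation fold, the seed, and the case IDs are all assumptions.

```python
import random


def make_folds(case_ids, n_folds=5, seed=0):
    """Partition cases into n_folds and rotate them through test/val/train roles."""
    ids = sorted(case_ids)
    random.Random(seed).shuffle(ids)  # seeded so the split is reproducible
    chunks = [ids[i::n_folds] for i in range(n_folds)]
    folds = []
    for f in range(n_folds):
        val_idx = (f + 1) % n_folds  # assumed: the next chunk serves as validation
        train = [
            c
            for g, chunk in enumerate(chunks)
            if g not in (f, val_idx)
            for c in chunk
        ]
        folds.append({"train": train, "val": chunks[val_idx], "test": chunks[f]})
    return folds


# Hypothetical case identifiers; real ones would come from the cohort metadata.
case_ids = [f"case_{i:03d}" for i in range(20)]
folds = make_folds(case_ids)
```

Splitting by case rather than by slide or patch keeps all data from one patient in a single subset, which avoids leakage between training and test sets.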

Train the model:

python train.py

4. Evaluation

Run evaluation and prediction:

python test_prediction.py

📂 Repository Structure

📜 Citation

If you find this repository useful, please ⭐️ star it and cite our paper:

@inproceedings{cen2025c2mil,
  title={C2MIL: Synchronizing Semantic and Topological Causalities in Multiple Instance Learning for Robust and Interpretable Survival Analysis},
  author={Cen, Min and Zhuang, Zhenfeng and Zhang, Yuzhe and Zeng, Min and Magnier, Baptiste and Yu, Lequan and Zhang, Hong and Wang, Liansheng},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
  pages={24392--24401},
  year={2025}
}

📝 License

This project is licensed under the MIT License.