SAM 2-Driven Self-Training for Mammogram Segmentation: Zero-Shot Mask Generation via Pseudo-Video

September 18, 2025 · View on GitHub

Mauricio Fernandez M.¹ (mfernandez@csu.edu.cn), Yixiong Liang¹ (yxliang@csu.edu.cn), Christopher A. Cochran²*

¹School of Computer Science, Central South University, Changsha 410083, P.R. China
²School of Automation, Central South University, Changsha 410083, P.R. China

Read the Paper (IEEE Xplore) View the Code (Kaggle) Cite this Work

Methodology Overview

Overview of the SAM 2-driven self-training methodology for mammogram segmentation.

Accurate mammogram segmentation is crucial for breast cancer diagnosis. However, existing Deep Learning methods often require large, annotated datasets, which are time-consuming and expensive to obtain.

We introduce SAM 2-driven self-training, a novel approach that leverages SAM 2 for efficient and accurate mammogram segmentation. By constructing a pseudo-video sequence from static mammograms, we use SAM 2 video inference to generate initial masks, subsequently applying them for parameter-efficient adaptation of SAM, focusing on the mask decoder, and an automatic point prompt generator for enhanced usability.

Our method significantly reduces the need for manual annotation while maintaining high accuracy. Evaluations on the mini-MIAS and CBIS-DDSM datasets demonstrate significant improvements in accuracy and efficiency compared to established techniques and the original SAM. This robust solution facilitates rapid mammogram segmentation and aids creating annotated datasets with minimal user intervention.

Code

The implementation of our work is available in a series of Kaggle notebooks, covering the entire pipeline from data preparation to model training.

1. Data Preparation: This notebook covers the initial steps of processing the mammogram datasets, including artifact removal and normalization.
2. Pseudo-Mask Generation: This notebook demonstrates how to create a pseudo-video and use SAM 2 to generate the initial zero-shot segmentation masks.
3. Self-Training and Augmentation: This notebook details the self-training process, where the pseudo-masks are used to fine-tune the SAM decoder.

Key Contributions

Self-Training for Zero-Shot Mammogram Segmentation: Developed a novel and efficient self-training methodology that leverages the video mode of SAM 2 to generate accurate pseudo-labels for training a SAM decoder, enabling high-quality mammogram segmentation without the need for manual annotations.
Application of SAM 2 Video Mode for Static Medical Images: Adapted SAM 2 video mode for static mammogram segmentation using pseudo video sequences, achieving high zero-shot accuracy.
Automatic and Interpretable Single-Point Prompt Generation: Introduced an automatic and interpretable prompting methodology for mammograms.

@ARTICLE{11084376,
  author={Fernandez M., Mauricio and Liang, Yixiong and Cochran, Christopher A.},
  journal={IEEE Access}, 
  title={SAM 2-Driven Self-Training for Mammogram Segmentation: Zero-Shot Mask Generation via Pseudo-Video}, 
  year={2024},
  volume={},
  number={},
  pages={1-1},
  doi={10.1109/ACCESS.2024.3444453}}

SAM 2-Driven Self-Training for Mammogram Segmentation: Zero-Shot Mask Generation via Pseudo-Video

Methodology Overview

Abstract

Code

Key Contributions

Visual Overview

Example Preprocessing

Qualitative Results

Citation