🌴 [ICCV25] PALM: Performance Analysis of Active Learning Models

October 14, 2025 · View on GitHub

This is the official repository of “To Label or Not to Label: PALM – A Predictive Model for Evaluating Sample Efficiency in Active Learning Models”, presented at ICCV 2025.

The goal of PALM is to provide a unified and interpretable mathematical model designed to predict and analyze the behavior of Active Learning (AL) methods.

PALM provides a predictive description of learning dynamics from partial observations, enabling:

Estimation of future performance from early-stage results
Principled comparison across different AL strategies
Quantitative analysis of key factors affecting sample efficiency

🏝️ The PALM Model

PALM models AL performance trajectories using four interpretable parameters:

A = A_{\text{max}} \left( 1 - (1 - \delta)^{\left(\frac{B}{b} + \alpha\right)^{\beta}} \right)

Parameter	Meaning	Interpretation
A	Accuracy	Accuracy achieved by the model in the current AL episode.
Aₘₐₓ	Achievable accuracy	The asymptotic upper bound of accuracy achievable by the model.
δ (delta)	Coverage efficiency	Higher δ indicates better utilization of each labeled sample, improving data-space coverage.
α (alpha)	Initial learning efficiency	Lower α boosts initial accuracy, crucial in low-budget scenarios.
β (beta)	Scalability	Higher β increases the accuracy improvement rate as more samples are labeled.
b	Budget efficiency	Smaller `b` means more frequent updates with smaller batches, allowing smoother learning progression.

⚙️ Usage Guide

PALM can be used in two main ways:

Standalone mode: as a separate script integrated into your AL framework.
Input required: cumulative budget values per episode and corresponding accuracy results.
Framework mode (recommended): as a follow-up step inside the TypiClust framework for reproducible evaluation.

🧩 Environment Installation — Case 1 (Standalone PALM)

Use this setup if you only want to run PALM.py for curve fitting, without the full Active Learning framework.

1️⃣ Create an environment

conda create -n palm pytho=3.12 -y
conda activate palm
conda install numpy scipy matplotlib

🧩 Environment Installation — Case 2 (Framework mode)

To use PALM as a part of AL framework please refer to USAGE_Typiclust. Note that original TypiClust uses python 3.7. If you prefere to use python 3.11, here is the additional guide.

▶️ Running PALM

In both Cases, PALM.py automatically searches for .npy files in the following structure:

/path/to/AL/output/<DATASET>/<MODEL>/
├── <METHOD_1>_1/plot_episode_yvalues.npy
├── <METHOD_1>_2/plot_episode_yvalues.npy
├── <METHOD_2>_2/plot_episode_yvalues.npy
├── <METHOD_2>_2/plot_episode_yvalues.npy
└── ...

Each .npy file should be a 1D array of accuracies recorded after each Active Learning episode. The <METHOD_1> is name of your AL method and _N is the repetition under the same settings.

A) Fit on episode index (default mode) - Example

python PALM.py \
  --base_path /path/to/output/IMAGENET50/resnet50 \
  --dataset IMAGENET50 \
  --model resnet50 \
  --method random_ff_moco_b50 \
  --num_variants 5 \
  --num_samples 15 \
  --x_mode episodes \
  --out_dir palm_fit_out \
  --plot \
  --save_csv

B) Fit on cumulative-normalized budget - Example

python PALM.py \
  --base_path /path/to/output/IMAGENET50/resnet50 \
  --dataset IMAGENET50 \
  --model resnet50 \
  --method random_ff_moco_b50 \
  --num_variants 5 \
  --num_samples 15 \
  --x_mode cumulative_normalized \
  --budget_size 100 \
  --out_dir palm_fit_out \
  --plot \
  --save_csv

📦 Output Files

After running PALM.py, the following files will be saved in the directory specified by --out_dir
(default: ./palm_fit_out):

File	Description
`palm_params.json`	Fitted PALM parameters and metadata (dataset, model, method, episodes, etc.)
`palm_params.csv`	(Optional) CSV summary of the parameters, created if `--save_csv` is used
`y_avg.npy`	Averaged accuracy values across all selected runs
`y_fit.npy`	Fitted PALM curve values corresponding to the averaged data
`palm_fit.png`	Visualization of the PALM fit (generated if `--plot` is used)

💡 These outputs allow you to easily reproduce, visualize, or compare fitted curves across Active Learning methods.

🚩 Common Flags

Flag	Description
`--base_path`	Path to the folder containing run subdirectories (e.g. `output/CIFAR10/resnet50/`)
`--dataset`	Dataset name (for logging and output files)
`--model`	Model name (for logging and output files)
`--method`	Prefix of run folders (e.g. `typiclust_moco_b50`)
`--num_variants`	Number of repeated runs to average (e.g. 5 for `_1`, `_2`, `_3`, `_4`, `_5`)
`--num_samples`	Number of Active Learning episodes to use per run (`-1` = use all)
`--x_mode`	X-axis type: `episodes` (default) or `cumulative_normalized`
`--budget_size`	Budget per episode (only used if `x_mode=cumulative_normalized`)
`--out_dir`	Directory to save PALM outputs (default: `./palm_fit_out`)
`--plot`	Display and save the fitted PALM curve as a figure
`--save_csv`	Save fitted parameters to `palm_params.csv` in addition to JSON
`--init_A_max`, `--init_delta`, `--init_alpha`, `--init_beta`	Optional custom initial guesses for the curve-fitting procedure
`--alpha_lo`, `--alpha_hi`, `--beta_hi`	Bounds for fitting parameters (advanced use)

🔹 Use --help to see the full list of options and defaults:
python PALM.py --help

📚 Citing this Repository

If you find PALM useful in your research, please consider citing our ICCV 2025 paper and the repositories our work builds upon.

This repository builds on concepts and frameworks designed by TypiClust, SCAN, and Deep-AL. Please consider citing their work along with ours.

🏝️ PALM (ICCV 2025)

@article{machnio2025label,
  title={To Label or Not to Label: PALM--A Predictive Model for Evaluating Sample Efficiency in Active Learning Models},
  author={Machnio, Julia and Nielsen, Mads and Ghazi, Mostafa Mehdipour},
  journal={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
  year={2025}
}

@article{hacohen2022active,
  title={Active learning on a budget: Opposite strategies suit high and low budgets},
  author={Hacohen, Guy and Dekel, Avihu and Weinshall, Daphna},
  journal={arXiv preprint arXiv:2202.02794},
  year={2022}
}

@article{yehudaActiveLearningCovering2022,
  title = {Active {{Learning Through}} a {{Covering Lens}}},
  author = {Yehuda, Ofer and Dekel, Avihu and Hacohen, Guy and Weinshall, Daphna},
  journal={arXiv preprint arXiv:2205.11320},
  year={2022}
}

@article{mishal2024dcom,
      title={DCoM: Active Learning for All Learners}, 
      author={Mishal, Inbal and Weinshall, Daphna},
      journal={arXiv preprint arXiv:2407.01804},
      year={2024}
}

@inproceedings{vangansbeke2020scan,
  title={Scan: Learning to classify images without labels},
  author={Van Gansbeke, Wouter and Vandenhende, Simon and Georgoulis, Stamatios and Proesmans, Marc and Van Gool, Luc},
  booktitle={Proceedings of the European Conference on Computer Vision},
  year={2020}
}

@article{Chandra2021DeepAL,
    Author = {Akshay L Chandra and Vineeth N Balasubramanian},
    Title = {Deep Active Learning Toolkit for Image Classification in PyTorch},
    Journal = {https://github.com/acl21/deep-active-learning-pytorch},
    Year = {2021}
}

@article{Munjal2020TowardsRA,
  title={Towards Robust and Reproducible Active Learning Using Neural Networks},
  author={Prateek Munjal and N. Hayat and Munawar Hayat and J. Sourati and S. Khan},
  journal={ArXiv},
  year={2020},
  volume={abs/2002.09564}
}

License

This toolkit and PALM is released under the MIT license. Please see the LICENSE file for more information.