Generalizable-Prompt-Learning-for-VLMs
June 5, 2026 · View on GitHub
A curated list of prompt learning methods for vision-language models which can be used for base-to-novel generalizaiton.
Tips:
- All the papers included in this list contain base-to-novel generalization experiments. In other words, methods that do not demonstrate generalization capabilities are not listed here.
- The layout of this list is inspired by this repository, which is initiated and maintained by Zheng Li. He is currently a third-year Ph.D. student at Nankai University, with one more CCF A-level paper to be published before graduation. I wish him success in his further research and happy trail running.
- I was saddened to unexpectedly find that Yaohui Li (1997-2024), the second author of Conditional Prototype Rectification Prompt Learning (CPR, TCSVT 2025), tragically passed away in an accident after completing this work. In his homepage, his education timeline noted his PhD as expected to span from 2023 to 2027, but sadly, his 2027 will never come. Although I did not know him personally, I extend my heartfelt gratitude for his contributions to the field of prompt learning, and I hope that his legacy will inspire others to build upon his vision. May you rest in peace.
Table of Contents
Keywords
Use text-based parameter-efficient fine-tuning.
Use image-based parameter-efficient fine-tuning.
Use text- and image-based parameter-efficient fine-tuning
Published in 2026
ProLoGProLoG: Hybrid Prompt and LoRA Based Adaptation of Vision-Language Models for OOD Generalization AAAI 2026.
[Paper LinK] [Code Link(Empty)]RMAdapterRMAdapter: Reconstruction-based Multi-Modal Adapter for Vision-Language Models AAAI 2026.
[Paper LinK] [No code available]LOREALLOREAL: Mitigating Low-Resolution Challenges in Vision-Language Models with Attribute-driven Prompt Self-Distillation CVPR 2026.
[Paper LinK] [Code Link]CPTCluster-Aware Neural Collapse Prompt Tuning for Long-Tailed Generalization of Vision-Language Models CVPR 2026.
[No paper available] [No code available]ReBaPLReBaPL: Repulsive Bayesian Prompt Learning CVPR 2026.
[Paper LinK] [Code Link]PromisePROMISE: Prompt-Robust Vision-Language Models Via Meta-Finetuning ICLR 2026.
[Paper LinK] [No code available]NeRPNeRP: Neutral-Reference Prompting for Vision–Language Models ICML 2026.
[Paper LinK] [Code Link]SpecPLSpecPL: Disentangling Spectral Granularity for Prompt Learning ICML 2026.
[Paper LinK] [Code Link]LightRALightRA: Lightweight Residual Attention for Adaptation of Vision-Language Models TMM 2026.
[Paper LinK] [Code Link]
Published in 2025
Beyond the SeenBeyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning NIPS 2025.
[Paper LinK] [Code Link]TextRefinerTextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning AAAI 2025.
[Paper LinK] [Code Link]ProTextLearning to Prompt with Text Only Supervision for Vision-Language Models AAAI 2025.
[Paper Link] [Code Link]SPTRA Similarity Paradigm Through Textual Regularization Without Forgetting AAAI 2025.
[Paper Link] [No code available]FATEFATE: Feature-Adapted Parameter Tuning for Vision-Language Models AAAI 2025.
[Paper Link] [No code available]PTinCASPrompt Tuning In a Compact Attribute Space AAAI 2025.
[Paper Link] [No code available]DsRAExploring the Better Multimodal Synergy Strategy for Vision-Language Models AAAI 2025.
[Paper Link] [No code available]KAIDKAID: Knowledge-Aware Interactive Distillation for Vision-Language Models ACM MM 2025.
[No paper available] [No code available]CLIP-ASTAdaptive Parameter Selection for Tuning Vision-Language Models CVPR 2025.
[Paper Link] [No code available]MMRLMMRL: Multi-Modal Representation Learning for Vision-Language Models CVPR 2025.
[Paper Link] [Code Link]DPCDPC: Dual-Prompt Collaboration for Tuning Vision-Language Models CVPR 2025.
[Paper Link] [Code Link]2SFSRethinking Few-Shot Adaptation of Vision-Language Models in Two Stages CVPR 2025.
[Paper Link] [Code Link]SkipTSkip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves CVPR 2025.
[Paper Link] [Code Link]TACTask-Aware Clustering for Prompting Vision-Language Models CVPR 2025.
[Paper Link] [Code Link]ATPromptAdvancing Textual Prompt Learning with Anchored Attributes ICCV 2025.
[Paper Link] [Code Link]CaPLCausality-guided Prompt Learning for Vision-language Models via Visual Granulation ICCV 2025.
[No paper available] [Code Link]HicroPLHierarchical Cross-modal Prompt Learning for Vision-Language Models ICCV 2025.
[Paper Link] [Code Link]FMEnhancing Target-unspecific Tasks through a Features Matrix ICML 2025.
[Paper Link] [No code available]SurPLSurrogate Prompt Learning: Towards Efficient and Diverse Prompt Learning for Vision-Language Models ICML 2025.
[Paper Link] [Code Link]TAPTree of Attributes Prompt Learning For Vision Language Models ICLR 2025.
[Paper Link] [Code Link]DeKgDivergence-enhanced Knowledge-guided Context Optimization for Visual-Language Prompt Tuning ICLR 2025.
[Paper Link] [Code Link]DiSaDiSa: Directional Saliency-Aware Prompt Learning for Generalizable Vision-Language Models KDD 2025.
[Paper Link] [No code available]BIPBi-modality Individual-aware Prompt tuning for Visual-Language Model TPAMI 2025.
[Paper Link] [Code Link]CPRConditional Prototype Rectification Prompt Learning TCSVT 2025.
[Paper Link] [Code Link]LwEIBLearning with Enriched Inductive Biases for Vision-Language Models IJCV 2025.
[Paper Link] [Code Link]FCPromptFrequency-based Comprehensive Prompt Learning for Vision-Language Models TPAMI 2025.
[Paper Link] [Code Link]