Awesome-Prompt-Adapter-Learning-for-VLMs

June 3, 2026 · View on GitHub

A curated list of prompt/adapter learning methods for vision-language models (e.g., CLIP, ALIGN).

Table of Contents

💡Tips:

  • If you know that some papers published in top conferences (CVPR, ICCV, ECCV, ICML, NeurlPS, ICLR) or journals (TPAMI, IJCV, TIP) have not been included in this list, please feel free to contact me at any time, either by sending an email (zhengli97[at]qq.com) or submitting an issue.
  • We would appreciate more people joining us in maintaining this list of papers.
  • Note that papers without open-source code are not recommended.
  • We sincerely thank the following people for contributing to this list: Lingfeng Yang, Ge Wu, Jiazuo Yu, Qiji Ma[List]

Keywords

Use text-based prompts/adapters.

Use image-based prompts/adapters.

Use text- and image-based prompts/adapters.

Surveys

  • A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models. [Paper]
  • Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey. [Paper]
  • Generalizable Prompt Learning of CLIP: A Brief Overview. [Paper]

Foundation Models

  • CLIP Learning Transferable Visual Models From Natural Language Supervision. ICML 2021.
    [Paper] [Code]
  • ALIGN Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision. ICML 2021.
    [Paper]
  • LiT LiT: Zero-Shot Transfer with Locked-image text Tuning. CVPR 2022.
    [Paper] [Code]
  • EVA-CLIP EVA-CLIP: Improved Training Techniques for CLIP at Scale. 2023.
    [Paper] [Code]
  • SigLIP Sigmoid Loss for Language Image Pre-Training. ICCV 2023.
    [Paper] [Code]
  • AlphaCLIP Alpha-CLIP: A CLIP Model Focusing on Wherever You Want. CVPR 2024.
    [Paper] [Code]
  • CLIP-KD CLIP-KD: An Empirical Study of CLIP Model Distillation. CVPR 2024.
    [Paper] [Code] [论文解读]
  • LongCLIP Long-CLIP: Unlocking the Long-Text Capability of CLIP. ECCV 2024.
    [Paper] [Code]
  • CLIP-Refine Post-pre-training for Modality Alignment in Vision-Language Foundation Models. CVPR 2025.
    [Paper] [Code]
  • KUEA Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models. ICML 2025.
    [Paper] [Code]

Datasets

Base-to-Novel: ImageNet-1K, Caltech101, Oxford Pets, StanfordCars, Flowers102, Food101, FGVC Aircraft, SUN397, DTD, EuroSAT, UCF101.

Domain Generalization: ImageNet-V2, ImageNet-Sketch, ImageNet-Adversarial, ImageNet-Rendition.

Due to various factors, the links to some datasets may be outdated or invalid.
To make it easy for you to download these datasets, we maintain a repository on HuggingFace, which contains all the datasets to be used (except ImageNet). Each dataset also includes the corresponding split_zhou_xx.json file.

[Huggingface_Dataset_Download_Link]

General Prompt Learning

Experimental Comparison

Base-to-Novel Generalization. (ViT-B/16 CLIP)

MethodsPaperPubBaseNovelHM (main)CodeType
CLIPLinkICML 2169.3474.2271.70LinkModel
CoOpLinkIJCV 2282.6963.2271.66Link-
ATPromptLinkICCV 2582.6868.0474.65Link-
ATPrompt+PromptKD--87.0581.8284.35-Plugin
CoCoOpLinkCVPR 2280.4771.6975.83Link-
DPCLinkCVPR 2585.1568.8476.13Link-
DPC+PromptKD--87.5580.5583.91-Plugin
ProDALinkCVPR 2281.5672.3076.65Link-
TextRefinerLinkAAAI 2579.7474.3276.94Link-
TextRefiner+PromptKD--85.2279.6482.33-Plugin
KgCoOpLinkCVPR 2380.7373.6077.00Link-
RPOLinkICCV 2381.1375.0077.78Link-
DePTLinkCVPR 2483.8072.8977.97Link-
DePT+PromptSRC--85.1976.1780.43-Plugin
MaPLeLinkCVPR 2382.2875.1478.55Link-
QNetLinkICLR 2483.3275.6579.30Link-
CasPLLinkECCV 2484.7874.4979.30Link-
CasPL+PromptSRC--86.1179.5482.69-Plugin
TCPLinkCVPR 2484.1375.3679.51Link-
MMALinkCVPR 2483.2076.8079.87Link-
PromptSRCLinkICCV 2384.2676.1079.97Link-
2SFSLinkCVPR 2585.5575.4880.20Link-
HPTLinkAAAI 2484.3276.8680.23Link-
CoPromptLinkICLR 2484.0077.2380.48Link-
TAPLinkICLR 2584.7577.6381.04Link-
SkipTLinkCVPR 2585.0477.5381.11Link-
MMRLLinkCVPR 2585.6877.1681.20Link-
LLaMPLinkCVPR 2485.1677.7181.27Link-
DeARLinkCVPR 2685.9479.7382.72Link-
PromptKDLinkCVPR 2486.9680.7383.73Link-

Table 1. Average results on 11 datasets. (Only works with open-source code will be listed.)

Paper List

2022

  • CoOp Learning to Prompt for Vision-Language Models. IJCV 2022.
    [Paper] [Code]
  • CoCoOp Conditional Prompt Learning for Vision-Language Models. CVPR 2022.
    [Paper] [Code]
  • ProDA Prompt Distribution Learning. CVPR 2022.
    [Paper] [Code]
  • VPT Visual Prompt Tuning. ECCV 2022.
    [Paper] [Code]
  • VP Exploring Visual Prompts for Adapting Large-Scale Models. Arxiv 2022.
    [Paper] [Code]

2023

  • MaPLe MaPLe: Multi-modal Prompt Learning. CVPR 2023.
    [Paper] [Code]
  • KgCoOp Visual-Language Prompt Tuningx with Knowledge-guided Context Optimization. CVPR 2023.
    [Paper] [Code]
  • LASP LASP: Text-to-Text Optimization for Language-Aware Soft Prompting of Vision & Language Models. CVPR 2023.
    [Paper] [Code]
  • DAM-VP Diversity-Aware Meta Visual Prompting. CVPR 2023.
    [Paper] [Code]
  • TaskRes Task Residual for Tuning Vision-Language Models. CVPR 2023.
    [Paper] [Code]
  • RPO Read-only Prompt Optimization for Vision-Language Few-shot Learning. ICCV 2023.
    [Paper] [Code]
  • KAPT Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models. ICCV 2023.
    [Paper] [Code Not Found]
  • CuPL What does a platypus look like? Generating customized prompts for zero-shot image classification. ICCV 2023.
    [Paper] [Code]
  • ProGrad Prompt-aligned Gradient for Prompt Tuning. ICCV 2023.
    [Paper][Code]
  • PromptSRC Self-regulating Prompts: Foundational Model Adaptation without Forgetting. ICCV 2023.
    [Paper] [Code]
  • LFA Black Box Few-Shot Adaptation for Vision-Language models. ICCV 2023.
    [Paper] [Code]
  • DeFo Learning to Decompose Visual Features with Latent Textual Prompts. ICLR 2023.
    [Paper] [Code Not Found]
  • PLOT PLOT: Prompt Learning with Optimal Transport for Vision-Language Models. ICLR 2023.
    [Paper] [Code]
  • POMP Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition. NeurIPS 2023.
    [Paper] [Code]

2024

  • MetaPrompt Learning Domain Invariant Prompt for Vision-Language Models. TIP 2024.
    [Paper] [Code Not Found]
  • ProVP Progressive Visual Prompt Learning with Contrastive Feature Re-formation. IJCV 2024.
    [Paper] [Code]
  • CoPL CoPL: Contextual Prompt Learning for Vision-Language Understanding. AAAI 2024.
    [Paper] [Code Not Found]
  • SA2VP SA2VP: Spatially Aligned-and-Adapted Visual Prompt. AAAI 2024.
    [Paper] [Code]
  • HPT Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models. AAAI 2024.
    [Paper] [Code]
  • LaViP LaViP: Language-Grounded Visual Prompts. AAAI 2024.
    [Paper] [Code Not Found]
  • CoPrompt Consistency-guided Prompt Learning for Vision-Language Models. ICLR 2024.
    [Paper] [Code]
  • PromptKD PromptKD: Unsupervised Prompt Distillation for Vision Language Models. CVPR 2024.
    [Paper] [Code] [中文版] [论文解读] [视频解读]
  • DePT DePT: Decoupled Prompt Tuning. CVPR 2024.
    [Paper] [Code]
  • ArGue ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models. CVPR 2024.
    [Paper] [Code Not Found]
  • TCP TCP: Textual-based Class-aware Prompt tuning for Visual-Language Model. CVPR 2024.
    [Paper] [Code]
  • MMA MMA: Multi-Modal Adapter for Vision-Language Models. CVPR 2024.
    [Paper] [Code]
  • LLaMP Large Language Models are Good Prompt Learners for Low-Shot Image Classification. CVPR 24.
    [Paper] [Code]
  • KDPL Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation. ECCV 2024.
    [Paper] [Code]
  • CoCoLe Conceptual Codebook Learning for Vision-Language Models. ECCV 2024.
    [Paper] [Code Not Found]
  • CasPL Cascade Prompt Learning for Vision-Language Model Adaptation. ECCV 2024.
    [Paper] [Code] [论文解读]
  • GalLoP GalLoP: Learning Global and Local Prompts for Vision-Language Models. ECCV 2024.
    [Paper] [Code Not Found]
  • AWT AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation. NeurIPS 2024.
    [Paper] [Code]
  • QNet Prompt Learning with Quaternion Networks. ICLR 2024.
    [Paper] [Code(Empty)]
  • QMaPLe Quantized Prompt for Efficient Generalization of Vision-Language Models. ECCV 2024.
    [Paper] [Code(Empty)]

2025

  • TextRefiner TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning. AAAI 2025.
    [Paper] [Code] [论文解读]
  • ProText Learning to Prompt with Text Only Supervision for Vision-Language Models. AAAI 2025.
    [Paper] [Code]
  • FATE FATE: Feature-Adapted Parameter Tuning for Vision-Language Models. AAAI 2025.
    [Paper] [Code Not Found]
  • TAP Tree of Attributes Prompt Learning For Vision Language Models. ICLR 2025.
    [Paper] [Code]
  • DeKg Divergence-enhanced Knowledge-guided Context Optimization for Visual-Language Prompt Tuning. ICLR 2025.
    [Paper] [Code]
  • MMRL MMRL: Multi-Modal Representation Learning for Vision-Language Models. CVPR 2025.
    [Paper] [Code]
  • DPC DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models. CVPR 2025.
    [Paper] [Code] [论文解读]
  • 2SFS Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages. CVPR 2025.
    [Paper] [Code]
  • SkipT Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves. CVPR 2025.
    [Paper] [Code]
  • NLPrompt NLPrompt: Noise-Label Prompt Learning for Vision-Language Models. CVPR 2025.
    [Paper] [Code]
  • TAC Task-Aware Clustering for Prompting Vision-Language Models. CVPR 2025.
    [Paper] [Code]
  • OpenworldAUC OpenworldAUC: Towards Unified Evaluation and Optimization for Open-world Prompt Tuning. ICML 2025.
    [Paper] [Code]
  • FM Enhancing Target-unspecific Tasks through a Features Matrix. ICML 2025.
    [Paper] [Code Not Found]
  • SurPL Surrogate Prompt Learning: Towards Efficient and Diverse Prompt Learning for Vision-Language Models. ICML 2025.
    [Paper] [Code]
  • ATPrompt Advancing Textual Prompt Learning with Anchored Attributes. ICCV 2025.
    [Paper] [Code] [论文解读] [中文版]
  • HicroPL Hierarchical Cross-modal Prompt Learning for Vision-Language Models. ICCV 2025.
    [Paper] [Code]
  • CaPL Causality-guided Prompt Learning for Vision-language Models via Visual Granulation. ICCV 2025.
    [Paper] [Code(Empty)]
  • LwEIB Learning with Enriched Inductive Biases for Vision-Language Models IJCV 2025.
    [Paper] [Code]
  • BIP Bi-modality Individual-aware Prompt tuning for Visual-Language Model. TPAMI 2025.
    [Paper] [Code]
  • DAPT Decouple before Align: Visual Disentanglement Enhances Prompt Tuning. TPAMI 2025.
    [Paper] [Code]
  • Spotlighter Spotlighter: Revisiting Prompt Tuning from a Representative Mining View. EMNLP 2025 Findings.
    [Paper] [Code]
  • VaMP VaMP: Variational Multi-Modal Prompt Learning for Vision-Language Models. NeurIPS 2025.
    [Paper] [Code Not Found]
  • KAID KAID: Knowledge-Aware Interactive Distillation for Vision-Language Models. ACM MM 2025.
    [Paper] [Code Not Found] [论文解读]
  • AnchorOPT AnchorOPT: Towards Optimizing Dynamic Anchors for Adaptive Prompt Learning. arxiv 25.
    [Paper] [Code]

2026

  • Dropout Prompt Learning: Towards Robust and Adaptive Vision-Language Models. AAAI 2026.
    [Paper] [Code]
  • End-to-End Knowledge Distillation for Unsupervised Domain Adaptation with Large Vision-language Models. AAAI 2026.
    [Paper] [Code Not Found]
  • Towards Calibrating Prompt Tuning of Vision-Language Models. CVPR 2026.
    [Paper] [Code]
  • AVION AVION: Aerial Vision–Language Instruction from Offline Teacher to Prompt-Tuned Network. CVPR 2026.
    [Paper][Code(Empty)]
  • CAPT CAPT: Confusion-Aware Prompt Tuning for Reducing Vision-Language Misalignment. CVPR 2026.
    [Paper] [Code]
  • DeAR DeAR: Fine-Grained VLM Adaptation by Decomposing Attention Head Roles. CVPR 2026.
    [Paper] [Code]
  • CAKI Plug-and-play Class-aware Knowledge Injection for Prompt Learning with Visual-Language Model. IJCV 2026.
    [Paper] [Code(Empty)]
  • NeRP Neutral-Reference Prompting for Vision–Language Models. ICML 2026.
    [Paper] [Code]
  • SDPT SDPT: Synchronous Dual Prompt Tuning for Visual-Language Pre-trained Models. TPAMI 2026.
    [Paper] [Code]
  • MMA++ MMA++: Effective Multi-Modal Adaptation for Vision-Language Models. TPAMI 2026.
    [Paper] [Code(Empty)]
  • CASPA-G CASPA: Graph-Structured Concept Anchors for Modality-Agnostic Adaptation in Vision-Language Models. CVPR 2026.
    [Paper] [Code Not Found]

Another form of Prompt

Paper List

  • CPT CPT: Colorful Prompt Tuning for pre-trained vision-language models. Arxiv 2021.
    [Paper] [Code]
  • DetPro Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model. CVPR 2022.
    [Paper] [Code]
  • PromptDet PromptDet: Towards Open-vocabulary Detection using Uncurated Images. ECCV 2022.
    [Paper] [Code]
  • Visual Prompting via Image Inpainting. NeurIPS 2022.
    [Paper]
  • OVSeg Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP. CVPR 2023.
    [Paper] [Code]
  • LoGoPrompt LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for Vision-Language Models. ICCV 2023.
    [Paper]
  • RedCircle What does CLIP know about a red circle? Visual prompt engineering for VLMs. ICCV 2023.
    [Paper]]
  • FGVP Fine-Grained Visual Prompting. NeurIPS 2023.
    [Paper] [Code]
  • SoM Set-of-mark prompting unleashes extraordinary visual grounding in gpt-4v. Arxiv 2023.
    [Paper] [Code]
  • Alpha-CLIP Alpha-CLIP: A CLIP Model Focusing on Wherever You Want. CVPR 2024.
    [Paper] [Code]
  • ViP-LLaVA Making Large Multimodal Models Understand Arbitrary Visual Prompts. CVPR 2024.
    [Paper] [Code]
  • SSC Segment, Select, Correct: A Framework for Weakly-Supervised Referring Segmentation. ECCV 2024.
    [Paper] [Code]

General Test-time Prompt Learning

Experimental Comparison

MethodsPubImageNet-A-V2-R-SAvg. (main)Code
CoOpIJCV 2271.5149.7164.2075.2147.9959.28Link
CoCoOpCVPR 2271.0250.6364.0776.1848.7559.91Link
DiffTPTICCV 2370.3055.6865.1075.0046.8060.65Link
TPTNeurIPS 2268.9854.7763.4577.0647.9460.81Link
TPT+CoOpNeurIPS 2273.6157.9566.8377.2749.2962.84Link
PromptAlignNeurIPS 23---59.3765.2979.3359.3763.55Link
TPS+CoOpArxiv 2473.7360.4966.8477.4449.0865.52Link
RLCFICLR 2473.2365.4569.7783.3554.7468.33Link
RLCF+CoOpICLR 2476.0569.7470.6284.5156.4970.34Link
COSMICCVPR 2578.1973.3269.6285.6062.7972.83Link

Table 2. Test-time prompt tuning methods on OOD data.

Paper List

  • TPT Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models. NeurIPS 2022.
    [Paper] [Code]
  • SwapPrompt SwapPrompt: Test-Time Prompt Adaptation for Vision-Language Models. NeurIPS 2023.
    [Paper]
  • PrompAlign Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization. NeurIPS 2023.
    [Paper] [Code]
  • TPS Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models. Arxiv 2024.
    [Paper] [Code]
  • RLCF Test-time Adaptation with CLIP reward for zero-shot generalization in Vision-Language Models. ICLR 2024.
    [Paper] [Code]
  • InTTA Invariant Test-Time Adaptation for Vision-Language Model Generalization. Arxiv 2024.
    [Paper] [Code]
  • TDA Efficient Test-Time Adaptation of Vision-Language Models. CVPR 2024.
    [Paper] [Code]
  • DMN Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models. CVPR 2024.
    [Paper] [Code]
  • C-TPT C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion. ICLR 2024.
    [Paper] [Code]
  • DynaPrompt DynaPrompt: Dynamic Test-Time Prompt Tuning. ICLR 2025.
    [Paper]
  • R-TPT R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning. CVPR 2025.
    [Paper] [Code]
  • StatA Realistic Test-Time Adaptation of Vision-Language Models. CVPR 2025.
    [Paper] [Code]
  • O-TPT O-TPT: Orthogonality Constraints for Calibrating Test-time Prompt Tuning in Vision-Language Models. CVPR 2025.
    [Paper] [Code]
  • COSMIC COSMIC: Clique-Oriented Semantic Multi-space Integration for Robust CLIP Test-Time Adaptation. CVPR 2025.
    [Paper] [Code]
  • Multi-Cache Enhanced Prototype Learning for Test-Time Generalization of Vision-Language Models. ICCV 2025.
    [Paper] [Code]

General Adapter Learning

Paper List

  • CLIP-Adapter CLIP-Adapter: Better Vision-Language Models with Feature Adapters. Arxiv 2021.
    [Paper] [Code]
  • Tip-Adapter Tip-Adapter: Training-free Adaption of CLIP for Few-shot Classification. ECCV 2022.
    [Paper] [Code]
  • APE Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement. ICCV 2023.
    [Paper] [Code]
  • CaFoPrompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners. CVPR 2023.
    [Paper] [Code]
  • Meta-Adapter Meta-Adapter: An Online Few-shot Learner for Vision-Language Model. NeurIPS 2023.
    [Paper] [Code]
  • AMU-Tuning AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning. CVPR 2024.
    [Paper] [Code]
  • LDC Logits DeConfusion with CLIP for Few-Shot Learning. CVPR 2025.
    [Paper] [Code]
  • VtT Reclaiming Lost Text Layers for Source-Free Cross-Domain Few-Shot Learning. CVPR 2026.
    [Paper] [Code(Empty)]

Video Understanding

Prompt Learning

  • ActionCLIP Actionclip: A new paradigm for video action recognition. arxiv 21.
    [Paper] [Code]
  • VideoPrompt Prompting visual-language models for efficient video understanding. ECCV 2022.
    [Paper] [Code]
  • InTTA Expanding Language-Image Pretrained Models for General Video Recognition. ECCV 2022.
    [Paper] [Code]
  • RePro Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection. ICLR 2023.
    [Paper] [Code]
  • Vita-CLIP Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting. CVPR 2023.
    [Paper] [Code]
  • ViFi-CLIP Fine-tuned CLIP Models are Efficient Video Learners. CVPR 2023.
    [Paper] [Code]
  • OpenVCLIP Open-VCLIP: Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization. ICML 2023.
    [Paper] [Code]
  • M2-CLIP M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition. AAAI 2024.
    [Paper] [Code]
  • ViLT-CLIP ViLT-CLIP: Video and Language Tuning CLIP with Multimodal Prompt Learning and Scenario-Guided Optimization. AAAI 2024.
    [Paper] [Code(None)]
  • FROSTER FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action Recognition. ICLR 2024.
    [Paper] [Code]

Adapter Learning

  • BT-Adapter BT-Adapter: Video Conversation is Feasible Without Video Instruction Tuning. CVPR 2024.
    [Paper] [Code]

Continual Learning

Prompt Learning

  • L2P Learning to Prompt for Continual Learning. CVPR 2022.
    [Paper] [Code]
  • DualPrompt DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning. ECCV 2022.
    [Paper] [Code]
  • EvoPrompt Evolving Parameterized Prompt Memory for Continual Learning. AAAI 2024.
    [Paper]
  • CPrompt Consistent Prompting for Rehearsal-Free Continual Learning. CVPR 2024.
    [Paper] [Code]
  • DIKI Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models. ECCV 2024.
    [Paper] [Code]

Adapter Learning

  • MoE-Adapters4CL Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters. CVPR 2024.
    [Paper] [Code]
  • SSIAT Semantically-Shifted Incremental Adapter-Tuning is A Continual ViTransformer. CVPR 2024.
    [Paper] [Code]

Others

OOD

  • LoCoOp LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning. NeurIPS 2023.
    [Paper] [Code]
  • DeCoOp DeCoOp: Robust Prompt Tuning with Out-of-Distribution Detection. ICML 2024.
    [Paper] [Code]

Point Cloud

  • IDPT Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models. ICCV 2023.
    [Paper] [Code]
  • PPT Parameter-efficient Prompt Learning for 3D Point Cloud Understanding. ICRA 2024.
    [Paper] [Code]
  • Point-PRC Point-PRC: A Prompt Learning Based Regulation Framework for Generalizable Point Cloud Analysis. NeurIPS 2024.
    [Paper] [Code]

BioMedical

  • BiomedCoOp BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models. CVPR 2025.
    [Paper] [Code]
  • MAPLE MAPLE: Multi-scale Attribute-enhanced Prompt Learning for Few-shot Whole Slide Image Classification. NeurIPS 2025.
    [Paper] [Code]

Robot

  • PPL Think Small, Act Big: Primitive Prompt Learning for Lifelong Robot Manipulation. CVPR 2025.
    [Paper]

Retrieval

  • CLIP4clip Clip4clip: An empirical study of clip for end to end video clip retrieval and captioning. Neurocomputing 2022.
    [Paper] [Code]
  • VoP VoP: Text-Video Co-Operative Prompt Tuning for Cross-Modal Retrieval. CVPR 2023.
    [Paper] [Code]
  • DGL DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval. AAAI 2024.
    [Paper] [Code]

Action

  • SCoPLe Semantic-guided Cross-Modal Prompt Learning for Skeleton-based Zero-shot Action Recognition. CVPR 2025.
    [Paper] [Code Not Found]

Federal Learning

  • FedTPG Federated Text-driven Prompt Generation for Vision-Language Models. ICLR 2024.
    [Paper] [[Code Not Found]]
  • FedMVP FedMVP: Federated Multimodal Visual Prompt Tuning for Vision-Language Models. ICCV 2025.
    [Paper] [Code]
  • FedMPT FedMPT: Federated Multi-label Prompt Tuning of Vision-Language Models. CVPR 2026.
    [Paper] [Web Page]

Anomaly Detection

  • PromptAD PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection. CVPR 2024.
    [Paper] [Code]

Low Resolution

  • LOREAL LOREAL: Mitigating Low-Resolution Challenges in Vision-Language Models with Attribute-driven Prompt Self-Distillation. CVPR 2026.
    [Paper] [Web Page]