LLaVA Interpretability

October 15, 2024 · View on GitHub

This repository provides code and resources for our paper, Towards Interpreting Visual Information Processing in Vision-Language Models. Our work explores techniques like logit lens, token ablation, and attention blocking to better understand how vision-language models process visual data.

Installation & Data Setup
Usage
Citation
Contact

Installation & Data Setup

Prerequisites

Ensure you have Python 3.8+ and pip installed.

Steps

Clone the repository:

git clone https://github.com/clemneo/llava-interp
cd llava-interp

Install required Python packages:
```
pip install -r requirements.txt
```
Download and unzip the COCO dataset images (2017):
```
wget -P data/ http://images.cocodataset.org/zips/train2017.zip
unzip data/train2017.zip -d data/
```
Note: The ZIP file is 19 GB, and the unzipped content is also 19 GB. Make sure you have at least 38 GB of free space available.

Download and unzip the annotations:

wget -P data/ http://images.cocodataset.org/annotations/annotations_trainval2017.zip
unzip data/annotations_trainval2017.zip -d data/

Usage

1. Logit Lens

scripts/logit_lens/create_logit_lens.py Run the model and create interative logit lens HTMLs for a set of images
scripts/logit_lens/generate_overview.py Generate an index.html to view a set of logit_lens HTMLs files.

2. Token Ablation Experiments

Preparation

Before running ablation experiments, create the mean vector used for ablation:

scripts/save_post_adapter_acts.py Caches activations of visual tokens
scripts/esimate_acts_size.py Estimates the size of the total cache
scripts/calculate_mean_vector.py Generates a mean vector using cached visual tokens.

The mean vector used in the paper for LLaVA 1.5 and LLaVA-Phi can be found in data/.

Running Experiments

scripts/ablation_experiment.py Runs ablation experiments on LLaVA 1.5 (generative and polling settings)
scripts/ablation_experiment_curate.py Runs ablation experiments on LLaVA-1.5 (VQA setting)
scripts/ablation_experiment_phi.py Runs ablation experiments on LLaVA-Phi (generative and polling settings)
scripts/ablation_experiment_phi_curate.py Runs ablation experiments on LLaVA-Phi (VQA setting)

3. Attention Blocking experiments

scripts/attention_experiment_curate.py Run attention blocking experiments on LLaVA 1.5

Citation

To cite our work, please use the following BibTeX entry:

@misc{neo2024interpretingvisualinformationprocessing,
      title={Towards Interpreting Visual Information Processing in Vision-Language Models}, 
      author={Clement Neo and Luke Ong and Philip Torr and Mor Geva and David Krueger and Fazl Barez},
      year={2024},
      eprint={2410.07149},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2410.07149}, 
}