README.md

April 1, 2026 · View on GitHub

FGVEdit

Official code and data release for
Visual-Oriented Fine-Grained Knowledge Editing for Multimodal Large Language Models

Dataset

Table of Contents

🛠️ About This Project
🚀 Getting Started
🧪 Usage
🎉 Acknowledgments

🛠️ About This Project

FGVEdit is a benchmark and codebase for fine-grained multimodal knowledge editing. The current release contains data processing and experiment entrypoints for four editing methods:

IKE
MEND
SERAC
MSCKE

The released code currently supports two multimodal backbones:

BLIP-2
MiniGPT-4

The default data split used by this repository is:

train_data.json: 8334 samples
test_data.json: 2778 samples

Each sample contains the edit target, rephrase query, textual locality query, fine-grained generality query, fine-grained locality query, and the associated image path.

(back to top)

🧪 Usage

All experiment entrypoints are at the repository root:

edit_IKE.py
edit_MEND.py
edit_SERAC.py
edit_MSCKE.py

All hyper-parameters are stored in hparams/.

IKE

Configs:

hparams/IKE/blip2.yaml
hparams/IKE/minigpt4.yaml

Run embedding generation only:

python edit_IKE.py --model blip2 --mode embed
python edit_IKE.py --model minigpt4 --mode embed

Run evaluation:

python edit_IKE.py --model blip2 --mode eval
python edit_IKE.py --model minigpt4 --mode eval

The script will build train-set embeddings and then evaluate on data/FGVEdit/test_data.json.

MEND

Training configs:

hparams/TRAINING/MEND/blip2.yaml
hparams/TRAINING/MEND/minigpt4.yaml

Evaluation configs:

hparams/MEND/blip2.yaml
hparams/MEND/minigpt4.yaml

Run training:

python edit_MEND.py --model blip2 --mode train
python edit_MEND.py --model minigpt4 --mode train

Run evaluation:

python edit_MEND.py --model blip2 --mode eval
python edit_MEND.py --model minigpt4 --mode eval

Important:

--mode eval requires a trained checkpoint.
Before evaluation, update the archive field in the chosen evaluation YAML so that it points to a concrete .pt checkpoint file produced during training.
Training outputs are saved under results_dir/models/<ALG>/.

SERAC

Training configs:

hparams/TRAINING/SERAC/blip2.yaml
hparams/TRAINING/SERAC/minigpt4.yaml

Evaluation configs:

hparams/SERAC/blip2.yaml
hparams/SERAC/minigpt4.yaml

Run training:

python edit_SERAC.py --model blip2 --mode train
python edit_SERAC.py --model minigpt4 --mode train

Run evaluation:

python edit_SERAC.py --model blip2 --mode eval
python edit_SERAC.py --model minigpt4 --mode eval

Important:

--mode eval requires a trained checkpoint.
Before evaluation, update the archive field in the chosen evaluation YAML to the actual checkpoint file path.

MSCKE

Training configs:

hparams/TRAINING/MSCKE/blip2.yaml
hparams/TRAINING/MSCKE/minigpt4.yaml

Evaluation configs:

hparams/MSCKE/blip2.yaml
hparams/MSCKE/minigpt4.yaml

Run training:

python edit_MSCKE.py --model blip2 --mode train
python edit_MSCKE.py --model minigpt4 --mode train

Run evaluation:

python edit_MSCKE.py --model blip2 --mode eval
python edit_MSCKE.py --model minigpt4 --mode eval

Important:

--mode eval requires a trained checkpoint.
Before evaluation, update the archive field in the chosen evaluation YAML to the actual checkpoint file path produced during training.

Practical Notes

The current scripts read device from the selected YAML file, so change that field before running on your machine.
If you move data or checkpoints, update the corresponding paths in the YAML file instead of relying on implicit defaults.
MEND, SERAC, and MSCKE evaluation configs currently contain placeholder archive paths. They must be replaced with real checkpoint filenames.

(back to top)

This repository builds on the multimodal editing ecosystem around EasyEdit, and uses pretrained components or model implementations from LAVIS / BLIP-2, MiniGPT-4, Transformers, Sentence-Transformers, and CLIP.

We thank the authors and maintainers of these projects for making their code and models publicly available.

(back to top)

all-MiniLM-L6-v2	bert-base-uncased	distilbert-base-cased
opt-2.7b	opt-125m	vicuna-7b
blip2_pretrained_flant5xxl.pth	blip2_pretrained_opt2.7b.pth	prerained_minigpt4_7b.pth
eva_vit_g.pth	clip-vit-large-patch14

README.md

FGVEdit

🛠️ About This Project

🚀 Getting Started

Download Data

Environment Setup

Download Pre-trained Models

🧪 Usage

IKE

MEND

SERAC

MSCKE

Practical Notes

🎉 Acknowledgments