EAMET-massive-editing

February 2, 2026

Official code implementation of "EAMET: Robust Massive Model Editing via Embedding Alignment Optimization" (https://arxiv.org/abs/2505.11876).

Requirements

At least one NVIDIA GPU with 80GB of memory.

The main configuration is done through general.sh. Here are the key parameters you can customize:

Algorithm Selection (alg_name):
- EAMET (default)
- MEMIT
- PMET
- ROME
- FT
- MEND
- ALPHAEDIT
Model Selection (model_name):
- NousResearch/Llama-2-7b-hf (default)
- meta-llama/Llama-3.1-8B
- NousResearch/Llama-2-13b-hf
- tiiuae/falcon-7b
- deepseek-ai/deepseek-llm-7b-base
- Qwen/Qwen2.5-7B
- google/gemma-7b-it
- microsoft/phi-1_5
Dataset Selection (ds_name):
- counterfact (default)
- zsre
- wikirecent
Hyperparameters:
- Choose the hparams_fname that matches your model.
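A typo in one of these values is cheap to catch before launching a run on an 80GB GPU. The following is a minimal sketch of such a check, not part of the repository: the `SUPPORTED` table is copied from the lists above, while the `check_selection` helper is hypothetical, and it assumes the values are matched case-sensitively.

```python
# Allowed values copied from the lists above.
# The helper below is illustrative and is not part of the EAMET code base.
SUPPORTED = {
    "alg_name": ["EAMET", "MEMIT", "PMET", "ROME", "FT", "MEND", "ALPHAEDIT"],
    "model_name": [
        "NousResearch/Llama-2-7b-hf",
        "meta-llama/Llama-3.1-8B",
        "NousResearch/Llama-2-13b-hf",
        "tiiuae/falcon-7b",
        "deepseek-ai/deepseek-llm-7b-base",
        "Qwen/Qwen2.5-7B",
        "google/gemma-7b-it",
        "microsoft/phi-1_5",
    ],
    "ds_name": ["counterfact", "zsre", "wikirecent"],
}


def check_selection(**chosen):
    """Return a list of problems; an empty list means every value is listed."""
    problems = []
    for key, value in chosen.items():
        allowed = SUPPORTED.get(key)
        if allowed is None:
            problems.append(f"unknown parameter: {key}")
        elif value not in allowed:
            # Assumption: exact, case-sensitive match against the README lists.
            problems.append(f"{key}={value!r} is not one of {allowed}")
    return problems


if __name__ == "__main__":
    for problem in check_selection(alg_name="EAMET", ds_name="zsre"):
        print(problem)
```

An empty return value means the chosen combination uses only values listed in this README. It does not verify that a matching hyperparameter file exists for the model.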
GLEU Benchmark Evaluation: To evaluate the edited models using the GLEU benchmark, modify the evaluation command in general.sh:
```
- python -m experiments.evaluate \
+ python -m experiments.evaluate_gleu \
```
The GLEU benchmark provides additional metrics for assessing the general ability of edited models.

Additional Options:
- Set dataset_size_limit to control the number of editing tasks (default: 10000).
- Pass the --use_cache flag to cache KV pairs if needed.
- Adjust assigned_prefix_len for evaluation (default: 5).
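To make the relationship between these parameters concrete, here is a hedged sketch of how an evaluation command could be assembled from them. It is not a copy of general.sh. Only the two module names and the `--use_cache` flag come from this README; the `--name=value` form of every other flag is an assumption, and the `build_command` helper and the `my_model.json` filename are hypothetical.

```python
import shlex


def build_command(hparams_fname, alg_name="EAMET",
                  model_name="NousResearch/Llama-2-7b-hf",
                  ds_name="counterfact", dataset_size_limit=10000,
                  assigned_prefix_len=5, use_cache=False, gleu=False):
    """Return the evaluation command as a list of arguments."""
    # Module names are taken from the README; gleu=True selects the GLEU benchmark.
    module = "experiments.evaluate_gleu" if gleu else "experiments.evaluate"
    args = [
        "python", "-m", module,
        # Assumption: each parameter is passed as a flag of the same name.
        f"--alg_name={alg_name}",
        f"--model_name={model_name}",
        f"--hparams_fname={hparams_fname}",
        f"--ds_name={ds_name}",
        f"--dataset_size_limit={dataset_size_limit}",
        f"--assigned_prefix_len={assigned_prefix_len}",
    ]
    if use_cache:
        # --use_cache is the one flag spelled out in the README.
        args.append("--use_cache")
    return args


if __name__ == "__main__":
    # "my_model.json" is a placeholder, not a file shipped with the repository.
    print(shlex.join(build_command("my_model.json", use_cache=True)))
```

Returning a list rather than a string keeps each argument separate, so the result can be passed directly to `subprocess.run` without shell quoting issues. Check the actual flag names in general.sh before relying on this.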

Results are saved under ./results/, in the directory named by the out_name you choose.