NeuroAda: Activating Each Neuron's Potential for Parameter-Efficient Fine-Tuning ๐Ÿš€

November 5, 2025 ยท View on GitHub

Paper License: MIT

This is the official repository for our EMNLP 2025 paper: NeuroAda: Activating Each Neuron's Potential for Parameter-Efficient Fine-Tuning

Table of Contents

Installation

Step-by-Step Installation

  1. Clone the repository:

    git clone https://github.com/FightingFighting/NeuroAda.git
    cd NeuroAda
    
  2. Create and activate the conda environment:

    conda env create -f environment.yml
    conda activate peft
    

Dataset Preparation

Option 1: Use Provided Datasets

The repository includes datasets in the dataset/ folder. These are identical to those used in LLM-Adapters and LoReFT. Download them and unzip them in the folder.

Option 2: Download Original Datasets

You can download the original datasets from:

Training

Quick Start

For a basic training run:

python train_our.py \
   -task commonsense \
   -data_dir dataset \
   -model yahma/llama-7b-hf \
   -seed 42 \
   -e 3 \
   -lr 7e-4 \
   -batch_size 16 \
   --micro_batch_size 16 \
   -eval_batch_size 16 \
   --test_split test \
   --greedy_decoding \
   --warmup_ratio 0.06 \
   --weight_decay 0 \
   --wandb_project=xxx \
   --wandb_entity=xxx \
   --wandb_watch all \
   --times_num 20 \
   --peft_type perCell_mag_add \
   --max_length 512 \
   --target_modules q_proj k_proj v_proj o_proj gate_proj up_proj down_proj

Using Pre-configured Scripts

We provide pre-configured training scripts for different tasks and the trainable parameters budget:

Commonsense Reasoning

# LLaMA-7B on commonsense tasks with top-20 parameters
bash scripts/percell/perCell_mag_add/LLaMA-7B/top20/commonsense.sh

# LLaMA-7B on commonsense tasks with top-1 parameters
bash scripts/percell/perCell_mag_add/LLaMA-7B/top1/commonsense.sh

Arithmetic Reasoning

# LLaMA-7B on commonsense tasks with top-20 parameters
bash scripts/percell/perCell_mag_add/LLaMA-7B/top20/math.sh

# LLaMA-7B on commonsense tasks with top-1 parameters
bash scripts/percell/perCell_mag_add/LLaMA-7B/top1/math.sh

Training Parameters

ParameterDescriptionOptions
-taskTask typecommonsense, math
-modelBase model pathyahma/llama-7b-hf, yahma/llama-13b-hf, meta-llama/Llama-2-7b-hf,meta-llama/Meta-Llama-3-8B
--peft_typePEFT methodperCell_mag_add
--target_modulesTarget modules for selecting parametersSee below
--times_numTop-K input connection for each neuron1, 5, 10, 20, etc.
-eNumber of epochs-

Target Modules

Common target modules for different models:

  • LLaMA/LLaMA2/LLaMA3: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
  • Custom selection: You can specify any subset of these modules

Results

We provide Weights & Biases links to present our results reported in the paper. Below are the results on commonsense and arithmetic reasoning tasks.

๐Ÿง  Commonsense Reasoning Results

๐Ÿ—๏ธ Base Modelโš™๏ธ Params (%)๐Ÿงฉ BoolQ๐Ÿ’ก PIQA๐Ÿค” SIQA๐Ÿ“– HellaS.๐Ÿง WinoG.๐Ÿงฎ ARC-e๐Ÿง  ARC-c๐Ÿ“š OBQA๐ŸŒŸ Avg.
LLaMA (7B)0.404%73.185.480.994.384.387.871.784.282.7
LLaMA (7B)0.020%69.683.680.592.381.184.068.184.080.0
LLaMA (13B)0.327%73.387.982.796.086.990.277.188.685.3
LLaMA (13B)0.016%73.086.482.294.584.087.674.586.083.5
Llama2 (7B)0.404%73.686.581.194.887.889.175.985.684.3
Llama2 (7B)0.020%71.482.879.893.384.085.470.181.281.0
Llama3 (8B)0.343%75.089.383.096.589.293.082.989.687.3
Llama3 (8B)0.017%71.784.981.493.985.488.877.083.883.4

โž— Arithmetic Reasoning Results

๐Ÿ—๏ธ Base Modelโš™๏ธ Params (%)๐Ÿ”ข MulAri๐Ÿ“š GSM8Kโž• AddSub๐Ÿ’ง AQuA๐Ÿงฎ SinEq๐Ÿ“Š SVAMP๐Ÿ“˜ MAWPS๐ŸŒŸ Avg.
LLaMA (7B)0.404%96.036.592.422.094.153.284.568.4
LLaMA (7B)0.020%89.030.387.122.883.748.977.762.8
LLaMA (13B)0.327%97.543.992.221.793.961.489.171.4
LLaMA (13B)0.016%94.543.088.625.690.456.783.668.9
LLaMA2 (7B)0.404%97.839.891.920.596.354.289.570.0
LLaMA2 (7B)0.020%90.836.188.422.887.652.182.465.7
Llama3 (8B)0.343%99.747.892.727.695.760.488.773.2
Llama3 (8B)0.017%97.263.791.926.492.975.088.776.5

Citation

@inproceedings{zhang-etal-2025-neuroada,
    title = "{N}euro{A}da: Activating Each Neuron{'}s Potential for Parameter-Efficient Fine-Tuning",
    author = "Zhang, Zhi  and
      Shen, Yixian  and
      Cao, Congfeng  and
      Shutova, Ekaterina",
    editor = "Christodoulopoulos, Christos  and
      Chakraborty, Tanmoy  and
      Rose, Carolyn  and
      Peng, Violet",
    booktitle = "Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2025",
    address = "Suzhou, China",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2025.emnlp-main.555/",
    pages = "10960--10977",
    ISBN = "979-8-89176-332-6"
}

Acknowledgements

Our code is based on LLM-Adapters and LoReFT. We thank the authors for their valuable contributions to the open-source community.

License

This project is licensed under the MIT License - see the LICENSE file for details.