Fine-Tuned Models for "Are Decoder-Only Large Language Models the Silver Bullet for Code Search?"

November 6, 2025 ยท View on GitHub

This repository contains the fine-tuned models evaluated in our study . We provide these models to promote further research in the code search domain.

The complete Hugging Face Collection, which includes all models listed below, is available here: SYSUSELab/are-decoder-only-llms-the-silver-bullet

๐Ÿ“š Table of Contents

๐Ÿค– Hugging Face ID of TABLE III: Zero-Shot Performance on Code Search Benchmarks

These models correspond to the base models evaluated for zero-shot performance.

CategoryModelHugging Face ID
Encoder-OnlyUniXcodermicrosoft/unixcoder-base
Encoder-OnlyCodeBERTmicrosoft/codebert-base
General LLMLlama3meta-llama/Meta-Llama-3-8B
General LLMMistralmistralai/Mistral-7B-Instruct-v0.2
General LLMDeepSeekLLMdeepseek-ai/deepseek-llm-7b-chat
General LLMGemmagoogle/gemma-7b-it
General LLMLlama2meta-llama/Llama-2-7b-hf
Code LLMQwen2.5-CoderQwen/Qwen2.5-Coder-7B-Instruct
Code LLMStarCoder2bigcode/starcoder2-7b
Code LLMCodeMistraluukuguy/speechless-code-mistral-7b-v1.0
Code LLMDeepSeekCoderdeepseek-ai/deepseek-coder-6.7b-instruct
Code LLMCodeGemmagoogle/codegemma-7b-it
Code LLMCodeLlamameta-llama/CodeLlama-7b-hf

๐Ÿค– Hugging Face ID of TABLE IV: Performance of Fine-Tuned Models on Code Search Benchmarks

These models were fine-tuned using Supervised Contrastive Learning (SupCon) on the CSN dataset.

CategorySup ModelHugging Face ID
Decoder-OnlyCodeGemmaSYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN
Decoder-OnlyGemmaSYSUSELab/DCS-Gemma-7b-It-SupCon-CSN
Decoder-OnlyDeepSeekCoderSYSUSELab/DCS-DeepSeekCoder-6.7B-It-SupCon-CSN
Decoder-OnlyDeepSeekLLMSYSUSELab/DCS-DeepSeekLLM-7B-It-SupCon-CSN
Decoder-OnlyQwen2.5-CoderSYSUSELab/DCS-Qwen2.5-Coder-7B-It-SupCon-CSN
Decoder-OnlyCodeLlamaSYSUSELab/DCS-CodeLlama-7B-It-SupCon-CSN
Decoder-OnlyCodeMistralSYSUSELab/DCS-CodeMistral-7B-It-SupCon-CSN
Decoder-OnlyLlama3SYSUSELab/DCS-Llama3-8B-It-SupCon-CSN
Decoder-OnlyMistralSYSUSELab/DCS-Mistral-7B-It-SupCon-CSN
Decoder-OnlyLlama2SYSUSELab/DCS-Llama2-7B-It-SupCon-CSN

๐Ÿค– Hugging Face ID of TABLE V: Results of Different Fine-Tuning Methods

Sup ModelFinetuningHugging Face ID
CodeGemmaZero-Shotgoogle/codegemma-7b-it
CodeGemmaSimCSESYSUSELab/DCS-CodeGemma-7B-It-SimCSE
CodeGemmaSupSYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN
Llama3Zero-Shotmeta-llama/Meta-Llama-3-8B
Llama3SimCSEMcGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp-unsup-simcse
Llama3SupSYSUSELab/DCS-Llama3-8B-It-SupCon-CSN
MistralZero-Shotmistralai/Mistral-7B-v0.2
MistralSimCSEMcGill-NLP/LLM2Vec-Mistral-7B-Instruct-v2-mntp-unsup-simcse
MistralSupSYSUSELab/DCS-Mistral-7B-It-SupCon-CSN

๐Ÿค– Hugging Face ID of TABLE VI: Results of Different Fine-Tuning Datasets

Sup ModelDatasetHugging Face ID
CodeGemmaE5SYSUSELab/DCS-CodeGemma-7B-It-SupCon-E5
CodeGemmaCSNSYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN
Llama3E5McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp-supervised
Llama3CSNSYSUSELab/DCS-Llama3-8B-It-SupCon-CSN
MistralE5McGill-NLP/LLM2Vec-Mistral-7B-Instruct-v2-mntp-supervised
MistralCSNSYSUSELab/DCS-Mistral-7B-It-SupCon-CSN
ModelTraining LanguageHugging Face ID
CodegemmaRubySYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN-ruby
CodegemmaJavascriptSYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN-javascript
CodegemmaGoSYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN-go
CodegemmaPythonSYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN-python
CodegemmaJavaSYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN-java
CodegemmaPhpSYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN-php
CodegemmaMulti-languageSYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN

๐Ÿค– Hugging Face ID of TABLE VIII: Performance of Fine-Tuned CodeGemma Models by Discarding Language-Specific Data

ModelDiscard Language (SupCon)Discard Ratio (SupCon)Hugging Face ID
CodeGemmaJava0SYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN
CodeGemmaJava0.2SYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN-java-discard-0.2
CodeGemmaJava0.5SYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN-java-discard-0.5
CodeGemmaJava0.8SYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN-java-discard-0.8
CodeGemmaJava1SYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN-java-discard-1.0
CodeGemmaRuby0SYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN
CodeGemmaRuby0.2SYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN-ruby-discard-0.2
CodeGemmaRuby0.5SYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN-ruby-discard-0.5
CodeGemmaRuby0.8SYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN-ruby-discard-0.8
CodeGemmaRuby1SYSUSELab/DCS-CodeGemma-7B-It-SupCon-CSN-ruby-discard-1.0

๐Ÿค– Hugging Face ID of TABLE IX: Results of Different Model Sizes

Sup ModelSizeHugging Face ID
Llama21.3BSYSUSELab/DCS-Llama2-1.3B-It-SupCon-CSN
Llama27BSYSUSELab/DCS-Llama2-7B-It-SupCon-CSN
Llama213BSYSUSELab/DCS-Llama2-13B-It-SupCon-CSN
Qwen2.5-Coder0.5BSYSUSELab/DCS-Qwen2.5-Coder-0.5B-It-SupCon-CSN
Qwen2.5-Coder1.5BSYSUSELab/DCS-Qwen2.5-Coder-1.5B-It-SupCon-CSN
Qwen2.5-Coder3BSYSUSELab/DCS-Qwen2.5-Coder-3B-It-SupCon-CSN
Qwen2.5-Coder7BSYSUSELab/DCS-Qwen2.5-Coder-7B-It-SupCon-CSN
Qwen2.5-Coder14BSYSUSELab/DCS-Qwen2.5-Coder-14B-It-SupCon-CSN

๐ŸŽ“ How to Cite If you use our models or findings in your research, please cite our paper:

@article{chen2024decoder,
  title={Are Decoder-Only Large Language Models the Silver Bullet for Code Search?},
  author={Chen, Yuxuan and Liu, Mingwei and Ou, Guangsheng and Li, Anji and Dai, Dekun and Wang, Yanlin and Zheng, Zibin},
  journal={arXiv preprint arXiv:2410.22240},
  year={2024}
}