๐Ÿš€ CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models ๐Ÿ›ก๏ธ

January 7, 2025 ยท View on GitHub

Arxiv:CySecBench

License

The largest and most comprehensive Generative AI-based CyberSecurity-focused Dataset for Benchmarking Large Language Models

๐ŸŒŸ Overview

The CySecBench paper offers:

  • ๐ŸŽฏ A cutting-edge dataset of 12662 prompts tailored to cybersecurity challenges.
  • ๐Ÿง  Novel jailbreaking methods leveraging prompt obfuscation and refinement.
  • ๐Ÿ“Š Comprehensive performance evaluation of LLMs like ChatGPT, Claude, and Gemini.

Why CySecBench?

Existing datasets are too broad and often lack focus on cybersecurity. CySecBench fills this gap by providing domain-specific prompts organized into 10 categories, enabling a precise evaluation of LLM security mechanisms.

๐Ÿ“„ Access the Paper

You can download the full research paper here: CySecBench (PDF)


โœจ Features

๐Ÿ—‚๏ธ Dataset


๐Ÿ—‚๏ธ Repository Structure

/
โ”œโ”€โ”€ Code/
โ”‚   โ”œโ”€โ”€ dataset_generation.py
โ”‚   โ”œโ”€โ”€ keywords.txt
โ”œโ”€โ”€ Dataset/
โ”‚   โ”œโ”€โ”€ Category sets/
โ”‚   โ”‚   โ”œโ”€โ”€ cysecbench-cloud-attacks.csv
โ”‚   โ”‚   โ”œโ”€โ”€ cysecbench-control-system-attacks.csv
โ”‚   โ”‚   โ”œโ”€โ”€ cysecbench-cryptographic-attacks.csv
โ”‚   โ”‚   โ”œโ”€โ”€ cysecbench-evasion-techniques.csv
โ”‚   โ”‚   โ”œโ”€โ”€ cysecbench-hardware-attacks.csv
โ”‚   โ”‚   โ”œโ”€โ”€ cysecbench-intrusion-techniques.csv
โ”‚   โ”‚   โ”œโ”€โ”€ cysecbench-iot-attacks.csv
โ”‚   โ”‚   โ”œโ”€โ”€ cysecbench-malware-attacks.csv
โ”‚   โ”‚   โ”œโ”€โ”€ cysecbench-network-attacks.csv
โ”‚   โ”‚   โ”œโ”€โ”€ cysecbench-web-application-attacks.csv
โ”‚   โ”œโ”€โ”€ Full dataset/
โ”‚   โ”‚   โ”œโ”€โ”€ cysecbench.csv
โ”‚   โ”œโ”€โ”€ Sample sets/
โ”‚       โ”œโ”€โ”€ cysecbench-500.csv
โ”‚       โ”œโ”€โ”€ cysecbench-2000.csv
โ”‚       โ”œโ”€โ”€ cysecbench-6000.csv

๐Ÿš€ Getting Started

โš™๏ธ Prerequisites

  • ๐Ÿ Python 3.8+
  • ๐Ÿ“ฆ Required libraries: openai (only for dataset generation)

๐Ÿ“Š Results using CySecBench

๐ŸŽฏ Evaluation Metrics

  • โœ… Success Rate (SR): Percentage of prompts bypassing ethical guidelines.
  • ๐Ÿ“ˆ Average Rating (AR): Degree of harmfulness in LLM responses (on a scale of 1-5, where 5 is the most harmful).

โšก Jailbreaking Performance

LLMSuccess Rate (SR)Average Rating (AR)
๐Ÿค– Claude17.4%2.00
๐Ÿค– ChatGPT65.4%4.06
๐Ÿค– Gemini88.4%4.77

๐Ÿ“œ Citation

If you use CySecBench, please cite:

@article{CySecBench2024,
	title        = {{CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models}},
	author       = {Johan Wahrรฉus and Ahmed Mohamed Hussain and Panos Papadimitratos},
	year         = {2025},
	journal      = {arXiv preprint arXiv:2501.01335},
	url          = {https://arxiv.org/abs/2501.01335}
}

โญ Star This Repository!

If you found CySecBench helpful or interesting, please give this repository a star โญ to show your support!


๐Ÿ”’ License

This project is licensed under the MIT License. See the LICENSE file for details.