โœจ Awesome Issue Resolution

April 22, 2026 ยท View on GitHub

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering A Comprehensive Survey

GitHub Stars Forks Awesome Paper arXiv Hugging Face Tables Contributors Papers Count

๐Ÿ“– Documentation Website | ๐Ÿ“„ Full Paper | ๐Ÿ“‹ Tables & Resources

๐ŸŽ™๏ธ Interactive Exploration:

NotebookLM Discord Issues

Awesome Issue Resolution

๐Ÿ“– Abstract

Based on a systematic review of 217 papers and online resources, this survey establishes a holistic theoretical framework for Issue Resolution in software engineering. We examine how Large Language Models (LLMs) are transforming the automation of GitHub issue resolution. Beyond the theoretical analysis, we have curated a comprehensive collection of datasets and model training resources, which are continuously synchronized with our GitHub repository and project documentation website.

๐Ÿ“ฐ News

We continuously update the paper list in real time. If your work is unintentionally missing, please let us know by opening an issue. You are also welcome to cite Cite our survey our survey to support this project.

Recent Papers

  • AgentForge: AgentForge: Execution-Grounded Multi-Agent LLM Framework for Autonomous Software Engineering arXiv GitHub
  • Agent-CoEvo: Beyond Fixed Tests: Repository-Level Issue Resolution as Coevolution of Code and Behavioral Constraints arXiv
  • SWE-Shield: Does Pass Rate Tell the Whole Story? Evaluating Design Constraint Compliance in LLM-based Issue Resolution arXiv
  • From SWE-ZERO to SWE-HERO: From SWE-ZERO to SWE-HERO: Execution-free to Execution-based Fine-tuning for Software Engineering Agents arXiv HuggingFace Website
  • GALA: GALA: Multimodal Graph Alignment for Bug Localization in Automated Program Repair arXiv
  • REAgent: REAgent: Requirement-Driven LLM Agents for Software Issue Resolution arXiv
  • RTMC: RTMC: Step-Level Credit Assignment via Rollout Trees arXiv
  • SWE-AGILE: SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context arXiv GitHub HuggingFace
  • BeyondSWE: BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing? arXiv GitHub HuggingFace Website
  • SWE-Adept: SWE-Adept: An LLM-Based Agentic Framework for Deep Codebase Analysis and Structured Issue Resolution arXiv

Recent Updates

  • Survey Update (2026-03): Our repository now covers 200+ papers on LLM-based GitHub issue resolution!
  • Survey Update (2026-03): Added 21 new papers covering the latest advances in LLM-based issue resolution!
  • Survey Launch (2026-01): Our survey paper is now publicly available on arXiv: https://arxiv.org/abs/2601.11655. It covers 175 papers and resources on LLM-based GitHub issue resolution, with continuously updated datasets and leaderboards!

๐Ÿ” Explore This Survey:

๐Ÿงญ Survey Exploration Skill: We provide a local skill at ./skill/survey-repo-explorer to extract categorized paper links from survey READMEs and build a structured knowledgebase under ./survey2knowledgebase for agent-assisted literature exploration.

๐Ÿ”Ž Browse & Export: The full paper database is searchable and exportable at deepsoftwareanalytics.github.io/Awesome-Issue-Resolution/admin/ โ€” filter by category, date, or keyword, and export results as CSV.

Admin Interface Demo

๐Ÿ“š Complete Paper List

Total: 217 works across 14 categories

๐Ÿ“Š Evaluation Datasets

Benchmarks for evaluating issue resolution systems

  • (2026-04) SWE-Shield: Does Pass Rate Tell the Whole Story? Evaluating Design Constraint Compliance in LLM-based Issue Resolution arXiv
  • (2026-03) BeyondSWE: BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing? arXiv Website GitHub HuggingFace
  • (2026-03) SWE-CI: SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration arXiv GitHub HuggingFace
  • (2026-03) SWE-Atlas Website
  • (2026-03) SWE-Skills-Bench: SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering? arXiv GitHub
  • (2026-03) MobileDev-Bench: MobileDev-Bench: A Comprehensive Benchmark for Evaluating Language Models on Mobile Application Development arXiv
  • (2026-02) SWE Context Bench: SWE Context Bench: A Benchmark for Context Learning in Coding arXiv
  • (2026-02) SWE-ABS: SWE-ABS: Adversarial Benchmark Strengthening Exposes Inflated Success Rates on Test-based Benchmark arXiv
  • (2026-02) Rust-SWE-bench: Evaluating and Improving Automated Repository-Level Rust Issue Resolution with LLM-based Agents arXiv GitHub
  • (2026-02) SWE-Bench Mobile: SWE-Bench Mobile: Can Large Language Model Agents Develop Industry-Level Mobile Applications? arXiv Website
  • (2025-12) SWE-InfraBench: SWE-InfraBench: Evaluating Language Models on Cloud Infrastructure Code OpenReview
  • (2025-12) SWE-EVO: SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios arXiv
  • (2025-11) SWE-Sharp-Bench: SWE-Sharp-Bench: A Reproducible Benchmark for C# Software Engineering Tasks arXiv
  • (2025-11) SWE-fficiency: SWE-fficiency: Can Language Models Optimize Real-World Repositories on Real Workloads? arXiv
  • (2025-11) SWE-Compass: SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models arXiv
  • (2025-09) SWE-Bench Pro: SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks? arXiv
  • (2025-07) SWE-Perf: SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories? arXiv OpenReview
  • (2025-05) SwingArena: SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving arXiv
  • (2025-05) OmniGIRL: Omnigirl: A multilingual and multimodal benchmark for github issue resolution arXiv
  • (2025-05) SWE-bench-Live: SWE-bench Goes Live! arXiv OpenReview
  • (2025-04) Multi-SWE-bench: Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving arXiv OpenReview
  • (2025-04) SWE-PolyBench: SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents arXiv
  • (2025-04) SWE-bench Multilingual: SWE-smith: Scaling Data for Software Engineering Agents arXiv OpenReview
  • (2025-03) FEA-Bench: FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation arXiv
  • (2025-03) SetUpAgent, SWEE-bench, SWA-bench: Automated Benchmark Generation for Repository-Level Coding Tasks arXiv
  • (2025-02) SWE-Lancer: SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering? arXiv
  • (2024-12) Visual SWE-bench: CodeV: Issue Resolving with Visual Data arXiv DOI
  • (2024-10) SWE-bench Multimodal: SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains? arXiv OpenReview
  • (2024-08) SWE-bench-java: SWE-bench-java: A GitHub Issue Resolving Benchmark for Java arXiv

๐ŸŽฏ Training Datasets

Datasets for training issue resolution agents

  • (2026-04) From SWE-ZERO to SWE-HERO: From SWE-ZERO to SWE-HERO: Execution-free to Execution-based Fine-tuning for Software Engineering Agents arXiv Website HuggingFace
  • (2026-03) OpenSWE: daVinci-Env: Open SWE Environment Synthesis at Scale arXiv GitHub HuggingFace
  • (2026-02) SWE-Universe: SWE-Universe: Scale Real-World Verifiable Environments to Millions arXiv
  • (2026-02) SWE-rebench V2: SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale arXiv
  • (2026-02) Scale-SWE: Immersion in the GitHub Universe: Scaling Coding Agents to Mastery arXiv GitHub HuggingFace
  • (2026-01) daVinci-Dev: daVinci-Dev: Agent-native Mid-training for Software Engineering arXiv GitHub HuggingFace
  • (2025-06) Skywork-SWE: Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs arXiv
  • (2025-05) SWELoc: SweRank: Software Issue Localization with Code Ranking arXiv
  • (2025-04) Multi-SWE-RL: Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving arXiv OpenReview
  • (2025-04) SWE-Smith: SWE-smith: Scaling Data for Software Engineering Agents arXiv OpenReview
  • (2025-02) LocAgent: OrcaLoca: An LLM Agent Framework for Software Issue Localization arXiv OpenReview
  • (2025-01) SWE-Fixer: SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution arXiv
  • (2023-10) SWE-bench-extra: SWE-bench: Can Language Models Resolve Real-world Github Issues? arXiv

๐Ÿ“ฅ Data Collection Methods

Techniques for collecting training data

  • (2026-03) OpenSWE: daVinci-Env: Open SWE Environment Synthesis at Scale arXiv GitHub HuggingFace
  • (2026-03) SWE-Next: SWE-Next: Scalable Real-World Software Engineering Tasks for Agents arXiv GitHub
  • (2026-03) RepoLaunch: RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform arXiv
  • (2026-02) DockSmith: DockSmith: Scaling Reliable Coding Environments via an Agentic Docker Builder arXiv HuggingFace
  • (2026-02) SWE-rebench V2: SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale arXiv
  • (2026-02) Scale-SWE: Immersion in the GitHub Universe: Scaling Coding Agents to Mastery arXiv GitHub HuggingFace
  • (2026-01) MEnvAgent: MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering arXiv GitHub
  • (2025-12) Multi-Docker-Eval: Multi-Docker-Eval: A 'Shovel of the Gold Rush' Benchmark on Automatic Environment Building for Software Engineering arXiv
  • (2025-08) RepoForge: RepoForge: Training a SOTA Fast-thinking SWE Agent with an End-to-End Data Curation Pipeline Synergizing SFT and RL at Scale arXiv
  • (2025-07) SWE-MERA: SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks arXiv
  • (2025-06) SWE-Factory: SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks arXiv
  • (2025-05) SWE-rebench: SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents arXiv OpenReview
  • (2025-03) SetUpAgent, SWEE-bench, SWA-bench: Automated Benchmark Generation for Repository-Level Coding Tasks arXiv

๐Ÿ”ฌ Data Synthesis Methods

Approaches for synthetic data generation

  • (2026-02) SWE-World: SWE-World: Building Software Engineering Agents in Docker-Free Environments arXiv GitHub
  • (2026-02) SWE-Hub: SWE-Hub: A Unified Production System for Scalable, Executable Software Engineering Tasks arXiv
  • (2025-09) SWE-Mirror: SWE-Mirror: Scaling Issue-Resolving Datasets by Mirroring Issues Across Repositories arXiv
  • (2025-06) SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner arXiv OpenReview
  • (2025-04) R2E-Gym: R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents arXiv OpenReview
  • (2025-04) SWE-Synth: SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs arXiv
  • (2025-04) SWE-smith: SWE-smith: Scaling Data for Software Engineering Agents arXiv OpenReview
  • (2025-01) Learn-by-interact: Learn-by-interact: A Data-Centric Framework For Self-Adaptive Agents in Realistic Environments arXiv OpenReview

๐Ÿค– Single-Agent Systems

Individual autonomous agents for issue resolution

  • (2026-04) REAgent: REAgent: Requirement-Driven LLM Agents for Software Issue Resolution arXiv
  • (2025-12) Confucius Code Agent: Confucius Code Agent: Scalable Agent Scaffolding for Real-World Codebases arXiv
  • (2025-10) TOM-SWE: TOM-SWE: User Mental Modeling For Software Engineering Agents arXiv
  • (2025-09) Lita: Lita: Light Agent Uncovers the Agentic Coding Capabilities of LLMs arXiv
  • (2025-08) Live-SWE-agent: SE-Agent: Self-Evolution Trajectory Optimization in Multi-Step Reasoning with LLM-Based Agents arXiv OpenReview
  • (2025-07) Trae Agent: Trae Agent: An LLM-based Agent for Software Engineering with Test-time Scaling arXiv
  • (2025-05) LCLM: Putting It All into Context: Simplifying Agents with LCLMs arXiv
  • (2025-02) PatchPilot: PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification arXiv OpenReview
  • (2024-05) SWE-agent: Swe-agent: Agent-computer interfaces enable automated software engineering arXiv
  • (2024-03) Devin: SWE-bench technical report Website
  • (2023-06) Aider Website GitHub

๐Ÿ‘ฅ Multi-Agent Systems

Collaborative multi-agent frameworks

  • (2026-04) AgentForge: AgentForge: Execution-Grounded Multi-Agent LLM Framework for Autonomous Software Engineering arXiv GitHub
  • (2026-04) Agent-CoEvo: Beyond Fixed Tests: Repository-Level Issue Resolution as Coevolution of Code and Behavioral Constraints arXiv
  • (2026-03) SWE-Adept: SWE-Adept: An LLM-Based Agentic Framework for Deep Codebase Analysis and Structured Issue Resolution arXiv
  • (2025-08) Meta-RAG: Meta-RAG on Large Codebases Using Code Summarization arXiv
  • (2025-07) SWE-Debate: SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution arXiv
  • (2025-06) AgentScope: SWE-Bench - AgentScope Website
  • (2025-05) Devlo: Achieving SOTA on SWE-bench Website
  • (2025-05) Refact.ai Agent: AI Coding Agent for Software Development - Refact.ai Website
  • (2025-03) Lingxi: Lingxi/docs/Lingxi Technical Report 2505.pdf at master ยท lingxi-agent/Lingxi GitHub
  • (2025-02) OrcaLora: OrcaLoca: An LLM Agent Framework for Software Issue Localization arXiv OpenReview
  • (2025-01) CodeCoR: CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation arXiv
  • (2024-09) MarsCode Agent: MarsCode Agent: AI-native Automated Bug Fixing arXiv
  • (2024-09) HyperAgent: HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale arXiv
  • (2024-08) DEI: Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents arXiv OpenReview
  • (2024-07) OpenHands: OpenHands: An Open Platform for AI Software Developers as Generalist Agents arXiv OpenReview
  • (2024-06) CodeR: CodeR: Issue Resolving with Multi-Agent and Task Graphs arXiv
  • (2024-04) AutoCodeRover: AutoCodeRover: Autonomous Program Improvement arXiv DOI
  • (2024-03) MAGIS: MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution arXiv OpenReview

๐Ÿ”„ Workflow-Based Methods

Structured pipeline approaches

  • (2025-07) SynFix: SynFix: Dependency-Aware Program Repair via RelationGraph Analysis DOI Website
  • (2025-06) GUIRepair: Seeing is Fixing: Cross-Modal Reasoning with Multimodal LLMs for Visual Software Issue Fixing arXiv
  • (2024-12) CodeV: CodeV: Issue Resolving with Visual Data arXiv DOI
  • (2024-10) Conversational Pipeline: Exploring the Potential of Conversational Test Suite Based Program Repair on SWE-bench arXiv
  • (2024-07) Agentless: Demystifying LLM-Based Software Engineering Agents arXiv Website

๐Ÿ› ๏ธ Tool-Augmented Methods

Methods leveraging external tools

  • (2026-04) AgentForge: AgentForge: Execution-Grounded Multi-Agent LLM Framework for Autonomous Software Engineering arXiv GitHub
  • (2026-04) GALA: GALA: Multimodal Graph Alignment for Bug Localization in Automated Program Repair arXiv
  • (2026-03) SWE-Adept: SWE-Adept: An LLM-Based Agentic Framework for Deep Codebase Analysis and Structured Issue Resolution arXiv
  • (2026-03) RepoRepair: RepoRepair: Leveraging Code Documentation for Repository-Level Automated Program Repair arXiv GitHub
  • (2026-02) Closing the Loop: Closing the Loop: Universal Repository Representation with RPG-Encoder arXiv Website GitHub
  • (2026-01) SWE-Tester: SWE-Tester: Training Open-Source LLMs for Issue Reproduction in Real-World Repositories arXiv
  • (2025-12) GraphLocator: GraphLocator: Graph-guided Causal Reasoning for Issue Localization arXiv
  • (2025-11) InfCode: InfCode: Adversarial Iterative Refinement of Tests and Patches for Reliable Software Issue Resolution arXiv
  • (2025-10) BugPilot: BugPilot: Complex Bug Generation for Efficient Learning of SWE Skills arXiv
  • (2025-10) TestPrune: When Old Meets New: Evaluating the Impact of Regression Tests on SWE Issue Resolution arXiv
  • (2025-09) Nemotron-CORTEXA: Nemotron-CORTEXA: Enhancing LLM Agents for Software Engineering Tasks via Improved Localization and Solution Diversity OpenReview Website
  • (2025-08) Git Context Controller: Git Context Controller: Manage the Context of LLM-based Agents like Git arXiv
  • (2025-07) Prometheus: Prometheus: Unified Knowledge Graphs for Issue Resolution in Multilingual Codebases arXiv
  • (2025-06) SACL: SACL: Understanding and Combating Textual Bias in Code Retrieval with Semantic-Augmented Reranking and Localization arXiv
  • (2025-06) OpenHands-Versa: Coding Agents with Multimodal Browsing are Generalist Problem Solvers arXiv
  • (2025-06) SemAgent: SemAgent: A Semantics Aware Program Repair Agent arXiv
  • (2025-06) Repeton: Repeton: Structured Bug Repair with ReAct-Guided Patch-and-Test Cycles arXiv
  • (2025-06) cAST: cAST: Enhancing Code Retrieval-Augmented Generation with Structural Chunking via Abstract Syntax Tree arXiv
  • (2025-05) InfantAgent-Next: InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction arXiv
  • (2025-05) SWERank: SweRank: Software Issue Localization with Code Ranking arXiv
  • (2025-03) DARS: DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal arXiv
  • (2025-03) Issue2Test: Issue2Test: Generating Reproducing Test Cases from Issue Reports arXiv
  • (2025-03) KGCompass: Enhancing repository-level software repair via repository-aware knowledge graphs arXiv
  • (2025-03) CoSIL: Issue Localization via LLM-Driven Iterative Code Graph Searching arXiv
  • (2025-02) OrcaLoca: OrcaLoca: An LLM Agent Framework for Software Issue Localization arXiv OpenReview
  • (2025-02) Otter: Otter: Generating Tests from Issues to Validate SWE Patches arXiv OpenReview
  • (2025-02) Quadropic Insiders: Quadropic Insiders : Syntheo Tops Swelite Feb Website
  • (2024-12) CoRNStack: CoRNStack: High-Quality Contrastive Data for Better Code Retrieval and Reranking arXiv OpenReview
  • (2024-11) AEGIS: AEGIS: An Agent-based Framework for General Bug Reproduction from Issue Descriptions arXiv
  • (2024-10) RepoGraph: RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph arXiv
  • (2024-09) SuperCoder2.0: SuperCoder2.0: Technical Report on Exploring the feasibility of LLMs as Autonomous Programmer arXiv
  • (2024-08) SpecRover: SpecRover: Code Intent Extraction via LLMs arXiv
  • (2024-06) Alibaba LingmaAgent: Alibaba LingmaAgent: Improving Automated Issue Resolution via Comprehensive Repository Exploration arXiv DOI

๐Ÿง  Memory-Enhanced Methods

Systems with memory mechanisms

  • (2026-01) MemGovern: MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences arXiv
  • (2025-10) RepoMem: Improving Code Localization with Repository Memory arXiv
  • (2025-09) AgentDiet: Improving the Efficiency of LLM Agent Systems through Trajectory Reduction arXiv
  • (2025-07) Agent KB: Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving arXiv
  • (2025-07) SWE-Exp: SWE-Exp: Experience-Driven Software Issue Resolution arXiv
  • (2025-06) ExpeRepair: EXPEREPAIR: Dual-Memory Enhanced LLM-based Repository-Level Program Repair arXiv
  • (2025-05) DGM: Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents arXiv
  • (2024-11) Infant Agent: Infant Agent: A Tool-Integrated, Logic-Driven Agent with Cost-Effective API Usage arXiv
  • (2024-11) EvoCoder: LLMs as Continuous Learners: Improving the Reproduction of Defective Code in Software Issues arXiv

๐Ÿ“š Supervised Fine-Tuning (SFT)

Models trained via supervised learning

  • (2026-04) SWE-AGILE: SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context arXiv GitHub HuggingFace
  • (2026-04) From SWE-ZERO to SWE-HERO: From SWE-ZERO to SWE-HERO: Execution-free to Execution-based Fine-tuning for Software Engineering Agents arXiv Website HuggingFace
  • (2026-03) OpenSWE: daVinci-Env: Open SWE Environment Synthesis at Scale arXiv GitHub HuggingFace
  • (2026-02) Scale-SWE: Immersion in the GitHub Universe: Scaling Coding Agents to Mastery arXiv GitHub HuggingFace
  • (2026-01) SWE-Lego: SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving arXiv
  • (2026-01) SWE-Replay: SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents arXiv
  • (2025-12) SWE-Compressor: Context as a Tool: Context Management for Long-Horizon SWE-Agents arXiv
  • (2025-09) Devstral: Devstral: Fine-tuning Language Models for Coding Agent Applications arXiv
  • (2025-06) MCTS-Refined CoT: MCTS-Refined CoT: High-Quality Fine-Tuning Data for LLM-Based Repository Issue Resolution arXiv
  • (2025-05) Search for training: Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents arXiv
  • (2025-05) Co-PatcheR: Co-PatcheR: Collaborative Software Patching with Component(s)-specific Small Reasoning Models arXiv
  • (2025-05) CGM: Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks arXiv GitHub HuggingFace
  • (2025-03) Thinking Longer: Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute arXiv
  • (2024-12) ReSAT: Repository Structure-Aware Training Makes SLMs Better Issue Resolver arXiv
  • (2024-12) Scaling data collection: Scaling Data Collection for Training SWE Agents Website
  • (2024-12) SWE-Gym: Training Software Engineering Agents and Verifiers with SWE-Gym arXiv
  • (2024-11) Lingma SWE-GPT: SWE-GPT: A Process-Centric Language Model for Automated Software Improvement arXiv DOI GitHub
  • (2024-11) CodeXEmbed: CodeXEmbed: A Generalist Embedding Model Family for Multilingual and Multi-task Code Retrieval arXiv OpenReview

๐ŸŽฎ Reinforcement Learning (RL)

Models trained via reinforcement learning

  • (2026-04) SWE-AGILE: SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context arXiv GitHub HuggingFace
  • (2026-04) RTMC: RTMC: Step-Level Credit Assignment via Rollout Trees arXiv
  • (2026-03) SWE-Fuse: SWE-Fuse: Empowering Software Agents via Issue-free Trajectory Learning and Entropy-aware RLVR Training arXiv
  • (2026-02) SWE-Master: SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training arXiv GitHub
  • (2026-02) SWE-Protรฉgรฉ: SWE-Protรฉgรฉ: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents arXiv
  • (2026-02) SWE-MiniSandbox: SWE-MiniSandbox: Container-Free Reinforcement Learning for Building Software Engineering Agents arXiv GitHub
  • (2026-01) MiMo-V2-Flash: MiMo-V2-Flash Technical Report arXiv
  • (2026-01) SWE-Manager: SWE-Manager: Selecting and Synthesizing Golden Proposals Before Coding arXiv GitHub
  • (2025-12) Self-play SWE-RL: Toward Training Superintelligent Software Agents through Self-Play SWE-RL arXiv
  • (2025-12) SWE-Playground: Training Versatile Coding Agents in Synthetic Environments arXiv
  • (2025-12) SWE-RM: SWE-RM: Execution-free Feedback For Software Engineering Agents arXiv
  • (2025-12) One Tool Is Enough: One Tool Is Enough: Reinforcement Learning for Repository-Level LLM Agents arXiv
  • (2025-12) Let It Flow: Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem arXiv
  • (2025-12) Deepseek V3.2: DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models arXiv
  • (2025-11) TSP: Think-Search-Patch: A Retrieval-Augmented Reasoning Framework for Repository-Level Code Repair DOI
  • (2025-10) CWM: CWM: An Open-Weights LLM for Research on Code Generation with World Models arXiv
  • (2025-10) FoldGRPO: Scaling Long-Horizon LLM Agent via Context-Folding arXiv
  • (2025-10) GRPO-based Method: A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning arXiv OpenReview Website
  • (2025-10) Supervised RL: Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning arXiv
  • (2025-10) KAT-Coder: KAT-Coder Technical Report arXiv
  • (2025-09) CoreThink: CoreThink: A Symbolic Reasoning Layer to reason over Long Horizon Tasks with LLMs arXiv
  • (2025-09) EntroPO: Building Coding Agents via Entropy-Enhanced Multi-Turn Preference Optimization arXiv
  • (2025-09) Kimi-Dev: Kimi-Dev: Agentless Training as Skill Prior for SWE-Agents arXiv
  • (2025-09) LongCat-Flash-Think: Introducing LongCat-Flash-Thinking: A Technical Report arXiv
  • (2025-08) Tool-integrated RL: Tool-integrated Reinforcement Learning for Repo Deep Search arXiv
  • (2025-08) SWE-Swiss: SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution Website
  • (2025-08) SeamlessFlow: SeamlessFlow: A Trainer Agent Isolation RL Framework Achieving Bubble-Free Pipelines via Tag Scheduling arXiv
  • (2025-08) DAPO: Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning arXiv
  • (2025-08) GLM-4.6: gpt-oss-120b & gpt-oss-20b model card arXiv
  • (2025-07) DeepSWE: DeepSWE: Training a State-of-the-Art Coding Agent from Scratch by Scaling RL Website
  • (2025-07) Kimi-K2-Instruct: Kimi K2: Open Agentic Intelligence arXiv
  • (2025-06) Agent-RLVR: Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards arXiv
  • (2025-06) SWE-Dev2: SWE-Dev: Building Software Engineering Agents with Training and Inference Scaling arXiv
  • (2025-06) Minimax M2: MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention arXiv
  • (2025-05) SWE-Dev1: SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development arXiv
  • (2025-05) Satori-SWE: Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering arXiv
  • (2025-05) Qwen3-Coder: Qwen3 Technical Report arXiv
  • (2025-04) Seed1.5-Thinking: Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning arXiv
  • (2025-03) SEAlign: SEAlign: Alignment Training for Software Engineering Agent arXiv DOI
  • (2025-02) SWE-RL: SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution arXiv OpenReview
  • (2025-02) SoRFT: SoRFT: Issue Resolving with Subtask-oriented Reinforced Fine-Tuning arXiv
  • (2024-10) OSCA: Scaling LLM Inference Efficiently with Optimized Sample Compute Allocation arXiv DOI

โšก Inference-Time Scaling

Methods for scaling at inference time

  • (2026-01) Agentic Rubrics: Agentic Rubrics as Contextual Verifiers for SWE Agents arXiv Website
  • (2025-10) SIADAFIX: SIADAFIX: issue description response for adaptive program repair arXiv
  • (2025-09) SWE-PRM: When Agents go Astray: Course-Correcting SWE Agents with PRMs arXiv
  • (2025-01) ReasoningBank: CodeMonkeys: Scaling Test-Time Compute for Software Engineering arXiv
  • (2024-10) SWE-Search: SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement arXiv OpenReview

๐Ÿ“ˆ Data Analysis

Analysis of datasets and benchmarks

  • (2025-12) Data contamination: Does SWE-Bench-Verified Test Agent Ability or Model Memory? arXiv
  • (2025-11) Test Overfitting on SWE-bench: Investigating Test Overfitting on SWE-bench arXiv
  • (2025-07) Rigorous agentic benchmarks: Establishing Best Practices for Building Rigorous Agentic Benchmarks arXiv
  • (2025-07) SPICE: SPICE: An Automated SWE-Bench Labeling Pipeline for Issue Clarity, Test Coverage, and Effort Estimation arXiv
  • (2025-06) UTBoost: UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench arXiv
  • (2025-06) Trustworthiness: Is Your Automated Software Engineer Trustworthy? arXiv
  • (2025-06) The SWE-Bench Illusion: The SWE-Bench Illusion: When State-of-the-Art LLMs Remember Instead of Reason arXiv
  • (2025-04) Revisiting SWE-Bench: Revisiting SWE-Bench: On the Importance of Data Quality for LLM-Based Code Models DOI
  • (2025-03) Patch Correctness: Are "Solved Issues" in SWE-bench Really Solved Correctly? An Empirical Study arXiv
  • (2024-08) SWE-bench Verified: Introducing SWE-bench Verified | OpenAI Website

๐Ÿ” Methods Analysis

Comparative analysis of different methods

  • (2026-02) ContextBench: ContextBench: A Benchmark for Context Retrieval in Coding Agents arXiv Website GitHub HuggingFace
  • (2025-12) SWEnergy: SWEnergy: An Empirical Study on Energy Efficiency in Agentic Issue Resolution Frameworks with SLMs arXiv
  • (2025-09) Failures analysis: An Empirical Study on Failures in Automated Issue Solving arXiv
  • (2025-07) Security analysis: How Safe Are AI-Generated Patches? A Large-scale Study on Security Risks in LLM and Agentic Automated Program Repair on SWE-bench arXiv
  • (2025-06) Dissecting the SWE-Bench Leaderboards: Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems arXiv
  • (2025-05) GSO: GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents arXiv
  • (2025-05) Strong-Weak Model Collaboration: An Empirical Study on Strong-Weak Model Collaboration for Repo-level Code Generation arXiv
  • (2025-05) Agents in the Wild Website
  • (2025-04) SeaView: SeaView: Software Engineering Agent Visual Interface for Enhanced Workflow arXiv
  • (2025-03) Beyond final code: Beyond Final Code: A Process-Oriented Error Analysis of Software Development Agents in Real-World GitHub Scenarios arXiv
  • (2025-02) Overthinking: The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks arXiv
  • (2024-10) Evaluating software development agents: Evaluating Software Development Agents: Patch Patterns, Code Quality, and Issue Complexity in Real-World GitHub Scenarios arXiv DOI
  • (2024-06) Context Retrieval: On The Importance of Reasoning for Context Retrieval in Repository-Level Code Editing arXiv

Evaluation & Training Datasets

A comprehensive survey and statistical overview of issue resolution datasets. We categorize these datasets based on programming language, modality support, source repositories, data scale (Amount), and the availability of reproducible execution environments.

DatasetLanguageMultimodalReposAmountEnvironmentLink
Single-PL Datasets
SWE-FixerPythonNo856115,406NoGitHub HuggingFace HuggingFace
SWE-smithPythonNo12850kYesGitHub HuggingFace
SWE-LegoPythonNo3,25132,119YesGitHub HuggingFace
SWE-rebenchPythonNo3,46821,336YesGitHub HuggingFace
SWE-bench-trainPythonNo3719kNoGitHub HuggingFace
SWE-FlowPythonNo7418,081YesGitHub
Skywork-SWEPythonNo2,53110,169Yes-
R2E-GymPythonNo108,135YesGitHub HuggingFace
RepoForgePythonNo-7.3kYes-
SWE-bench-extraPythonNo2k6.38kYesHuggingFace
SWE-GymPythonNo112,438YesGitHub HuggingFace
SWE-benchPythonNo122,294YesGitHub HuggingFace
SWE-bench-javaJavaNo191,797YesGitHub HuggingFace
FEA-benchPythonNo831,401YesGitHub HuggingFace
SWE-bench-LivePythonNo1641,565YesGitHub HuggingFace
Loc-BenchPythonNo-560NoGitHub HuggingFace
SWE-bench VerifiedPythonNo-500YesGitHub HuggingFace
SWE-bench LitePythonNo12300YesGitHub HuggingFace
SWE-MERAPythonNo200300YesGitHub HuggingFace
SWE-Bench-CLPythonNo8273YesGitHub
SWE-Sharp-BenchC#No17150YesGitHub HuggingFace
SWE-PerfPythonNo12140YesGitHub HuggingFace
Visual SWE-benchPythonYes11133YesGitHub HuggingFace
SWE-EVOPythonNo748YesGitHub
Multi-PL Datasets
SWE-MirrorPython, Rust, GoNo4060kYes-
Multi-SWE-benchJava, JS, TS, Go, Rust, C, C++No764,723YesGitHub HuggingFace
Swing-BenchPython, Go, C++, RustNo4002300Yes-
SWE-PolyBenchPython, Java, JS, TSNo212,110YesGitHub HuggingFace HuggingFace
SWE-CompassPython, JS, TS, Java, C, C++, Go, Rust, Kotlin, C#No-2,000YesGitHub HuggingFace
SWE-Bench ProPython, Go, TSNo411,865YesGitHub HuggingFace
SWE-bench++Python, Go, TS, JS, Ruby, PHP, Java, Rust, C++, C#, CNo3,9711,782YesGitHub HuggingFace
SWE-LancerJS, TSNo-1,488YesGitHub
OmniGIRLPython, TS, Java, JSYes15959YesGitHub HuggingFace
SWE-bench MultimodalJS, TS, HTML, CSSYes17619YesGitHub HuggingFace
SWE-fficiencyPython, CythonNo9498YesGitHub
SWE-FactoryPython, Java, JS, TSNo12430YesGitHub HuggingFace
SWE-bench-Live-MultiLang & WindowsPython, JS, TS, C, C++, C#, Java, Go, RustNo238418YesGitHub HuggingFace HuggingFace
SWE-bench MultilingualC, C++, Go, Java, JS, TS, Rust, Python, Ruby, PHPNo42300YesGitHub HuggingFace
SWE-InfraBenchPython, TSNo-100Yes-

Training Trajectory Datasets

A survey of trajectory datasets used for agent training or analysis. We list the programming language, number of source repositories, and total trajectories for each dataset.

DatasetLanguageReposAmountLink
SWE-FixerPython85669,752GitHub HuggingFace
SWE-rebenchPython1,82367,074HuggingFace
R2E-GymPython103,321GitHub HuggingFace
SWE-SynthPython113,018GitHub HuggingFace
SWE-FactoryPython102,809GitHub HuggingFace
SWE-GymPython11491GitHub HuggingFace
SWE-LegoPython325114.6kGitHub

SFT-based Methods

Overview of SFT-based methods for issue resolution. This table categorizes models by their base architecture and training scaffold (Sorted by Performance).

Model NameBase ModelSizeArch.Training ScaffoldRes.(%)CodeDataModel
SWE-rebench-openhands-Qwen3-235B-A22BQwen3-235B-A22B235B-A22BMoEOpenHands59.9-HuggingFaceHuggingFace
SWE-Lego-Qwen3-32BQwen3-32B32BDenseOpenHands57.6GitHubHuggingFaceHuggingFace
CGM-SWE-PYQwen2.5-Coder-72B72BDenseGraph RAG50.4GitHub-HuggingFace
SWE-rebench-openhands-Qwen3-30B-A3BQwen3-30B-A3B30B-A3BMoEOpenHands49.7-HuggingFaceHuggingFace
DevstralMistral Small 322BDenseOpenHands46.8--HuggingFace
Co-PatcheRQwen2.5-Coder-14B3\times\14BDensePatchPilot-mini46.0GitHub-HuggingFace
SWE-Swiss-32BQwen2.5-32B-Instruct32BDenseAgentless45.0GitHubHuggingFaceHuggingFace
SWE-Lego-Qwen3-8BQwen3-8B8BDenseOpenHands44.4GitHubHuggingFaceHuggingFace
Lingma SWE-GPTQwen2.5-72B-Instruct72BDenseSWESynInfer30.2GitHub--
SWE-Gym-Qwen-32BQwen2.5-Coder-32B32BDenseOpenHands, MoatlessTools20.6GitHub-HuggingFace
SWE-Gym-Qwen-14BQwen2.5-Coder-14B14BDenseOpenHands, MoatlessTools16.4GitHub-HuggingFace
SWE-Gym-Qwen-7BQwen2.5-Coder-7B7BDenseOpenHands, MoatlessTools10.6GitHub-HuggingFace

RL-based Methods

A comprehensive overview of specialized models for issue resolution, categorized by parameter size. The table details each model's base architecture, the training scaffold used for rollout, the type of reward signal employed (Outcome vs. Process), and their performance results (Res. %) on issue resolution benchmarks.

Model NameBase ModelSizeArch.Train. ScaffoldRewardRes.(%)CodeDataModel
560B Models (MoE)
LongCat-Flash-ThinkLongCatFlash-Base560B-A27BMoER2E-GymOutcome60.4GitHub-HuggingFace
72B Models
Kimi-DevQwen 2.5-72B-Base72BDenseBugFixer + TestWriterOutcome60.4GitHub-HuggingFace
SWE-RLLlama-3.3-70B-Instruct70BDenseAgentless-miniOutcome41.0GitHub--
Multi-turn RL(Nebius)Qwen2.5-72B-Instruct72BDenseSWE-agentOutcome39.0---
Agent-RLVR-RM-72BQwen2.5-Coder-72B72BDenseLocalization + RepairOutcome27.8---
Agent-RLVR-72BQwen2.5-Coder-72B72BDenseLocalization + RepairOutcome22.4---
32B Models
OpenHands CriticQwen2.5-Coder-32B32BDenseSWE-Gym-66.4GitHub-HuggingFace
KAT-Dev-32BQwen3-32B32BDense--62.4--HuggingFace
SWE-Swiss-32BQwen2.5-32B-Instruct32BDense-Outcome60.2GitHubHuggingFaceHuggingFace
FoldAgentSeed-OSS-36B-Instruct36BDenseFoldAgentProcess58.0GitHub--
SeamlessFlow-32BQwen3-32B32BDenseSWE-agentOutcome45.8GitHub--
DeepSWEQwen3-32B32BDenseR2E-GymOutcome42.2GitHubHuggingFaceHuggingFace
SA-SWE-32B-32BDenseSkyRL-Agent-39.4---
OpenHands LM v0.1Qwen2.5-Coder-32B32BDenseSWE-Gym-37.2GitHub-HuggingFace
SWE-Dev-32BQwen2.5-Coder-32B32BDenseOpenHandsOutcome36.6GitHub-HuggingFace
Satori-SWEQwen2.5-Coder-32B32BDenseRetriever + Code editorOutcome35.8GitHubHuggingFaceHuggingFace
SoRFT-32BQwen2.5-Coder-32B32BDenseAgentlessOutcome30.8---
Agent-RLVR-32BQwen2.5-Coder-32B32BDenseLocalization + RepairOutcome21.6---
14B Models
Agent-RLVR-14BQwen2.5-Coder-14B14BDenseLocalization + RepairOutcome18.0---
SEAlign-14BQwen2.5-Coder-14B14BDenseOpenHandsProcess17.7---
7-8B Models
SeamlessFlow-8BQwen3-8B8BDenseSWE-agentOutcome27.4GitHub--
SWE-Dev-7BQwen2.5-Coder-7B7BDenseOpenHandsOutcome23.4GitHub-HuggingFace
SoRFT-7BQwen2.5-Coder-7B7BDenseAgentlessOutcome21.4---
SWE-Dev-8BLlama-3.1-8B8BDenseOpenHandsOutcome18.0GitHub-HuggingFace
SEAlign-7BQwen2.5-Coder-7B7BDenseOpenHandsProcess15.0---
SWE-Dev-9BGLM-4-9B9BDenseOpenHandsOutcome13.6GitHub-HuggingFace

General Foundation Models

Overview of general foundation models evaluated on issue resolution. The table details the specific inference scaffolds (e.g., OpenHands, Agentless) employed during the evaluation process to achieve the reported results.

Model NameSizeArch.Inf. ScaffoldRewardRes.(%)CodeModel
KAT-Coder--Claude CodeOutcome73.4-Website
MiMo-V2-Flash309B-A15BMoEAgentlessOutcome73.4GitHubHuggingFace
Deepseek V3.2671B-A37BMoEClaude Code, RooCode-73.1GitHubHuggingFace
Kimi-K2-Instruct1TMoEAgentlessOutcome71.6-HuggingFace
Qwen3-Coder480B-A35BMoEOpenHandsOutcome69.6GitHubHuggingFace
GLM-4.6355B-A32BMoEOpenHandsOutcome68.0-HuggingFace
gpt-oss-120b116.8B-A5.1BMoEInternal toolOutcome62.0GitHubHuggingFace
Minimax M2230B-10BMoER2E-GymOutcome61.0GitHubHuggingFace
gpt-oss-20b20.9B-A3.6BMoEInternal toolOutcome60.0GitHubHuggingFace
GLM-4.5-Air106B-A12BMoEOpenHandsOutcome57.6--
Minimax M1-80k456B-A45.9BMoEAgentlessOutcome56.0GitHubWebsite
Minimax M1-40k456B-A45.9BMoEAgentlessOutcome55.6GitHubWebsite
Seed1.5-Thinking200B-A20BMoE-Outcome47.0GitHub-
Llama 4 Maverick400B-A17BMoEmini-SWE-agentOutcome21.0GitHubHuggingFace
Llama 4 Scout109B-17BMoEmini-SWE-agentOutcome9.1GitHubHuggingFace


๐Ÿš€ Quick Start

# First time: install dependencies
pip install flask flask-cors sqlalchemy pyyaml requests

# Full update + start admin server
# (refreshes news, re-renders README/docs, builds static site, then serves)
python app.py

# Or force re-import from YAML/CSV first
python app.py --init

Survey exploration skill is available at ./skill/survey-repo-explorer.
It can work with your local agent to build a categorized literature knowledgebase under ./survey2knowledgebase.

Open http://localhost:5000/admin to manage papers, datasets, and methods.

CommandDescription
python app.pyFull update (news + render + build) then start server
python app.py --initRe-import from YAML/CSV, then full update + start
python app.py --no-updateStart server without running update steps
python app.py --port 8080Use a custom port
python app.py --newsRefresh Recent Papers section only and exit
python app.py --renderRe-render README/docs from DB only and exit
python app.py --buildBuild static site (mkdocs) only and exit


๐Ÿค Contributing

We welcome contributions! To add new papers or tables:

  1. Fork this repository
  2. Add entries via the admin interface (python app.py โ†’ localhost:5000/admin)
    โ€” or manually edit the YAML/CSV files in data/
  3. Run python app.py --init if you edited files directly
  4. Submit a PR with your changes

Code Generation

The application of LLMs in the programming domain has witnessed explosive growth. Early research focused primarily on function-level code generation, with benchmarks such as HumanEval serving as standard metrics. However, generic benchmarks often fail to capture the nuances of real-world development. To bridge this gap, recent initiatives have attempted to extend evaluation tasks to align more closely with realistic software development scenarios, revealing the limitations of general models in specialized domains. Concurrently, methods are also evolving to capture these broader contexts. While foundational approaches primarily relied on SFT or standard retrieval-augmented generation, RL-based methods emerged as a pivotal direction for handling complex coding tasks.

Related:

Automated Software Generation

The primary goal of this task is to autonomously construct complete and executable software systems starting from high-level natural language requirements. Unlike code completion, it necessitates covering the Software Development Life Cycle (SDLC), including requirement analysis, system design, coding, and testing. To address the complexity and potential logic inconsistencies in this process, state-of-the-art frameworks leverage multi-agent collaboration, simulating human development teams to decompose complex tasks into streamlined and verifiable workflows.

Related:

Automated Software Maintenance

Issue resolution is intrinsically linked to the broader domain of automated software maintenance. Methodologies established in this field are frequently encapsulated as callable tools to augment the capabilities of LLMs in software development tasks.

Related:

Automated Environment Setup

Recent initiatives focus on automating the configuration of runtime environments for entire repositories. This capability develops in parallel with data construction for issue resolution.

Related:

Existing surveys primarily focus on code generation or other tasks within the software engineering domain. This paper bridges this gap by offering the first systematic survey dedicated to the entire spectrum of issue resolution, ranging from non-agent approaches to the latest agentic advancements.

Related:


๐Ÿ“„ Citation

If you use this project or related survey in your research or system, please cite the following:

Li, Caihua, Guo, Lianghong, Wang, Yanlin, et al. (2026). Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey. arXiv preprint arXiv:2601.11655.

BibTeX:

@article{li2026advances,
  title={Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey},
  author={Li, Caihua and Guo, Lianghong and Wang, Yanlin and Guo, Daya and Tao, Wei and Shan, Zhenyu and Liu, Mingwei and Chen, Jiachi and Song, Haoyu and Tang, Duyu and Zhang, Hongyu and Zheng, Zibin},
  journal={arXiv preprint arXiv:2601.11655},
  year={2026},
  eprint={2601.11655},
  archivePrefix={arXiv},
  primaryClass={cs.SE}
}

๐Ÿ™ Acknowledgements

We would like to express our sincere gratitude to:

  • The authors of cited papers who provided valuable feedback on how their work is presented in this survey, greatly improving its accuracy and comprehensiveness.

  • All contributors who have helped improve this project through issues, pull requests, and discussions.

  • The open-source community for developing the amazing tools and frameworks that made this project possible.

Special Thanks

  • @chao-peng (Dr. Chao Peng), ByteDance Software Engineering Lab, for providing valuable suggestions on the Challenges and Opportunities section of our survey.

  • @EuniAI/awesome-code-agents for providing an excellent reference on managing survey papers through documentation systems and inspiring our project structure.


๐Ÿ“ฌ Contact

If you have any questions or suggestions, please contact us through:


๐Ÿ“œ License

This project is licensed under the MIT License - see the LICENSE file for details.


โญ Star this repository if you find it helpful!

Made with โค๏ธ by the DeepSoftwareAnalytics team

Documentation | Paper | Tables | About | Cite