Claude Deep Research Agent

June 9, 2026 · View on GitHub

See example outputs here: https://claude-code-deep-research.vercel.app/

Why This Exists
Repo Structure
Version 4.0
Quick Start
How It Works
Customization
Credits & Acknowledgements
License

UPDATE: Added Claude2.md — updated for deeper research and closer Graph-of-Thought patterns.

Version 4.0 (June 2026)

V4 replaces thinking scaffolds with verification scaffolds: deterministic stop rules and citation gates (scripts), cross-model verification, consequence-tier confidence floors, run cards, and signed residue statements. See Version4/README.md. V3 remains unchanged below for existing users.

Recommended default for new research: start with V4. Use V3 only when you need the original public Graph-of-Thoughts workflow or are comparing V3 against V4.

Why This Exists

Large Language Models (LLMs) excel at single queries but struggle with complex, multi-step research requiring iterative querying, source verification, and citations—what OpenAI and Google call "Deep Research." Anthropic’s Claude Code can achieve the same results, provided the right instructions. This repo supplies those instructions, streamlined into an easy-to-use workflow.

Repo Structure

File/Folder	Purpose
CLAUDE.md	Master instructions for Claude Code. Includes Graph-of-Thoughts integration and deep-research methodology.
Deep Research Question Generator System Prompt.md	ChatGPT system prompt (o3/o3-pro) refining raw questions into structured prompts (OpenAI format recommended).
deepresearchprocess.md	Comprehensive 7-phase deep research playbook inspired by OpenAI & Google Gemini, foundational to `CLAUDE.md`.
.template_mcp.json	Optional MCP server configuration for local filesystem and browser automation with Claude.
`examples/`	Sample refined questions and completed Claude reports compared to other outputs.
`Version4/`	Current verification-gated workflow: control file, installable agents, scripts, schemas, prompt refiner, skill wrapper, benchmark artifacts, and deployment reports.
`AGENTS.md` / `GEMINI.md`	Pointers for Codex/Gemini-style surfaces to read `Version4/skills/deep-research/SKILL.md` and follow the V4 control logic.

Quick Start

Recommended: Version 4.0 verification-gated workflow

Read Version4/README.md for the V3 → V4 changes.
Use Version4/CLAUDE.md as the controller and Version4/agents/*.md for the six research roles.
Keep Version4/scripts/ and Version4/schemas/ with the run so stop_rule.py, citation_audit.py, run cards, ledgers, and residue statements are available.
Start with: Deep research [your question].
Before finalizing, run the deterministic gates and record any unresolved claims in the residue statement.

Legacy V3 workflow

Example output and comparisons from the examples folder: https://claude-code-deep-research.vercel.app/

Step 1: Refine Your Question with ChatGPT (or your favorite LLM)

Open ChatGPT (model o3/o3-pro or other thinking models work best).
Set the system prompt to the contents of Deep Research Question Generator System Prompt.md.
Paste your raw research question into the user prompt.
Respond to any clarifying questions from ChatGPT.
Copy the generated OpenAI-formatted prompt.

Step 2: Prepare Claude Code

Launch a new Claude Code session.
Set the model using /model opus.

Type:

Please read the CLAUDE.md file and confirm when ready for my deep research question.

Wait for Claude’s confirmation.

Step 3: Launch Deep Research

Paste your refined question.
Claude autonomously performs:
- Research planning with Graph-of-Thoughts.
- Spins up multiple subagents to do the work faster
- Iterative search and data scraping.
- Fact verification and cross-referencing.
- Markdown report generation with citations and bibliography.
Review and refine as needed.

Tip ⚡: Include directory instructions, such as:

"Save all outputs in the /RESEARCH/[topic] folder."

[ Question Refinement ] → [ Claude Code / agent surface ] → [ V4 verification-gated research pipeline ] → [ Cited Markdown Report + Evidence Ledger + Run Card + Residue ]

Version4: Current workflow. It uses branch quotas, disjointness checks, deterministic stop rules, citation audits, cross-model verification, consequence-tier confidence floors, run cards, and signed residue statements.
DeepResearchProcess: Implements a 7-phase pipeline—Scope → Plan → Retrieve → Triangulate → Draft → Critique → Package.
Graph-of-Thoughts: Allows Claude to branch and merge multiple reasoning paths rather than relying on linear chains.
CLAUDE.md: Integrates instructions, enabling Claude to autonomously select tools, verify information, and embed citations systematically.

Build Notes

Research Methodology: Derived from OpenAI and Gemini’s deep-research playbooks.
Graph-of-Thoughts Integration: Adapted from Graph-of-Thoughts to support dynamic research pathways.
Prompt Generation: ChatGPT-based structured prompt ensures clarity, reducing confusion during Claude’s research by over 50% in tests.
Automation Hooks: The .template_mcp.json demonstrates local automation options via MCP servers, enabling advanced Claude operations.

Customization

Output Styles: Adjust formatting and citation preferences directly within the CLAUDE.md file.
Model Flexibility: Alternative Gemini-specific prompts provided by the ChatGPT system prompt generator if preferred.
Tool Integration: Expand automation via MCP by updating .template_mcp.json and referencing additional tools within CLAUDE.md.

Credits & Acknowledgements

Graph-of-Thoughts Framework: SPCL, ETH Zürich (MIT License).
Methodologies inspired by publicly available OpenAI and Google Gemini documentation.
Developed by Ankit at My Business Care Team (MyBCAT).

License

MIT License. See LICENSE file for full details.