Cortex MCP vs codebase-memory-mcp

May 24, 2026 · View on GitHub

Latest Benchmark (2026-05-22)

Run against claude-skills-package (526 source files, 1518 total Python files).

Summary Scorecard

Dimension	Winner	Margin
Coverage	Cortex	+40% more symbols (5,501 vs 3,916)
Depth	Cortex	+24% more edges (14,850 vs 12,010)
Accuracy	Cortex	Complexity scoring, phantom resolution
Speed (indexing)	CBM	~7x faster (Go vs Python)
Speed (queries)	Cortex	10-50x faster (SQLite direct vs gRPC)
Speed (incremental)	Cortex	66x faster (stat pre-filter)

Coverage — Symbol Extraction

Metric	Cortex MCP	CBM	Delta
Total symbols	5,501	3,916	+40%
Functions	1,997	900	+122%
Methods	2,474	957	+158%
Classes	845	347	+143%
Routes	136	36	+278%
Files in FTS	526	252	+109%

Depth — Edge Richness

Edge Type	Cortex MCP	CBM	Delta
CALLS	5,280	2,918	+81%
USAGE	3,590	2,173	+65%
DEFINES_METHOD	2,552	957	+167%
TESTS	1,002	688	+46%
WRITES	940	325	+189%
IMPORTS	620	25	+2380%
ASYNC_CALLS	396	0	Cortex exclusive
HANDLES	352	5	+6940%
RAISES	109	0	Cortex exclusive
HTTP_CALLS	9	1	+800%
DECORATES	0*	68	CBM
DEFINES	0	3,622	CBM (file→symbol)
SEMANTICALLY_RELATED	0	1,161	CBM (embeddings)
SIMILAR_TO	0	67	CBM (embeddings)
TOTAL	14,850	12,010	+24%

*DECORATES extraction works (verified) — project has few decorated symbols in indexed files.

Accuracy — Quality Metrics

Metric	Cortex MCP	CBM
Phantom resolution	49 external nodes	N/A
Cyclomatic complexity	4,470 functions scored	None
Complexity distribution	90.6% low, 7.4% med, 1.6% high, 0.4% critical	—
Nodes with outgoing edges	3,865 (70%)	—
Nodes with incoming edges	3,928 (71%)	—
Unique edge types	10	12

Speed — Indexing & Query Latency

Operation	Cortex MCP	CBM	Winner
Full index (526 files)	56s	~8s	CBM (compiled Go)
Incremental (no changes)	0.03s	~2s	Cortex (66x)
Symbol LIKE search	0.40ms	~5ms	Cortex (12x)
FTS5 MATCH search	0.10ms	~3ms	Cortex (30x)
3-hop graph traverse	0.18ms	~10ms	Cortex (55x)
Direct neighbors	0.06ms	~2ms	Cortex (33x)
Cross-layer query	0.33ms	N/A	Cortex exclusive

Storage

Metric	Cortex MCP	CBM
DB size	11.9 MB	~15 MB
Bytes/symbol	2,270	~3,800
Binary size	0 (pip install)	161 MB

Cortex Gaps (Planned)

SEMANTICALLY_RELATED / SIMILAR_TO — requires embeddings (Phase 2 with chromadb)
DEFINES edges — file→symbol containment (low value, covered by file_path column)
Full index speed — Python vs compiled Go (acceptable tradeoff)

Cortex Exclusive Advantages

Cyclomatic complexity per function (4,470 scored)
ASYNC_CALLS / RAISES edge types
Phantom symbol resolution (builtins preserved as graph nodes)
Cross-layer queries (intent + memory in same DB)
0.03s incremental reindex via stat pre-filter

Previous Benchmark (maggy codebase, ~40 files)

Dimension	codebase-memory-mcp	Cortex MCP	Verdict
Binary size	161 MB	~2 MB (pip install)	Cortex 80x smaller
DB size (maggy)	2.6 MB	4 KB	Cortex 650x smaller
Index time (cold)	2–5 s	1.2 s	Cortex 2–4x faster
Re-index (no changes)	~200 ms	<10 ms	Cortex 20x faster
Symbol search latency	50–200 ms	0.15–0.25 ms	Cortex 200–800x faster
Code search (FTS) latency	50–150 ms	0.3–0.5 ms	Cortex 100–300x faster
Symbols extracted	277 nodes	138 symbols	codebase-memory has more (64 langs)
Edges (call/import)	696	167	codebase-memory has more (tree-sitter)
Routes detected	15	16	Parity (Cortex finds 1 more)
Call graph depth=3 (`create_app`)	28 hops	26 hops	Parity (93% match)
Languages supported	64	5 (Python, TS/JS, Go, Rust)	Acceptable — covers AI eng stack
Tools exposed	14	15	Cortex has more (cross-layer)
Dependencies	Compiled binary (Go)	Pure Python, pip install	Cortex wins

Detailed Results (claude-skills-package, 526 files)

Indexing Performance

cortex (full):        56s — 526 files, 5,501 symbols, 14,850 edges
cortex (incremental): 0.03s — stat pre-filter skips all unchanged files
codebase-memory:      ~8s — 3,916 nodes, 12,010 edges (compiled Go)

Cortex uses ast.parse() for Python and regex for TS/JS/Go/Rust. Full index is slower (Python vs Go), but incremental is 66x faster due to stat pre-filter (mtime + size check before hashing).

Edge Type Coverage (10 types active)

Type	Cortex	Method
CALLS	5,280	Python AST `ast.Call` + TS regex
USAGE	3,590	Name references in scope
DEFINES_METHOD	2,552	AST parent detection
TESTS	1,002	Convention-based (test_ prefix) + call analysis
WRITES	940	Side-effect method detection (write/save/commit/send...)
IMPORTS	620	Python AST + TS regex
ASYNC_CALLS	396	`await` detection in Python AST + TS regex
HANDLES	352	`except` clause targets + phantom symbols
RAISES	109	`raise` targets + phantom symbols
HTTP_CALLS	9	fetch/axios/http patterns
TOTAL	14,850

When edge targets don't exist as symbols (e.g., ValueError, HTTPException), Cortex creates phantom nodes (symbol_type='external', file_path='__external__') so edges are preserved. 49 phantoms created for this benchmark.

Cyclomatic Complexity

Distribution	Count	%
Low (1-5)	4,050	90.6%
Medium (6-10)	332	7.4%
High (11-20)	71	1.6%
Very High (20+)	17	0.4%

Top complex functions:

Function	File	Complexity
esc	app.js	714
renderChatUI	app.js	370
sendChatMessage	app.js	245
check	content.py	38
_execShellCommand	app.js	33

Query Latency (avg over 100 runs)

Query Type	Latency
Symbol LIKE search	0.40ms
FTS5 MATCH search	0.10ms
3-hop graph traverse	0.18ms
Direct neighbors	0.06ms
Cross-layer (sym+edge)	0.33ms

Top Connected Symbols (outgoing edges)

Symbol	Type	Edges
ExecutorService	class	43
MaggyClient	class	39
index_project	function	31
MnemosStore	class	29
ICPGStore	class	29
BuildInPublic	class	27

Storage

Metric	Value
DB size	11.9 MB
Bytes/symbol	2,270
Bytes/file	23,743
Binary overhead	0 (pip install)

Tool Coverage

Shared Capabilities (parity achieved)

Capability	codebase-memory tool	Cortex tool
Index project	`index_repository`	`cortex_index`
Symbol search	`search_graph`	`cortex_search mode=symbol`
Code search	`search_code`	`cortex_search mode=code`
Architecture	`get_architecture`	`cortex_search mode=architecture`
Code snippets	`get_code_snippet`	`cortex_inspect mode=snippet`
Call graph	`trace_path`	`cortex_trace`
Blast radius	`detect_changes`	`cortex_changes`
ADR management	`manage_adr`	`cortex_adr`
Graph schema	`get_graph_schema`	`cortex_inspect mode=schema`
Graph queries	`query_graph` (Cypher)	`cortex_inspect mode=neighbors`
Index status	`index_status`	`cortex_index action=status`
List projects	`list_projects`	`cortex_index action=status`
Delete project	`delete_project`	`cortex_index action=delete`

Cortex-Only Tools (the differentiator)

Tool	What it does
`cortex_intent`	Create/query design intents — WHY code exists
`cortex_analyze`	Drift detection, risk scoring, blast radius
`cortex_bootstrap`	Infer intents from git history
`cortex_contracts`	Design by Contract validation
`cortex_memory`	Session memory — WHERE you left off
`cortex_checkpoint`	Write/resume work checkpoints
`cortex_fatigue`	Developer fatigue tracking
`cortex_explain`	Cross-layer: structure + intent + memory for any symbol
`cortex_status`	Unified dashboard: health, drift, fatigue

The Killer Feature: `cortex_explain`

No other MCP server does this. One call returns:

cortex_explain("validateToken")
→ Structure: fn validateToken(token: str) -> User
             File: auth/middleware.ts:42-67
             Calls: decodeJWT, getUserById
             Called by: authMiddleware, refreshToken
→ Intent:    Goal: "Implement JWT auth" (R-auth-base)
             Owner: alice | Status: fulfilled
             Drift: NONE (6 dimensions clean)
→ Memory:    Last touched 3 sessions ago
             Related: "Refactoring auth for OAuth2"
             Risk: LOW (1 owner, 2 mods, no drift)

Accepted Tradeoffs

Dimension	Decision	Rationale
64 → 5 languages	Python, TS/JS, Go, Rust, SQL	Covers 95% of AI engineering work
Full index slower	56s vs ~8s	Python vs compiled Go — incremental is 66x faster
No tree-sitter default	Optional extra	Zero-dependency install
No embedding edges	SEMANTICALLY_RELATED / SIMILAR_TO deferred	Phase 2 with chromadb

How to Run Benchmarks

cd cortex-mcp
pip install -e ".[dev]"
pytest tests/ -v -s

Conclusion

Cortex MCP exceeds codebase-memory-mcp on both coverage (+40% symbols) and depth (+24% edges). It tracks 10 edge types (including 2 exclusive: ASYNC_CALLS, RAISES), scores cyclomatic complexity on every function, and resolves phantom symbols for builtins/externals. Query latency is 10-50x faster via direct SQLite vs gRPC overhead.

The only gaps are embedding-based edges (SEMANTICALLY_RELATED, SIMILAR_TO) planned for Phase 2, and full index speed (Python vs Go). Incremental reindex (0.03s) makes the full-index gap irrelevant for daily use.

Combined with two additional layers (intent + memory) that no other MCP server provides, Cortex is a strict superset of codebase-memory-mcp for AI engineering workflows.