Selectools Development Roadmap

June 15, 2026

An open-source project from NichevLabs.

Status Legend

  • ✅ Implemented - Merged and available in latest release
  • 🔵 In Progress - Actively being worked on
  • 🟡 Planned - Scheduled for implementation
  • ⏸️ Deferred - Postponed to later release
  • ❌ Cancelled - No longer planned

v0.17.0 ✅ Eval Framework: 39 evaluators → A/B testing → snapshots → regression → HTML/JUnit → CI → templates

v0.17.1 ✅ MCP Client/Server: MCPClient → mcp_tools() → MCPServer → MultiMCPClient → tool interop

v0.17.3 ✅ Agent Runtime Controls: Token budget → Cancellation → Cost attribution → Structured results → Approval gate → SimpleStepObserver

v0.17.4 ✅ Agent Intelligence: Token estimation → Model switching → Knowledge memory enhancement (4 store backends)

v0.17.5 ✅ Bug Hunt & Async Guardrails: 91 validated fixes (13 critical, 26 high, 52 medium+low) → Async guardrails → 40 regression tests → 5 new Common Pitfalls

v0.17.6 ✅ Quick Wins: ReAct/CoT reasoning strategies → Tool result caching → Python 3.9–3.13 CI matrix

v0.17.7 ✅ Caching & Context: Semantic caching → Prompt compression → Conversation branching (55 tests, 3 examples)

v0.18.0 ✅ Multi-Agent Orchestration + Composable Pipelines: AgentGraph → GraphState → Typed reducers → Resume-from-yield interrupts → Scatter fan-out → Checkpointing → SupervisorAgent → Graph visualization → Pipeline → @step → | operator → parallel() → branch()

v0.19.0 ✅ Serve, Deploy & Complete Composition: selectools serve CLI → Playground UI → YAML config → 5 agent templates → Structured AgentConfig → compose() → retry() / cache_step() → Type-safe step contracts → Streaming composition → pipeline.astream() → PostgresCheckpointStore → TraceStore (3 backends) → selectools doctor

v0.19.1 ✅ Advanced Agent Patterns: PlanAndExecute → ReflectiveAgent → Debate → TeamLead → 50+ evaluators

v0.19.2 ✅ Enterprise Hardening: Security audit → Stability markers (@stable/@beta/@deprecated) → Deprecation policy → Compatibility matrix → trace_to_html() waterfall viewer → SBOM → Property-based tests (Hypothesis) → Concurrency smoke suite → 5 production simulations → 3,344 tests, 76 examples

v0.19.3 ✅ Stability Markers Applied to All Public APIs: @stable on 60+ core symbols → @beta on 30+ orchestration/pipeline/patterns symbols → Full stability introspection via .__stability__ on every exported class and function

v0.20.0 ✅ Visual Agent Builder: Zero-install web UI → Drag-drop graph builder → YAML/Python export → Self-contained HTML (no React, no CDN) → One command: selectools serve --builder

v0.20.1 ✅ Builder Polish + Starlette + GitHub Pages: UI polish (20 features) → _static/ architecture split → Starlette ASGI app → Serverless mode (client-side AI/runs) → GitHub Pages deployment → Design system

v0.21.0 ✅ Connector Expansion + Multimodal + Observability: FAISS → Qdrant → pgvector vector stores → Azure OpenAI provider → Multimodal messages (images, audio) → CSV/JSON/HTML/URL document loaders → OTel observer → Langfuse observer → Code execution, web search, GitHub, DB toolbox tools

v0.22.0 ✅ Competitor-Informed Bug Fixes + Loop Detection + Ruff Tooling: 38 bug fixes from 3 rounds mining Agno/PraisonAI/LangChain/LangGraph/CrewAI/n8n/LlamaIndex/AutoGen/LiteLLM/Pydantic-AI/Haystack (~325k stars) → Loop detection (Repeat, Stall, PingPong) with RAISE / INJECT_MESSAGE policies → Dev tooling consolidated: Black + isort + flake8 → Ruff → 30-recipe cookbook expansion → 95 runnable examples

v0.23.0 ✅ Supabase Sessions + Builder RAG: SupabaseSessionStore → 4th SessionStore backend (JSON/SQLite/Redis/Supabase) → Visual builder: first-class Retriever (RAG) + Session Store node types → 7 vector-store backends in builder (memory/SQLite/Chroma/Pinecone/FAISS/Qdrant/pgvector) → Hybrid (BM25 + vector + RRF) + cross-encoder rerank toggles → New presets: Hybrid RAG, Multi-Tenant RAG (pgvector + Supabase session) → 8 post-ship code-gen fixes in builder (embedder class names, HybridSearcher params, etc.) → 96 runnable examples, 5332 tests total

v0.24.0 ✅ Production Interop: Agent-as-API (AgentAPI: REST + SSE + session CRUD + auth) → A2A protocol (Agent Card + JSON-RPC 2.0 server/client) → LiteLLMProvider (100+ models) → RouterProvider (cost-optimized tier routing) → Anthropic prompt caching → UnifiedMemory (conversation/knowledge/entity/episodic tiers) → Cross-session search on all 4 SessionStore backends → KnowledgeBackend (Supabase/Redis) → ToolResult base + Artifact side-channel → Deferred confirmation flow (selectools.pending) → Toolbox expansion: 15 new tools (33 → 48) → Gemini schema sanitization + flash-lite compat → 106 runnable examples, 5968 tests total

v0.25.0 ✅ Hardening & v1.0 Prep: Planning-as-config (AgentConfig(planning=...)) → Agent-level HITL (ToolConfig(require_approval=...)) → Tool result compression → Knowledge pre-save sanitizers → Pending intent hooks (pop_if_intent, tighten_ttl) → Stability marking sweep: 433 public symbols 100% marked (205 stable / 228 beta), 19 beta→stable promotions, __stability__ on all 123 public modules, CI gate → Wart removal: clone_for_isolation() public, __all__ reconciled (+11 exports), AgentConfig.hooks REMOVED (BREAKING) → Security audit published (docs/SECURITY_AUDIT.md) → 0.x→1.0 migration guide → Compatibility matrix refresh → 111 runnable examples, 7268 tests total

v0.26.0 ✅ Safety Patch & Verified Registry: Confirm-parser negation veto (non-leading negation no longer fired destructive CONFIRM) → Model registry refresh: 152 → 115, every entry source-verified, opus-4-1 pricing corrected, retired-model constants REMOVED (BREAKING) → Cache-aware calculate_cost → A2A -32602 on malformed parts → Gemini embedding dimension constant 3072 → 111 runnable examples, 7420 tests total

v1.0.0 🟡 Stable Release (bake window, code-complete): API freeze ✅ (warts removed in v0.25) → Stability markers on all modules ✅ → Security audit published ✅ → Compatibility matrix ✅ → 0.x→1.0 migration guide ✅ → Deprecation policy → Remaining at tag time: drop Python 3.9 → PyPI classifier: Production/Stable


v0.19.1: Advanced Agent Patterns ✅

Higher-level agent architectures built on the v0.18.0 orchestration primitives. Closes the "Advanced patterns" competitive gap. Each pattern is a standalone class that wires up the AgentGraph topology for you.

PlanAndExecute Agent

from selectools.patterns import PlanAndExecuteAgent

agent = PlanAndExecuteAgent(
    planner=planner_agent,
    executors={"research": researcher, "write": writer, "review": reviewer},
)
result = agent.run("Write a technical blog post about vector databases")
# Planner creates structured plan → executors handle each step → result aggregated

ReflectiveAgent

from selectools.patterns import ReflectiveAgent

agent = ReflectiveAgent(
    actor=writer_agent,
    critic=reviewer_agent,
    max_reflections=3,
    stop_condition="approved",
)
result = agent.run("Draft a press release")
# Actor produces draft → Critic evaluates → Actor revises → repeat until approved
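
The actor/critic loop described in the comment above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the ReflectiveAgent implementation: `reflect`, `toy_actor`, and `toy_critic` are hypothetical names, the agents are replaced by plain callables, and treating `stop_condition` as a substring match on the critic's verdict is a guess at the semantics.

```python
from typing import Callable, Optional, Tuple


def reflect(
    actor: Callable[[str, Optional[str]], str],
    critic: Callable[[str], str],
    task: str,
    max_reflections: int = 3,
    stop_condition: str = "approved",
) -> Tuple[str, int]:
    """Return (final_draft, revisions_used)."""
    draft = actor(task, None)  # first draft, no feedback yet
    for revision in range(max_reflections):
        verdict = critic(draft)
        if stop_condition in verdict.lower():
            return draft, revision  # critic is satisfied, stop early
        draft = actor(task, verdict)  # revise using the critic's feedback
    return draft, max_reflections  # budget exhausted, return the last draft


# Toy stand-ins (hypothetical): the critic approves once the draft carries a date.
def toy_actor(task: str, feedback: Optional[str]) -> str:
    return f"{task} (dated 2026-06-15)" if feedback else task


def toy_critic(draft: str) -> str:
    return "approved" if "dated" in draft else "needs a date"


final, used = reflect(toy_actor, toy_critic, "Draft a press release")
print(final, used)
```

Capping the loop with `max_reflections` keeps cost bounded even when the critic never approves; the last draft is returned in that case rather than raising.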

Debate Pattern

from selectools.patterns import DebateAgent

agent = DebateAgent(
    agents={"optimist": optimist_agent, "skeptic": skeptic_agent},
    judge=judge_agent,
    max_rounds=3,
)
result = agent.run("Should we adopt microservices?")
# Agents argue positions → Judge synthesizes final answer

TeamLead Pattern

from selectools.patterns import TeamLeadAgent

agent = TeamLeadAgent(
    lead=lead_agent,
    team={"analyst": analyst, "engineer": engineer, "writer": writer},
    delegation_strategy="dynamic",  # or "sequential", "parallel"
)
result = agent.run("Investigate and fix the billing discrepancy")
# Lead delegates tasks, reviews work, coordinates handoffs

Expanded Eval Suite (50 evaluators, up from 39)

11 new evaluators across two categories:

New deterministic (+8): ReadabilityEvaluator, AgentTrajectoryEvaluator, ToolEfficiencyEvaluator, SemanticSimilarityEvaluator, MultiTurnCoherenceEvaluator, JsonSchemaEvaluator, KeywordDensityEvaluator, ForbiddenWordsEvaluator

New LLM-as-judge (+4): FactConsistencyEvaluator, CustomRubricEvaluator, AnswerAttributionEvaluator, StepReasoningEvaluator

| Feature | Status | Impact | Effort |
| --- | --- | --- | --- |
| PlanAndExecute | ✅ | High | Medium |
| ReflectiveAgent | ✅ | High | Medium |
| Debate | ✅ | Medium | Medium |
| TeamLead | ✅ | Medium | Medium |
| 50 evaluators | ✅ | High | Medium |

Quality Infrastructure

  • Ralph loop: autonomous hunt-and-fix convergence system (scripts/ralph_bug_hunt.sh, /ralph-bug-hunt skill)
  • Bandit in CI: security scan job on every push
  • Property-based tests: Hypothesis suite for structural invariants
  • Thread-safety smoke suite: 10-thread × 20-op concurrency tests
  • Production simulations: 16 integration tests covering memory pressure, provider failover, tool errors, concurrent load

v0.19.2: Enterprise Hardening ✅

Focus: Production readiness and developer trust signals before the Visual Agent Builder in v0.20.0.

Enterprise Hardening

| Feature | Status | Impact | Effort |
| --- | --- | --- | --- |
| Security audit (bandit + manual nosec review) | ✅ | High | Medium |
| Stability markers (@stable, @beta, @deprecated) | ✅ | Medium | Small |
| Deprecation policy (2-version window, docs/DEPRECATION_POLICY.md) | ✅ | Medium | Small |
| Compatibility matrix (Python × provider SDK × optional deps) | ✅ | Medium | Small |
| SBOM (sbom.json via CycloneDX, published in repo) | ✅ | Low | Small |
| Enhanced trace viewer (trace_to_html() waterfall HTML) | ✅ | High | Medium |

Quality Infrastructure

| Feature | Status | Impact | Effort |
| --- | --- | --- | --- |
| Property-based tests (Hypothesis) | ✅ | High | Medium |
| Thread-safety smoke suite | ✅ | High | Medium |
| Production simulations (5 new) | ✅ | High | Medium |

v0.19.3: Stability Markers Applied ✅

Focus: Apply @stable and @beta markers to every public symbol in the library, completing the stability annotation work started in v0.19.2.

Stable APIs (60+ symbols)

Core types, providers, agent, memory, tools, evals, guardrails, sessions, knowledge, cache, cancellation, token estimation, analytics, and audit are all marked @stable. Breaking changes to these require a major version bump.

Beta APIs (30+ symbols)

Orchestration (AgentGraph, SupervisorAgent), pipelines (Pipeline, @step, parallel, branch), patterns (PlanAndExecuteAgent, ReflectiveAgent, DebateAgent, TeamLeadAgent), and composition (compose) are marked @beta. These may change in a minor release.

Introspection

from selectools import Agent, AgentGraph, PlanAndExecuteAgent
print(Agent.__stability__)              # "stable"
print(AgentGraph.__stability__)         # "beta"
print(PlanAndExecuteAgent.__stability__)  # "beta"
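
For readers curious how such markers can work, here is a minimal sketch of decorators that attach a `__stability__` attribute. It is an assumption about the general mechanism, not the selectools source: `_marker`, `ToyAgent`, and `toy_compose` are hypothetical names used only for illustration.

```python
from typing import Callable, TypeVar

T = TypeVar("T")


def _marker(level: str) -> Callable[[T], T]:
    """Build a decorator that tags a class or function with a stability level."""

    def decorate(obj: T) -> T:
        obj.__stability__ = level  # plain attribute, readable at runtime
        return obj

    return decorate


# Hypothetical equivalents of the @stable and @beta markers.
stable = _marker("stable")
beta = _marker("beta")


@stable
class ToyAgent:
    pass


@beta
def toy_compose(*steps):
    return steps


print(ToyAgent.__stability__)
print(toy_compose.__stability__)
```

One consequence of storing the level as a class attribute is that subclasses inherit it unless they are decorated again, which a CI gate over public symbols would need to account for.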

v0.20.0: Visual Agent Builder ✅

The headline feature: a zero-install web UI for designing, testing, and exporting agent configurations. Served by selectools serve --builder: no separate app, no subscription, no desktop install required.

Why a dedicated release: LangGraph Studio is a paid desktop app. AutoGen Studio is a separate project. selectools ships a full visual builder in one command. This deserves its own announcement.

┌─────────────────────────────────────────────────────┐
│  Visual Agent Builder                    [Export]   │
├─────────────┬───────────────────────────────────────┤
│             │                                       │
│  Components │    ┌──────────┐    ┌──────────┐       │
│  ─────────  │    │ Planner  │───▶│  Writer  │       │
│  ☐ Agent    │    └──────────┘    └────┬─────┘       │
│  ☐ Tool     │                         │             │
│  ☐ Router   │                    ┌────▼─────┐       │
│  ☐ Gate     │                    │ Reviewer │       │
│  ☐ Parallel │                    └──────────┘       │
│             │                                       │
├─────────────┼───────────────────────────────────────┤
│  Properties │    Model: gpt-4o  │ Tools: 3          │
│  ─────────  │    Strategy: plan │ Budget: $0.50     │
│  Name: ...  │                                       │
│  Model: ... │    [▶ Test Run]   [💾 Save YAML]      │
└─────────────┴───────────────────────────────────────┘

Features:

  • Drag-and-drop graph builder for AgentGraph topologies
  • Node palette: Agent, Tool, Router (conditional), Gate (HITL), Parallel group
  • Visual edge wiring with routing condition editor
  • Per-node configuration panel (model, tools, system prompt, budget)
  • Live test: run the graph against real providers from the UI
  • Export: generates agent.yaml or Python code
  • Import: load existing YAML configs into the builder
  • Served by selectools: selectools serve --builder (zero frontend deps)
  • Built as self-contained HTML/JS (same pattern as playground.py)

Technical approach:

  • Single HTML file with embedded JS (no React, no build step)
  • Canvas-based graph rendering (or SVG with drag handlers)
  • Backend: new /builder endpoint on AgentServer
    • GET /builder: serves the HTML
    • POST /builder/validate: validates graph structure
    • POST /builder/export: generates YAML or Python
    • POST /builder/run: executes the designed graph
  • State stored in browser localStorage (no server state)
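
As a rough idea of what a structural check behind POST /builder/validate might involve, the sketch below validates a graph payload in plain Python. The payload shape (`entry`, `nodes`, `edges` with `source`/`target`) and the function name `validate_graph` are assumptions made for illustration; the actual request format is not specified in this roadmap.

```python
from collections import deque
from typing import Any, Dict, List


def validate_graph(payload: Dict[str, Any]) -> List[str]:
    """Return human-readable problems; an empty list means the graph is valid."""
    errors: List[str] = []
    ids = [node["id"] for node in payload.get("nodes", [])]
    known = set(ids)
    if len(ids) != len(known):
        errors.append("duplicate node ids")

    entry = payload.get("entry")
    if entry not in known:
        errors.append(f"entry node {entry!r} is not defined")

    adjacency: Dict[str, List[str]] = {node_id: [] for node_id in known}
    for edge in payload.get("edges", []):
        src, dst = edge["source"], edge["target"]
        if src not in known or dst not in known:
            errors.append(f"edge {src!r} -> {dst!r} references an unknown node")
            continue
        adjacency[src].append(dst)

    # Breadth-first walk from the entry node to find unreachable nodes.
    if entry in known:
        seen, queue = {entry}, deque([entry])
        while queue:
            for nxt in adjacency[queue.popleft()]:
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
        for orphan in sorted(known - seen):
            errors.append(f"node {orphan!r} is unreachable from the entry node")
    return errors


graph = {
    "entry": "planner",
    "nodes": [{"id": "planner"}, {"id": "writer"}, {"id": "reviewer"}],
    "edges": [
        {"source": "planner", "target": "writer"},
        {"source": "writer", "target": "reviewer"},
    ],
}
print(validate_graph(graph))
```

Returning a list of messages rather than raising on the first problem lets a UI highlight every broken node or edge in one round trip.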

Why this matters:

  • LangGraph has LangGraph Studio (paid, desktop app)
  • CrewAI has no visual builder
  • AutoGen has AutoGen Studio (separate app)
  • selectools: zero-install, runs in browser, exports to YAML/Python

| Feature | Status | Impact | Effort |
| --- | --- | --- | --- |
| Graph canvas (drag-drop nodes + edges) | ✅ | High | Large |
| Node configuration panel | ✅ | High | Medium |
| YAML export/import | ✅ | High | Small |
| Python code export | ✅ | Medium | Small |
| Live test execution | ✅ | High | Medium |
| Self-contained HTML (no build step) | ✅ | High | Medium |

v0.20.1: Builder Polish + Starlette + GitHub Pages ✅

UI polish (20 features), _static/ architecture split, Starlette ASGI app, serverless mode (client-side AI/runs), GitHub Pages deployment, design system.

| Feature | Status | Impact | Effort |
| --- | --- | --- | --- |
| UI polish (20 features) | ✅ | High | Medium |
| _static/ architecture split | ✅ | Medium | Small |
| Starlette ASGI app | ✅ | High | Medium |
| Serverless mode (client-side AI/runs) | ✅ | High | Medium |
| GitHub Pages deployment | ✅ | High | Small |
| Design system | ✅ | Medium | Small |
| Eval badges on builder nodes | ✅ | Medium | Small |

v0.21.0: Connector Expansion + Multimodal + Observability ✅

Shipped: FAISS + Qdrant + pgvector vector stores, CSV/JSON/HTML/URL document loaders, Azure OpenAI provider, OpenTelemetry + Langfuse observers, multimodal ContentPart + image_message() across OpenAI/Anthropic/Gemini/Ollama, new code/search/github/db toolbox modules (9 tools). 5215 tests (95% coverage), 88 examples, 5 LLM providers, 7 vector stores, 152 models.

Close integration gaps, add multimodal support (images/audio), and ship enterprise-grade observability (OTel + Langfuse). Full spec: .private/07-v0.21.0-connector-expansion.md

Current Inventory

| Category | Count | Items |
| --- | --- | --- |
| Document Loaders | 4 | text, file, directory, PDF |
| Vector Stores | 4 | Memory, SQLite, Chroma, Pinecone |
| Embedding Providers | 4 | OpenAI, Anthropic/Voyage, Gemini, Cohere |
| LLM Providers | 5 | OpenAI, Anthropic, Gemini, Ollama, Fallback |
| Toolbox | 24 tools | file, web, data, datetime, text |
| Rerankers | 2 | Cohere, Jina |

New Document Loaders

Add to src/selectools/rag/loaders.py as new static methods on DocumentLoader. Refactor to loaders/ subpackage with __init__.py re-exporting everything to support SaaS loaders as separate files.

| Loader | Method | Dependencies | Complexity | Why it matters |
| --- | --- | --- | --- | --- |
| CSV | from_csv(path, content_columns, metadata_columns) | stdlib csv | Small | Most common structured data format |
| JSON/JSONL | from_json(path, text_field) / from_jsonl(...) | stdlib json | Small | Standard for API responses, logs, datasets |
| HTML | from_html(path_or_content, extract_text=True) | beautifulsoup4 (optional) | Small | Web scraping output, saved pages |
| URL | from_url(url, timeout=30) | requests + beautifulsoup4 | Small | Direct URL-to-document (2nd most requested after PDF) |
| Markdown w/ Frontmatter | from_markdown(path) | pyyaml (optional) | Small | Static sites, docs, wikis |
| Google Drive | from_google_drive(file_id, credentials) | google-api-python-client | Medium | Most-used enterprise doc platform |
| Notion | from_notion(page_id, api_key) | requests (existing) | Medium | 2nd most-requested SaaS loader |
| GitHub | from_github(repo, path, branch, token) | requests (existing) | Small | Developer docs and code |
| SQL Database | from_sql(connection_string, query) | sqlalchemy (optional) | Medium | Enterprise data in databases |
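
To make the CSV row concrete, here is a sketch of a loader with the from_csv(path, content_columns, metadata_columns) signature from the table, built only on the standard library. The `Document` dataclass, the "column: value" text layout, and the `source`/`row` metadata keys are assumptions for illustration, not the library's actual types or output format.

```python
import csv
import os
import tempfile
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Document:  # hypothetical stand-in for the library's document type
    text: str
    metadata: Dict[str, object] = field(default_factory=dict)


def from_csv(
    path: str,
    content_columns: List[str],
    metadata_columns: Optional[List[str]] = None,
) -> List[Document]:
    """Turn each CSV row into one Document."""
    docs = []
    with open(path, newline="", encoding="utf-8") as handle:
        for row_number, row in enumerate(csv.DictReader(handle)):
            # Content columns become the searchable text, one "name: value" per line.
            text = "\n".join(f"{col}: {row[col]}" for col in content_columns)
            metadata = {col: row[col] for col in metadata_columns or []}
            metadata.update({"source": path, "row": row_number})
            docs.append(Document(text=text, metadata=metadata))
    return docs


# Usage: write a two-row CSV to a temporary file, load it, then clean up.
with tempfile.NamedTemporaryFile(
    "w", suffix=".csv", delete=False, newline="", encoding="utf-8"
) as tmp:
    tmp.write("title,body,author\nIntro,Hello world,ana\nSetup,pip install,ben\n")
docs = from_csv(tmp.name, content_columns=["title", "body"], metadata_columns=["author"])
os.unlink(tmp.name)
print(docs[0].text)
```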

New Vector Stores

New files in src/selectools/rag/stores/. Each follows the same pattern as chroma.py: inherit VectorStore, implement add_documents, search, delete, clear, lazy-import the dependency. Register in VectorStore.create() factory.

| Store | File | Dependencies | Complexity | Why it matters |
| --- | --- | --- | --- | --- |
| FAISS | faiss.py | faiss-cpu | Medium | De facto standard for local high-perf vector search (millions of vectors) |
| Qdrant | qdrant.py | qdrant-client | Medium | Fastest-growing vector DB, excellent filtering, cloud + self-hosted |
| pgvector | pgvector.py | psycopg2-binary | Medium | Use existing PostgreSQL; no new database needed |
| Weaviate | weaviate.py | weaviate-client | Medium | Popular cloud vector DB with GraphQL API |
| Redis Vector | redis.py | redis (existing) | Medium | Leverages existing Redis connection from cache_redis.py |
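
The "inherit VectorStore, implement add_documents, search, delete, clear" pattern can be illustrated with a toy in-memory backend. This is a simplified sketch: the method signatures below (raw ids, vectors, and texts) are assumptions chosen to keep the example self-contained, and the real base class may take document and embedding objects instead.

```python
import math
from abc import ABC, abstractmethod


class VectorStore(ABC):  # hypothetical, simplified interface
    @abstractmethod
    def add_documents(self, ids, vectors, texts): ...

    @abstractmethod
    def search(self, query_vector, top_k=3): ...

    @abstractmethod
    def delete(self, ids): ...

    @abstractmethod
    def clear(self): ...


def _cosine(a, b):
    """Cosine similarity, returning 0.0 when either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class InMemoryStore(VectorStore):
    """Brute-force store; a real backend would lazy-import its client in __init__."""

    def __init__(self):
        self._rows = {}

    def add_documents(self, ids, vectors, texts):
        for doc_id, vector, text in zip(ids, vectors, texts):
            self._rows[doc_id] = (vector, text)

    def search(self, query_vector, top_k=3):
        scored = [
            (_cosine(query_vector, vector), doc_id, text)
            for doc_id, (vector, text) in self._rows.items()
        ]
        scored.sort(key=lambda item: item[0], reverse=True)
        return [(doc_id, text, score) for score, doc_id, text in scored[:top_k]]

    def delete(self, ids):
        for doc_id in ids:
            self._rows.pop(doc_id, None)  # ignore ids that are not stored

    def clear(self):
        self._rows.clear()


store = InMemoryStore()
store.add_documents(["a", "b"], [[1.0, 0.0], [0.0, 1.0]], ["alpha", "beta"])
print(store.search([0.9, 0.1], top_k=1))
```

Keeping the four methods abstract means a backend that forgets one fails at instantiation time rather than at the first query.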

New Toolbox Modules

New files in src/selectools/toolbox/. Follow @tool decorator pattern, register in get_all_tools() and get_tools_by_category().

| Module | Tools | Dependencies | Complexity | Why it matters |
| --- | --- | --- | --- | --- |
| code_tools.py | execute_python, execute_shell | stdlib subprocess | Medium | #1 most-used tool in agent frameworks |
| search_tools.py | google_search, duckduckgo_search | duckduckgo_search (optional) | Small-Medium | #2 most-used tool category |
| github_tools.py | create_issue, list_issues, create_pr, get_file_contents | requests (existing) | Medium | Developer workflow automation |
| db_tools.py | query_database, list_tables, describe_table | sqlalchemy (optional) | Medium | Enterprise data access |
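
The decorator-plus-registry pattern mentioned above can be sketched as follows. This is not the selectools `@tool` decorator, whose real signature is not shown in this roadmap: the `category` and `description` parameters, the `_REGISTRY` dict, and the two toy tools are all assumptions for illustration.

```python
from collections import defaultdict
from typing import Callable, Dict, List

_REGISTRY: Dict[str, List[Callable]] = defaultdict(list)  # hypothetical registry


def tool(category: str, description: str = ""):
    """Register a function as a tool under a category (illustrative only)."""

    def decorate(func: Callable) -> Callable:
        func.tool_category = category
        func.tool_description = description or (func.__doc__ or "").strip()
        _REGISTRY[category].append(func)
        return func  # the function stays directly callable

    return decorate


def get_all_tools() -> List[Callable]:
    return [func for funcs in _REGISTRY.values() for func in funcs]


def get_tools_by_category(category: str) -> List[Callable]:
    return list(_REGISTRY.get(category, []))


@tool(category="db")
def list_tables(schema: Dict[str, List[str]]) -> List[str]:
    """Return table names in sorted order."""
    return sorted(schema)


@tool(category="text", description="Count whitespace-separated words.")
def word_count(text: str) -> int:
    return len(text.split())


print([t.__name__ for t in get_all_tools()])
```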

Dependency Management

All new dependencies are optional and lazy-imported. Add to pyproject.toml:

[project.optional-dependencies]
rag = [
    # existing deps ...
    "beautifulsoup4>=4.12.0",
    "faiss-cpu>=1.7.0",
    "qdrant-client>=1.7.0",
    "psycopg2-binary>=2.9.0",
    "weaviate-client>=4.0.0",
]

Individual stores/loaders remain installable a la carte: pip install selectools faiss-cpu works without the full [rag] group.
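
A common way to keep dependencies optional and lazy-imported is to defer the import until the feature is first used and raise an actionable error if the package is missing. The sketch below shows that idea with `importlib`; the helper name `require` and the class `FaissBackedStore` are hypothetical, not part of selectools.

```python
import importlib
from types import ModuleType


def require(module_name: str, package: str) -> ModuleType:
    """Import an optional dependency on first use, or explain how to install it."""
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        raise ImportError(
            f"{module_name!r} is required for this feature. "
            f"Install it with: pip install {package}"
        ) from exc


class FaissBackedStore:  # hypothetical class, defined but not instantiated here
    def __init__(self):
        # The import runs at construction time, so importing the surrounding
        # module never fails just because faiss-cpu is not installed.
        self._faiss = require("faiss", "faiss-cpu")


# Demonstration with a stdlib module, which is always importable.
json_module = require("json", "json")
print(json_module.dumps({"ok": True}))
```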

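| Feature | Status | Impact | Effort |
| --- | --- | --- | --- |
| Multimodal messages | ✅ | High | Medium |
| OTel observer | ✅ | High | Medium |
| Azure OpenAI provider | ✅ | High | Small |
| Langfuse observer | ✅ | High | Small |
| FAISS Vector Store | ✅ | High | Small |
| Qdrant Vector Store | ✅ | Medium | Small |
| pgvector Store | ✅ | High | Small |
| CSV/JSON/HTML/URL Loaders | ✅ | High | Small |
| Code Execution Tools | ✅ | High | Medium |
| Web Search + GitHub Tools | ✅ | High | Small |
| Database Query Tools | ✅ | Medium | Small |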

Backlog (Unscheduled, Priority Ordered)

Research basis: Competitive analysis of Agno (39k stars), PraisonAI (6.9k stars), and Superagent (6.5k stars) conducted 2026-04-10.

Strategic thesis: selectools wins on depth (50 evals, 7 vector stores, graph orchestration, pattern agents). Close the breadth gap cheaply, own the "production-ready" narrative, adopt the emerging A2A standard.

Reorganized 2026-06-12 against the v0.26.0 codebase: 11 of the 13 original P0–P2 items have shipped. Shipped items moved to the ledger below; the open remainder is re-prioritized for the post-1.0 release train.


Shipped from this backlog

| Item | Shipped | Where |
| --- | --- | --- |
| Tool-call loop detection (3 detectors, two-tier response) | v0.22.0 | loop_detection.py, docs/modules/LOOP_DETECTION.md |
| Agentic memory: remember tool | v0.16.0 | toolbox/memory_tools.py (auto-injected with knowledge_memory) |
| Agent-as-API (production REST, auth, SSE) | v0.24.0 (#68) | serve/api.py |
| LiteLLM provider (100+ models) | v0.24.0 (#74) | providers/litellm_provider.py |
| Cost-optimized model router | v0.24.0 (#75) | providers/router.py |
| A2A protocol (server + client + agent cards) | v0.24.0 (#76) | a2a/ |
| Toolbox expansion to 48 tools (calculator, email, PDF, Slack, Notion, Linear) | v0.24.0 (#77) | toolbox/ |
| Tool result compression | v0.25.0 (#87) | ToolConfig.compress_results, agent/_tool_executor.py |
| Session history search (FTS across 4 backends) | v0.24.0 (#79) | sessions.py SessionStore.search() |
| Memory tiering / auto-promotion (UnifiedMemory, standalone) | v0.24.0 (#78) | unified_memory.py |
| Agent-level HITL / approval | v0.25.0 (#88) | ToolConfig.require_approval + approval_handler |
| Planning-as-config | v0.25.0 (#86) | AgentConfig.planning (PlanningConfig) |
| Agentic memory: recall tool | v0.27.0 (#109) | toolbox/memory_tools.py make_recall_tool (auto-injected with remember) |
| UnifiedMemory AgentConfig wiring | v0.27.0 (#111) | MemoryConfig(unified=True, ...), agent/core.py |
| Toolbox: Discord, S3, browser, image-gen (48 → 56 tools) | v0.27.0 (#110) | toolbox/{discord,s3,browser,image}_tools.py |
| Cache-rate cost support (OpenAI + Gemini) | v0.27.0 (#112) | pricing.calculate_cost_with_cached_input, cached_prompt_cost |
| Cron / scheduled agents | v0.27.0 (#113) | scheduler.py (AgentScheduler, cron, every) |
| Reasoning-as-tool | v0.27.0 (#114) | toolbox/reasoning_tools.py (make_reasoning_tools, ReasoningTools) |
| Episodic memory retention config | v0.27.0 (#111) | Delivered with the UnifiedMemory wiring: MemoryConfig.episodic_retention_days flows through and add_turn auto-prunes (tested: test_retention_pruning) |

Shipped 2026-06-13: the four v1.1 candidates (#109-#112) plus the top two Future/Watch items (cron #113, reasoning tools #114), folded into the v1.0 train. Episodic retention config (#111) needed no separate work: the UnifiedMemory wiring already exposed and auto-applied it. gemini-embedding-2 decision: GA/recommended-for-new; default stays gemini-embedding-001 (incompatible embedding space). See CHANGELOG.md [Unreleased].


Active Queue: needs a product decision before building

The autonomously-buildable, high-conviction items are now shipped. Each remaining item carries a scoping/product call (flagged), so these wait on John's direction rather than getting built blind.

| Item | Source | Decision needed | Effort |
| --- | --- | --- | --- |
| More DB backends: MongoDB (#116) + DynamoDB shipped; Firestore on demand | Agno | MongoDB + DynamoDB session stores shipped 2026-06-13. Firestore next only if there's demand (needs google-cloud-firestore). | Medium each |
| ML-based guard models | Superagent | Heuristic tier shipped: PromptInjectionGuardrail (2026-06-13) covers templated attacks with no model hosting. The model-based tier still needs the hosting decision: bundle a 0.6-4B model, optional extra, or external endpoint? | High |
| Multi-channel bot gateway | PraisonAI | Roadmap itself says "better as a separate package." In-repo module vs new package = a product/packaging call. | High |
| Learning system | Agno | Scope is vague (decision logging + preference tracking). Needs a concrete spec before it's buildable. | High |
| Shadow git checkpoints | PraisonAI | Only relevant if selectools moves toward coding-agent use cases, which is a direction call. | Medium |

Previously Planned (Retained)

| Feature | Notes | Target |
| --- | --- | --- |
| AWS Bedrock provider | Covered today via LiteLLM; native boto3 wrapper only if enterprise demand | Future |
| Durable execution / webhooks | Task queue, resume from checkpoint | Future |
| Code execution sandbox (Docker/E2B) | Sandboxed code execution for untrusted input | Future |
| Prompt registry / versioning | Version, A/B test, rollback prompts | Future |
| Time-travel debugging / state replay | Rewind, edit, replay from any checkpoint | v1.x |
| Voice / real-time audio agents | WebRTC, STT/TTS, sub-500ms latency | v1.x |
| Rate limiting & quotas | Per-tool and per-user quotas | Future |
| CRM & business tools | HubSpot, Salesforce integrations | Future |
| Niche loaders | Confluence, Jira, Discord, Docx | Future |
| Niche vector stores | Weaviate, Redis Vector, Milvus, OpenSearch, Lance | Future |
