Selectools Development Roadmap
June 15, 2026 Β· View on GitHub
An open-source project from NichevLabs.
Status Legend
- β Implemented - Merged and available in latest release
- π΅ In Progress - Actively being worked on
- π‘ Planned - Scheduled for implementation
- βΈοΈ Deferred - Postponed to later release
- β Cancelled - No longer planned
v0.17.0 β Eval Framework 39 evaluators β A/B testing β snapshots β regression β HTML/JUnit β CI β templates
v0.17.1 β MCP Client/Server MCPClient β mcp_tools() β MCPServer β MultiMCPClient β tool interop
v0.17.3 β Agent Runtime Controls Token budget β Cancellation β Cost attribution β Structured results β Approval gate β SimpleStepObserver
v0.17.4 β Agent Intelligence Token estimation β Model switching β Knowledge memory enhancement (4 store backends)
v0.17.5 β Bug Hunt & Async Guardrails 91 validated fixes (13 critical, 26 high, 52 medium+low) β Async guardrails β 40 regression tests β 5 new Common Pitfalls
v0.17.6 β Quick Wins ReAct/CoT reasoning strategies β Tool result caching β Python 3.9β3.13 CI matrix
v0.17.7 β Caching & Context Semantic caching β Prompt compression β Conversation branching (55 tests, 3 examples)
v0.18.0 β Multi-Agent Orchestration + Composable Pipelines AgentGraph β GraphState β Typed reducers β Resume-from-yield interrupts β Scatter fan-out β Checkpointing β SupervisorAgent β Graph visualization β Pipeline β @step β | operator β parallel() β branch()
v0.19.0 β Serve, Deploy & Complete Composition selectools serve CLI β Playground UI β YAML config β 5 agent templates β Structured AgentConfig β compose() β retry() / cache_step() β Type-safe step contracts β Streaming composition β pipeline.astream() β PostgresCheckpointStore β TraceStore (3 backends) β selectools doctor
v0.19.1 β Advanced Agent Patterns PlanAndExecute β ReflectiveAgent β Debate β TeamLead β 50+ evaluators
v0.19.2 β Enterprise Hardening Security audit β Stability markers (@stable/@beta/@deprecated) β Deprecation policy β Compatibility matrix β trace_to_html() waterfall viewer β SBOM β Property-based tests (Hypothesis) β Concurrency smoke suite β 5 production simulations β 3,344 tests, 76 examples
v0.19.3 β Stability Markers Applied to All Public APIs @stable on 60+ core symbols β @beta on 30+ orchestration/pipeline/patterns symbols β Full stability introspection via .stability on every exported class and function
v0.20.0 β Visual Agent Builder Zero-install web UI β Drag-drop graph builder β YAML/Python export β Self-contained HTML (no React, no CDN) β One command: selectools serve --builder
v0.20.1 β Builder Polish + Starlette + GitHub Pages UI polish (20 features) β _static/ architecture split β Starlette ASGI app β Serverless mode (client-side AI/runs) β GitHub Pages deployment β Design system
v0.21.0 β Connector Expansion + Multimodal + Observability FAISS β Qdrant β pgvector vector stores β Azure OpenAI provider β Multimodal messages (images, audio) β CSV/JSON/HTML/URL document loaders β OTel observer β Langfuse observer β Code execution, web search, GitHub, DB toolbox tools
v0.22.0 β Competitor-Informed Bug Fixes + Loop Detection + Ruff Tooling 38 bug fixes from 3 rounds mining Agno/PraisonAI/LangChain/LangGraph/ CrewAI/n8n/LlamaIndex/AutoGen/LiteLLM/Pydantic-AI/Haystack (~325k stars) β Loop detection (Repeat, Stall, PingPong) with RAISE / INJECT_MESSAGE policies β Dev tooling consolidated: Black + isort + flake8 β Ruff β 30-recipe cookbook expansion β 95 runnable examples
v0.23.0 β Supabase Sessions + Builder RAG SupabaseSessionStore β 4th SessionStore backend (JSON/SQLite/Redis/Supabase) β Visual builder: first-class Retriever (RAG) + Session Store node types β 7 vector-store backends in builder (memory/SQLite/Chroma/Pinecone/FAISS/Qdrant/pgvector) β Hybrid (BM25 + vector + RRF) + cross-encoder rerank toggles β New presets: Hybrid RAG, Multi-Tenant RAG (pgvector + Supabase session) β 8 post-ship code-gen fixes in builder (embedder class names, HybridSearcher params, etc.) β 96 runnable examples, 5332 tests total
v0.24.0 β Production Interop Agent-as-API (AgentAPI: REST + SSE + session CRUD + auth) β A2A protocol (Agent Card + JSON-RPC 2.0 server/client) β LiteLLMProvider (100+ models) β RouterProvider (cost-optimized tier routing) β Anthropic prompt caching β UnifiedMemory (conversation/knowledge/entity/episodic tiers) β Cross-session search on all 4 SessionStore backends β KnowledgeBackend (Supabase/Redis) β ToolResult base + Artifact side-channel β Deferred confirmation flow (selectools.pending) β Toolbox expansion: 15 new tools (33 β 48) β Gemini schema sanitization + flash-lite compat β 106 runnable examples, 5968 tests total
v0.25.0 β Hardening & v1.0 Prep Planning-as-config (AgentConfig(planning=...)) β Agent-level HITL (ToolConfig(require_approval=...)) β Tool result compression β Knowledge pre-save sanitizers β Pending intent hooks (pop_if_intent, tighten_ttl) β Stability marking sweep: 433 public symbols 100% marked (205 stable / 228 beta), 19 betaβstable promotions, stability on all 123 public modules, CI gate β Wart removal: clone_for_isolation() public, all reconciled (+11 exports), AgentConfig.hooks REMOVED (BREAKING) β Security audit published (docs/SECURITY_AUDIT.md) β 0.xβ1.0 migration guide β Compatibility matrix refresh β 111 runnable examples, 7268 tests total
v0.26.0 β Safety Patch & Verified Registry Confirm-parser negation veto (non-leading negation no longer fired destructive CONFIRM) β Model registry refresh: 152 β 115, every entry source-verified, opus-4-1 pricing corrected, retired-model constants REMOVED (BREAKING) β Cache-aware calculate_cost β A2A -32602 on malformed parts β Gemini embedding dimension constant 3072 β 111 runnable examples, 7420 tests total
v1.0.0 π‘ Stable Release (bake window β code-complete) API freeze β (warts removed in v0.25) β Stability markers on all modules β β Security audit published β β Compatibility matrix β β 0.xβ1.0 migration guide β β Deprecation policy β Remaining at tag time: drop Python 3.9 β PyPI classifier: Production/Stable
v0.19.1: Advanced Agent Patterns β
Higher-level agent architectures built on the v0.18.0 orchestration primitives. Closes the "Advanced patterns" competitive gap. Each pattern is a standalone class β they wire up the AgentGraph topology for you.
PlanAndExecute Agent
from selectools.patterns import PlanAndExecuteAgent
agent = PlanAndExecuteAgent(
planner=planner_agent,
executors={"research": researcher, "write": writer, "review": reviewer},
)
result = agent.run("Write a technical blog post about vector databases")
# Planner creates structured plan β executors handle each step β result aggregated
ReflectiveAgent
from selectools.patterns import ReflectiveAgent
agent = ReflectiveAgent(
actor=writer_agent,
critic=reviewer_agent,
max_reflections=3,
stop_condition="approved",
)
result = agent.run("Draft a press release")
# Actor produces draft β Critic evaluates β Actor revises β repeat until approved
Debate Pattern
from selectools.patterns import DebateAgent
agent = DebateAgent(
agents={"optimist": optimist_agent, "skeptic": skeptic_agent},
judge=judge_agent,
max_rounds=3,
)
result = agent.run("Should we adopt microservices?")
# Agents argue positions β Judge synthesizes final answer
TeamLead Pattern
from selectools.patterns import TeamLeadAgent
agent = TeamLeadAgent(
lead=lead_agent,
team={"analyst": analyst, "engineer": engineer, "writer": writer},
delegation_strategy="dynamic", # or "sequential", "parallel"
)
result = agent.run("Investigate and fix the billing discrepancy")
# Lead delegates tasks, reviews work, coordinates handoffs
Expanded Eval Suite (50 evaluators, up from 39)
11 new evaluators across two categories:
New deterministic (+8): ReadabilityEvaluator, AgentTrajectoryEvaluator, ToolEfficiencyEvaluator, SemanticSimilarityEvaluator, MultiTurnCoherenceEvaluator, JsonSchemaEvaluator, KeywordDensityEvaluator, ForbiddenWordsEvaluator
New LLM-as-judge (+4): FactConsistencyEvaluator, CustomRubricEvaluator, AnswerAttributionEvaluator, StepReasoningEvaluator
| Feature | Status | Impact | Effort |
|---|---|---|---|
| PlanAndExecute | β | High | Medium |
| ReflectiveAgent | β | High | Medium |
| Debate | β | Medium | Medium |
| TeamLead | β | Medium | Medium |
| 50 evaluators | β | High | Medium |
Quality Infrastructure
- Ralph loop β autonomous hunt-and-fix convergence system (
scripts/ralph_bug_hunt.sh,/ralph-bug-huntskill) - Bandit in CI β security scan job on every push
- Property-based tests β Hypothesis suite for structural invariants
- Thread-safety smoke suite β 10-thread Γ 20-op concurrency tests
- Production simulations β 16 integration tests covering memory pressure, provider failover, tool errors, concurrent load
v0.19.2: Enterprise Hardening β
Focus: Production readiness and developer trust signals before the Visual Agent Builder in v0.20.0.
Enterprise Hardening
| Feature | Status | Impact | Effort |
|---|---|---|---|
| Security audit (bandit + manual nosec review) | β | High | Medium |
Stability markers (@stable, @beta, @deprecated) | β | Medium | Small |
Deprecation policy (2-version window, docs/DEPRECATION_POLICY.md) | β | Medium | Small |
| Compatibility matrix (Python Γ provider SDK Γ optional deps) | β | Medium | Small |
SBOM (sbom.json via CycloneDX, published in repo) | β | Low | Small |
Enhanced trace viewer (trace_to_html() waterfall HTML) | β | High | Medium |
Quality Infrastructure
| Feature | Status | Impact | Effort |
|---|---|---|---|
| Property-based tests (Hypothesis) | β | High | Medium |
| Thread-safety smoke suite | β | High | Medium |
| Production simulations (5 new) | β | High | Medium |
v0.19.3: Stability Markers Applied β
Focus: Apply @stable and @beta markers to every public symbol in the library, completing the stability annotation work started in v0.19.2.
Stable APIs (60+ symbols)
Core types, providers, agent, memory, tools, evals, guardrails, sessions, knowledge, cache, cancellation, token estimation, analytics, audit β all marked @stable. Breaking changes to these require a major version bump.
Beta APIs (30+ symbols)
Orchestration (AgentGraph, SupervisorAgent), pipelines (Pipeline, @step, parallel, branch), patterns (PlanAndExecuteAgent, ReflectiveAgent, DebateAgent, TeamLeadAgent), and composition (compose) β marked @beta. These may change in a minor release.
Introspection
from selectools import Agent, AgentGraph, PlanAndExecuteAgent
print(Agent.__stability__) # "stable"
print(AgentGraph.__stability__) # "beta"
print(PlanAndExecuteAgent.__stability__) # "beta"
v0.20.0: Visual Agent Builder β
The headline feature: a zero-install web UI for designing, testing, and exporting agent configurations. Served by selectools serve --builder β no separate app, no subscription, no desktop install required.
Why a dedicated release: LangGraph Studio is a paid desktop app. AutoGen Studio is a separate project. selectools ships a full visual builder in one command. This deserves its own announcement.
A web-based UI for designing, testing, and exporting agent configurations. Zero-install β served by selectools serve --builder.
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β Visual Agent Builder [Export] β
βββββββββββββββ¬ββββββββββββββββββββββββββββββββββββββββ€
β β β
β Components β ββββββββββββ ββββββββββββ β
β βββββββββ β β Planner βββββΆβ Writer β β
β β Agent β ββββββββββββ ββββββ¬ββββββ β
β β Tool β β β
β β Router β ββββββΌββββββ β
β β Gate β β Reviewer β β
β β Parallel β ββββββββββββ β
β β β
βββββββββββββββΌββββββββββββββββββββββββββββββββββββββββ€
β Properties β Model: gpt-4o β Tools: 3 β
β βββββββββ β Strategy: plan β Budget: \$0.50 β
β Name: ... β β
β Model: ... β [βΆ Test Run] [πΎ Save YAML] β
βββββββββββββββ΄ββββββββββββββββββββββββββββββββββββββββ
Features:
- Drag-and-drop graph builder for AgentGraph topologies
- Node palette: Agent, Tool, Router (conditional), Gate (HITL), Parallel group
- Visual edge wiring with routing condition editor
- Per-node configuration panel (model, tools, system prompt, budget)
- Live test: run the graph against real providers from the UI
- Export: generates
agent.yamlor Python code - Import: load existing YAML configs into the builder
- Served by selectools:
selectools serve --builder(zero frontend deps) - Built as self-contained HTML/JS (same pattern as playground.py)
Technical approach:
- Single HTML file with embedded JS (no React, no build step)
- Canvas-based graph rendering (or SVG with drag handlers)
- Backend: new
/builderendpoint on AgentServerGET /builderβ serves the HTMLPOST /builder/validateβ validates graph structurePOST /builder/exportβ generates YAML or PythonPOST /builder/runβ executes the designed graph
- State stored in browser localStorage (no server state)
Why this matters:
- LangGraph has LangGraph Studio (paid, desktop app)
- CrewAI has no visual builder
- AutoGen has AutoGen Studio (separate app)
- selectools: zero-install, runs in browser, exports to YAML/Python
| Feature | Status | Impact | Effort |
|---|---|---|---|
| Graph canvas (drag-drop nodes + edges) | β | High | Large |
| Node configuration panel | β | High | Medium |
| YAML export/import | β | High | Small |
| Python code export | β | Medium | Small |
| Live test execution | β | High | Medium |
| Self-contained HTML (no build step) | β | High | Medium |
v0.20.1: Builder Polish + Starlette + GitHub Pages β
UI polish (20 features), _static/ architecture split, Starlette ASGI app, serverless mode (client-side AI/runs), GitHub Pages deployment, design system.
- Visual builder live at: https://selectools.dev/builder/
- Examples gallery: https://selectools.dev/examples/
- 4,612 tests (95% coverage), 76 examples, 50 evaluators, 152 models
| Feature | Status | Impact | Effort |
|---|---|---|---|
| UI polish (20 features) | β | High | Medium |
| _static/ architecture split | β | Medium | Small |
| Starlette ASGI app | β | High | Medium |
| Serverless mode (client-side AI/runs) | β | High | Medium |
| GitHub Pages deployment | β | High | Small |
| Design system | β | Medium | Small |
| Eval badges on builder nodes | β | Medium | Small |
v0.21.0: Connector Expansion + Multimodal + Observability β
Shipped: FAISS + Qdrant + pgvector vector stores, CSV/JSON/HTML/URL document loaders, Azure OpenAI provider, OpenTelemetry + Langfuse observers, multimodal ContentPart + image_message() across OpenAI/Anthropic/Gemini/Ollama, new code/search/github/db toolbox modules (9 tools). 5215 tests (95% coverage), 88 examples, 5 LLM providers, 7 vector stores, 152 models.
Close integration gaps, add multimodal support (images/audio), and ship enterprise-grade observability (OTel + Langfuse). Full spec: .private/07-v0.21.0-connector-expansion.md
Current Inventory
| Category | Count | Items |
|---|---|---|
| Document Loaders | 4 | text, file, directory, PDF |
| Vector Stores | 4 | Memory, SQLite, Chroma, Pinecone |
| Embedding Providers | 4 | OpenAI, Anthropic/Voyage, Gemini, Cohere |
| LLM Providers | 5 | OpenAI, Anthropic, Gemini, Ollama, Fallback |
| Toolbox | 24 tools | file, web, data, datetime, text |
| Rerankers | 2 | Cohere, Jina |
New Document Loaders
Add to src/selectools/rag/loaders.py as new static methods on DocumentLoader. Refactor to loaders/ subpackage with __init__.py re-exporting everything to support SaaS loaders as separate files.
| Loader | Method | Dependencies | Complexity | Why it matters |
|---|---|---|---|---|
| CSV | from_csv(path, content_columns, metadata_columns) | stdlib csv | Small | Most common structured data format |
| JSON/JSONL | from_json(path, text_field) / from_jsonl(...) | stdlib json | Small | Standard for API responses, logs, datasets |
| HTML | from_html(path_or_content, extract_text=True) | beautifulsoup4 (optional) | Small | Web scraping output, saved pages |
| URL | from_url(url, timeout=30) | requests + beautifulsoup4 | Small | Direct URL-to-document (2nd most requested after PDF) |
| Markdown w/ Frontmatter | from_markdown(path) | pyyaml (optional) | Small | Static sites, docs, wikis |
| Google Drive | from_google_drive(file_id, credentials) | google-api-python-client | Medium | Most-used enterprise doc platform |
| Notion | from_notion(page_id, api_key) | requests (existing) | Medium | 2nd most-requested SaaS loader |
| GitHub | from_github(repo, path, branch, token) | requests (existing) | Small | Developer docs and code |
| SQL Database | from_sql(connection_string, query) | sqlalchemy (optional) | Medium | Enterprise data in databases |
New Vector Stores
New files in src/selectools/rag/stores/. Each follows the same pattern as chroma.py: inherit VectorStore, implement add_documents, search, delete, clear, lazy-import the dependency. Register in VectorStore.create() factory.
| Store | File | Dependencies | Complexity | Why it matters |
|---|---|---|---|---|
| FAISS | faiss.py | faiss-cpu | Medium | De facto standard for local high-perf vector search (millions of vectors) |
| Qdrant | qdrant.py | qdrant-client | Medium | Fastest-growing vector DB, excellent filtering, cloud + self-hosted |
| pgvector | pgvector.py | psycopg2-binary | Medium | Use existing PostgreSQL β no new database needed |
| Weaviate | weaviate.py | weaviate-client | Medium | Popular cloud vector DB with GraphQL API |
| Redis Vector | redis.py | redis (existing) | Medium | Leverages existing Redis connection from cache_redis.py |
New Toolbox Modules
New files in src/selectools/toolbox/. Follow @tool decorator pattern, register in get_all_tools() and get_tools_by_category().
| Module | Tools | Dependencies | Complexity | Why it matters |
|---|---|---|---|---|
**code_tools.py | execute_python, execute_shell | stdlib subprocess | Medium | #1 most-used tool in agent frameworks |
**search_tools.py** | google_search, duckduckgo_search | duckduckgo_search (optional) | Small-Medium | #2 most-used tool category |
**github_tools.py** | create_issue, list_issues, create_pr, get_file_contents | requests (existing) | Medium | Developer workflow automation |
**db_tools.py** | query_database, list_tables, describe_table | sqlalchemy (optional) | Medium | Enterprise data access |
Dependency Management
All new dependencies are optional and lazy-imported. Add to pyproject.toml:
[project.optional-dependencies]
rag = [
# existing deps ...
"beautifulsoup4>=4.12.0",
"faiss-cpu>=1.7.0",
"qdrant-client>=1.7.0",
"psycopg2-binary>=2.9.0",
"weaviate-client>=4.0.0",
]
Individual stores/loaders remain installable a la carte: pip install selectools faiss-cpu works without the full [rag] group.
| Feature | Status | Impact | Effort |
|---|---|---|---|
| Multimodal messages | π‘ | High | Medium |
| OTel observer | π‘ | High | Medium |
| Azure OpenAI provider | π‘ | High | Small |
| Langfuse observer | π‘ | High | Small |
| FAISS Vector Store | π‘ | High | Small |
| Qdrant Vector Store | π‘ | Medium | Small |
| pgvector Store | π‘ | High | Small |
| CSV/JSON/HTML/URL Loaders | π‘ | High | Small |
| Code Execution Tools | π‘ | High | Medium |
| Web Search + GitHub Tools | π‘ | High | Small |
| Database Query Tools | π‘ | Medium | Small |
Backlog (Unscheduled β Priority Ordered)
Research basis: Competitive analysis of Agno (39k stars), PraisonAI (6.9k stars), and Superagent (6.5k stars) conducted 2026-04-10.
Strategic thesis: selectools wins on depth (50 evals, 7 vector stores, graph orchestration, pattern agents). Close the breadth gap cheaply, own the "production-ready" narrative, adopt the emerging A2A standard.
Reorganized 2026-06-12 against the v0.26.0 codebase: 11 of the 13 original P0βP2 items have shipped. Shipped items moved to the ledger below; the open remainder is re-prioritized for the post-1.0 release train.
Shipped from this backlog
| Item | Shipped | Where |
|---|---|---|
| Tool-call loop detection (3 detectors, two-tier response) | v0.22.0 | loop_detection.py, docs/modules/LOOP_DETECTION.md |
Agentic memory β remember tool | v0.16.0 | toolbox/memory_tools.py (auto-injected with knowledge_memory) |
| Agent-as-API (production REST, auth, SSE) | v0.24.0 (#68) | serve/api.py |
| LiteLLM provider (100+ models) | v0.24.0 (#74) | providers/litellm_provider.py |
| Cost-optimized model router | v0.24.0 (#75) | providers/router.py |
| A2A protocol (server + client + agent cards) | v0.24.0 (#76) | a2a/ |
| Toolbox expansion to 48 tools (calculator, email, PDF, Slack, Notion, Linear) | v0.24.0 (#77) | toolbox/ |
| Tool result compression | v0.25.0 (#87) | ToolConfig.compress_results, agent/_tool_executor.py |
| Session history search (FTS across 4 backends) | v0.24.0 (#79) | sessions.py SessionStore.search() |
Memory tiering / auto-promotion (UnifiedMemory, standalone) | v0.24.0 (#78) | unified_memory.py |
| Agent-level HITL / approval | v0.25.0 (#88) | ToolConfig.require_approval + approval_handler |
| Planning-as-config | v0.25.0 (#86) | AgentConfig.planning (PlanningConfig) |
Agentic memory β recall tool | v0.27.0 (#109) | toolbox/memory_tools.py make_recall_tool (auto-injected with remember) |
| UnifiedMemory AgentConfig wiring | v0.27.0 (#111) | MemoryConfig(unified=True, ...), agent/core.py |
| Toolbox: Discord, S3, browser, image-gen (48 β 56 tools) | v0.27.0 (#110) | toolbox/{discord,s3,browser,image}_tools.py |
| Cache-rate cost support (OpenAI + Gemini) | v0.27.0 (#112) | pricing.calculate_cost_with_cached_input, cached_prompt_cost |
| Cron / scheduled agents | v0.27.0 (#113) | scheduler.py (AgentScheduler, cron, every) |
| Reasoning-as-tool | v0.27.0 (#114) | toolbox/reasoning_tools.py (make_reasoning_tools, ReasoningTools) |
| Episodic memory retention config | v0.27.0 (#111) | Delivered with the UnifiedMemory wiring β MemoryConfig.episodic_retention_days flows through and add_turn auto-prunes (tested: test_retention_pruning) |
Shipped 2026-06-13: the four v1.1 candidates (#109-#112) plus the top two
Future/Watch items (cron #113, reasoning tools #114), folded into the v1.0
train. Episodic retention config (#111) needed no separate work β the
UnifiedMemory wiring already exposed and auto-applied it. gemini-embedding-2
decision: GA/recommended-for-new; default stays gemini-embedding-001
(incompatible embedding space). See CHANGELOG.md [Unreleased].
Active Queue β needs a product decision before building
The autonomously-buildable, high-conviction items are now shipped. Each remaining item carries a scoping/product call (flagged), so these wait on John's direction rather than getting built blind.
| Item | Source | Decision needed | Effort |
|---|---|---|---|
| More DB backends β MongoDB (#116) + DynamoDB shipped; Firestore on demand | Agno | MongoDB + DynamoDB session stores shipped 2026-06-13. Firestore next only if there's demand (needs google-cloud-firestore). | Medium each |
| ML-based guard models | Superagent | Heuristic tier shipped: PromptInjectionGuardrail (2026-06-13) covers templated attacks with no model hosting. The model-based tier still needs the hosting decision: bundle a 0.6-4B model, optional extra, or external endpoint? | High |
| Multi-channel bot gateway | PraisonAI | Roadmap itself says "better as a separate package." In-repo module vs new package = a product/packaging call. | High |
| Learning system | Agno | Scope is vague (decision logging + preference tracking). Needs a concrete spec before it's buildable. | High |
| Shadow git checkpoints | PraisonAI | Only relevant if selectools moves toward coding-agent use cases β a direction call. | Medium |
Previously Planned (Retained)
| Feature | Notes | Target |
|---|---|---|
| AWS Bedrock provider | Covered today via LiteLLM; native boto3 wrapper only if enterprise demand | Future |
| Durable execution / webhooks | Task queue, resume from checkpoint | Future |
| Code execution sandbox (Docker/E2B) | Sandboxed code execution for untrusted input | Future |
| Prompt registry / versioning | Version, A/B test, rollback prompts | Future |
| Time-travel debugging / state replay | Rewind, edit, replay from any checkpoint | v1.x |
| Voice / real-time audio agents | WebRTC, STT/TTS, sub-500ms latency | v1.x |
| Rate limiting & quotas | Per-tool and per-user quotas | Future |
| CRM & business tools | HubSpot, Salesforce integrations | Future |
| Niche loaders | Confluence, Jira, Discord, Docx | Future |
| Niche vector stores | Weaviate, Redis Vector, Milvus, OpenSearch, Lance | Future |
Previously Planned (Retained)
| Feature | Notes | Target |
|---|---|---|
| AWS Bedrock provider | Covered today via LiteLLM; native boto3 wrapper only if enterprise demand | Future |
| Durable execution / webhooks | Task queue, resume from checkpoint | Future |
| Code execution sandbox (Docker/E2B) | Sandboxed code execution for untrusted input | Future |
| Prompt registry / versioning | Version, A/B test, rollback prompts | Future |
| Time-travel debugging / state replay | Rewind, edit, replay from any checkpoint | v1.x |
| Voice / real-time audio agents | WebRTC, STT/TTS, sub-500ms latency | v1.x |
| Rate limiting & quotas | Per-tool and per-user quotas | Future |
| CRM & business tools | HubSpot, Salesforce integrations | Future |
| Niche loaders | Confluence, Jira, Discord, Docx | Future |
| Niche vector stores | Weaviate, Redis Vector, Milvus, OpenSearch, Lance | Future |