Qualixar OS

April 17, 2026 · View on GitHub

License: FSL-1.1 Tests: 2,936 passing TypeScript Node.js 22+ arXiv DOI

Qualixar OS

The Universal OS for AI Agents

One platform. Every model. Every framework. Every transport.

Qualixar OS is the operating system that runs AI agents — yours and everyone else's. It doesn't replace your agent framework. It powers it. Import agents from OpenClaw, NemoClaw, DeerFlow, CrewAI, LangGraph, or build native. Run them all through one dashboard, one config, one runtime.

What Makes This Unique

To our knowledge, no other agent platform combines all of these:

CapabilityWhy It Matters
Forge AI auto-designs agent teamsDescribe what you need in one sentence. Forge picks the agents, tools, topology, and budget. No manual team assembly.
Judge pipeline with consensusAdversarial quality assurance built on AgentAssert research (arXiv:2602.22302). Multi-judge consensus, few-shot calibration, reliability scoring.
13 topologies including HybridPII-safe local/cloud split. Your sensitive data stays on your machine while cloud handles the rest.
Native A2A protocolAgents talk to agents — internally and externally. Agent card discovery at /.well-known/agent-card. Among the first agent runtimes to support both MCP and A2A natively.
25 CLI commandsFull native CLI, not just a wrapper. Task execution, marketplace, diagnostics, config, server management — all from your terminal.
SLM-Lite memoryLearns from every task. 4-layer memory store powered by SuperLocalMemory — backed by 3 peer-reviewed papers (arXiv:2604.04514, arXiv:2603.14588, arXiv:2603.02240).
Real file outputAgents create runnable code on disk, not just text in a chat window. Universal Type-C protocol writes real artifacts.
Marketplace with one-click installBrowse, search, install tools. Forge auto-discovers new tools the moment you install them.
Universal compatibilityImport agents from OpenClaw, NemoClaw, DeerFlow, and GitAgent natively. LangChain, CrewAI, and AutoGen integrate via the HTTP API.

Demo

Qualixar OS Demo

Watch the full dashboard demo (1:48) — 24 tabs, task execution, topology selection, cost tracking, Forge AI, marketplace, and more.

Install in 30 Seconds

npx qualixar-os

That's it. Dashboard opens. Start chatting with AI, or design a multi-agent team.

# Or install globally
npm install -g qualixar-os

# Start the server with dashboard
qos serve --dashboard --port 3000

# Open your browser
open http://localhost:3000/dashboard/

Why an OS?

Agent frameworks give you building blocks. Qualixar OS gives you the complete runtime:

What You NeedFramework ApproachQualixar OS
Route to the right modelWrite custom logicBuilt-in (15 providers, cost/quality/latency routing)
Run multi-agent teamsBuild from scratch13 topologies, auto-designed by Forge AI
Quality assuranceHope for the bestJudge pipeline with consensus protocols (built on AgentAssert research)
Persistent memoryAdd a vector DB4-layer memory store powered by SuperLocalMemory (local, 3 papers)
Monitor costsCheck your bill laterReal-time budget tracking per agent
Manage toolsHardcode per agentMarketplace with categories, one-click install
DashboardBuild your own24 tabs, production-ready
SecurityDIYHardened sandbox (51 denied commands), RBAC, credential vault, Docker isolation
Human Review GateBuild your ownDashboard workflow for human approval of agent output, integrated with the 4-tier degradation engine
Token-level streamingVariesReal-time AsyncIterable streaming for all 6 LLM providers (Anthropic, OpenAI, Google, Ollama, Azure OpenAI, Bedrock)
File executionText onlyReal files on disk — agents create runnable code via Universal Type-C

The Soul — How Qualixar OS Works

Every task follows this pipeline. No shortcuts. No half-measures.

                          ┌──────────────────────────────────┐
                          │                                  │
  User Prompt ──→ Memory ──→ Forge (designs agent team)     │
                              │                              │
                              ▼                              │
                          Agents Execute                     │
                          (real files on disk)                │
                              │                              │
                              ▼                              │
                          Judge Evaluates                    │
                              │                              │
                    ┌─────────┴─────────┐                    │
                    │                   │                    │
                Approved            Rejected                 │
                    │                   │                    │
                    ▼                   ▼                    │
                Output           Forge Redesigns ────────────┘
                (done)           (new team, retry)
                                 Up to 5 rounds
  1. Memory — Recalls relevant context from previous tasks
  2. Forge — AI meta-agent designs the optimal team (roles, models, tools, topology)
  3. Agents — Execute the task, creating real files in the workspace directory
  4. Judge — Adversarial evaluation with consensus protocols
  5. If rejected — Forge redesigns the team and agents retry with the judge's feedback
  6. If approved — Output saved, task complete

The judge doesn't just say pass/fail. It provides structured feedback — specific issues, severity ratings, improvement suggestions. The next team gets this feedback in their prompt. Each redesign cycle gets smarter.

Safeguards: 5 redesign max, 3x budget cap, human escalation if stuck.

Universal Compatibility — The Claw Bridge

Qualixar OS doesn't compete with agent frameworks. It runs them.

# Import agents from any framework
qos import ./my-openclaw-agent.yaml       # OpenClaw
qos import ./nemoclaw-config.json          # NemoClaw (NVIDIA)
qos import ./deerflow-workflow.json         # DeerFlow (ByteDance)
qos import ./gitagent-spec.yaml             # GitAgent (Microsoft)

# Or use native Qualixar agents — designed by Forge AI
qos run "Build me a code review pipeline"
# → Forge auto-designs a team of 4 agents with the right tools
FrameworkImportRunManage from Dashboard
OpenClawYesYesYes
NemoClaw (NVIDIA)YesYesYes
DeerFlow (ByteDance)YesYesYes
GitAgent (Microsoft)YesYesYes
LangChain / LangGraphVia HTTP API (guide)YesYes
CrewAIVia HTTP API (guide)YesYes
AutoGenVia HTTP API (guide)YesYes
MCP ServersNativeYesYes
A2A ProtocolNativeYesYes
Custom Agentsskill.json manifestYesYes

Protocols — MCP + A2A

Qualixar OS speaks two standard protocols natively. No plugins, no adapters — built into the core.

MCP (Model Context Protocol)

25 MCP tools expose the full Qualixar OS API to any MCP-compatible client. Your IDE becomes the dashboard.

ClientIntegration
Claude CodeAuto-discovered via ~/.claude.json config
VS CodeMCP extension with tool palette
CursorNative MCP support
WindsurfNative MCP support
# Add Qualixar OS as an MCP server in your IDE
qos serve --mcp
# → 25 tools available: task execution, agent management, marketplace, memory, cost tracking

A2A (Agent-to-Agent Protocol)

v0.3 implementation with agent card discovery. Internal agents communicate via A2A. External agents federate via A2A. Among the first agent runtimes to support both MCP and A2A natively.

GET /.well-known/agent-card
→ Returns capabilities, supported tasks, authentication methods

POST /a2a/tasks
→ External agents submit tasks directly to Qualixar OS agents

What this enables: Your Qualixar OS agents can discover and collaborate with any A2A-compatible agent on the network — Google's agent ecosystem, other Qualixar OS instances, or custom A2A servers.

Memory System — SLM-Lite

Powered by SuperLocalMemory Lite — the same cognitive memory engine backed by 3 peer-reviewed papers (V3.3, V3, V2). Every task makes Qualixar OS smarter.

LayerWhat It Does
EpisodicStores task history — what was asked, what worked, what failed
SemanticStores knowledge from task outputs with full-text search (FTS5)
ProceduralLearns patterns — which topologies work best for which task types
BehavioralCaptures outcomes — Forge uses this to design better teams over time

Auto-invoke retrieves relevant context before every task. No manual search needed — agents start with the right context.

Compatible with full SuperLocalMemory for cross-session, cross-product memory sharing.

The Dashboard — Everything in Your Browser

24 interactive tabs. No CLI required for daily use.

TabWhat It Does
ChatTalk to AI with streaming, file upload, model selection
ForgeAI auto-designs specialized agent teams from your prompt
BuilderVisual workflow editor — drag, connect, run
MarketplaceBrowse, search, install skills and tools with one click
AgentsMonitor running agents, view stats, kill/restart
SwarmsWatch multi-agent execution in real-time
JudgesQuality verdicts, consensus scores, approval/rejection
CostPer-model, per-agent, per-task cost breakdown
MemoryPersistent agent memory — search, inspect, manage
PipelinesMulti-step task orchestration with checkpoints
ToolsTool palette with 6 categories, drag onto agents
TracesOpenTelemetry spans for every agent action
SettingsProviders, models, budget, security, connectors
...and 11 moreFlows, Connectors, Logs, Gate, Datasets, Vectors, Blueprints, Brain, Audit, Lab, Marketplace

Forge — AI Designs Your Agent Teams

Tell Forge what you need. It designs the team.

qos run "Review my GitHub PRs for security issues"

Forge responds:

  • Creates 3 agents: Security Analyst, Code Reviewer, Report Writer
  • Assigns tools: github_pr_read, code_search, file_write
  • Picks topology: Sequential (analyze → review → report)
  • Sets budget: $0.05 estimated cost
  • Runs the team. Returns results.

No manual configuration. No YAML files. One sentence, full team.

Marketplace — One-Click Tool Installation

Browse the global skill registry from your dashboard. Install with one click. Forge auto-discovers new tools.

Dashboard → Marketplace → Search "GitHub" → Install → Done
Your agents now have GitHub tools. Forge uses them automatically.

6 Tool Categories:

CategoryTools
Web & DataWeb search, crawl, scrape, RSS, API connectors
Code & DevGitHub, file I/O, shell, linter, test runner
CommunicationSlack, email, Discord, webhook
KnowledgeVector search, document reader, database, RAG
CreativeImage gen, video gen, TTS, diagrams
EnterpriseCRM, project management, analytics, cloud

18 built-in plugins (agents, skills, tools, and topologies) ship with the product. Community plugins available via the global registry.

Self-Evolving Skills (Roadmap)

Planned capabilities for a future release. Not yet implemented.

CapabilityWhat It Will Do
SkillEvolverJudge verdict triggers automatic skill mutation — failed skills evolve into better versions
Blind VerificationInformation-isolated verification — generator and verifier can't see each other's work, eliminating confirmation bias
Skill LineageFull version history — trace how any skill evolved, what triggered each mutation, rollback to any ancestor
Living MarketplaceEvolved skills auto-publish to marketplace with lineage metadata. Community skills improve across all users

13 Execution Topologies

TopologyPatternUse Case
SequentialA → B → CPipelines, step-by-step tasks
ParallelA ‖ B ‖ C → mergeIndependent analysis, speed
HierarchicalBoss → WorkersTask decomposition
DAGDirected graphComplex dependencies
DebatePro vs Con → JudgeQuality decisions
MeshAll-to-allCollaboration
StarHub ↔ SpokesCentral coordinator
GridMatrix communicationStructured teams
ForestMultiple treesParallel hierarchies
CircularRound-robinIterative refinement
Mixture-of-AgentsEnsemble → SynthesizeBest-of-N outputs
MakerBuild → Test → ShipEngineering workflows
HybridLocal ↔ Cloud splitPII-safe cloud offloading

Claude Management Integration

Qualixar OS ships with first-class support for Claude's agent ecosystem — managed agents, agent teams, and quality gate hooks.

Hybrid Topology (13th topology)

Splits agent execution between local and cloud. A 7-phase algorithm with keyword + regex PII detection routes sensitive work to local models and offloads the rest to cloud providers. To our knowledge, among the first agent runtimes with built-in PII-aware local/cloud routing.

qos run "Analyze customer data" --topology hybrid
# → PII detected in 2 fields → routed to local Ollama
# → Summary + report → routed to Claude cloud

Claude Managed Agents Adapter

Python and TypeScript adapter for Claude's Managed Agents API. SSE streaming, dual cost tracking (local + cloud), drop-in provider integration.

providers:
  claude-managed:
    type: claude-managed
    api_key_env: ANTHROPIC_API_KEY

Claude Agent Teams Subagents

6 pre-built agent definitions optimized for Claude Code Agent Teams:

AgentModelRole
qos-forge-architectopusTeam design and orchestration
qos-code-reviewersonnetMulti-judge code review
qos-cost-optimizerhaikuBudget-aware model routing
qos-topology-designersonnetTopology selection for tasks
qos-research-agentopusDeep research with full arsenal
qos-quality-judgeopusQuality gating and consensus

Quality Gate Hooks

3 hooks for Claude Code Agent Teams that enforce quality at every stage:

  • TeammateIdle — detects idle teammates and reassigns or escalates
  • TaskCreated — validates task structure, budget, and topology before execution
  • TaskCompleted — runs judge pipeline on output before marking done

Agent Teams Skill

Interactive design guide for assembling optimal agent teams. Invoke with:

/agent-teams

Walks through team composition, model selection, topology, and budget — then generates the full configuration.

Configuration

Zero-config start: Qualixar OS works out of the box with Ollama (free, local, private).

# 1. Install Ollama (if you haven't already)
brew install ollama && ollama pull llama3.1

# 2. Start Qualixar OS — it auto-detects Ollama
npx qualixar-os

That's it. No API keys, no YAML editing, no cloud accounts. Your data stays on your machine.

Add a cloud provider when you want more power:

  1. Open the Dashboard → Settings tab → Providers
  2. Click Add Provider → pick OpenRouter, Anthropic, OpenAI, Azure, or any of the 15 supported providers
  3. Paste your API key → Save

All provider configuration happens in the Settings UI. No config files to edit.

15 providers supported: Ollama, OpenRouter, Anthropic, OpenAI, Azure OpenAI, Google AI, AWS Bedrock, Groq, Together AI, Fireworks AI, Mistral AI, Cohere, DeepSeek, Cerebras, and custom OpenAI-compatible endpoints.

CLI — 25 Commands

Full native CLI. Every feature accessible from your terminal.

CategoryCommands
Serverqos serve, qos dashboard, qos mcp
Tasksqos run <prompt>, qos status <id>, qos output <id>, qos pause <id>, qos resume <id>, qos cancel <id>
Designqos forge [taskType], qos models
Agentsqos agents [taskId], qos judges [taskId]
Import/Exportqos import <path>, qos export <agentId>
Configqos config [key] [value]
Memoryqos memory <query>
Diagnosticsqos cost [taskId], qos doctor, qos version
Setupqos init, qos new <project>
Universal Commandsqos cmd <command>, qos cmd-list, qos dispatch <command>
# Quick examples
qos serve --dashboard --port 3000        # Start server + dashboard
qos run "Build a REST API for todos"    # Forge designs team, agents execute
qos dashboard                            # Open the web dashboard
qos cmd marketplace.install -i '{"id":"github-tools"}'  # Install a tool
qos cost                                 # See what you've spent

Full reference: docs/cli/overview.md

API

27+ REST endpoints. Full OpenAPI documentation at /api/docs.

MethodPathDescription
POST/api/tasksSubmit a task (Forge auto-designs team)
GET/api/tasksList tasks with status
GET/api/agentsActive agents
GET/api/costCost breakdown
GET/api/toolsTool catalog with categories
GET/api/skill-store/browseBrowse marketplace
POST/api/chat/conversations/:id/messagesChat (streams via WebSocket)
GET/.well-known/agent-cardA2A discovery

Tech Stack

LayerTechnology
RuntimeNode.js 22+ (ESM)
LanguageTypeScript 5.7
HTTPHono
Databasebetter-sqlite3 (49 tables)
DashboardReact 19 + Vite
AI SDKsAnthropic (incl. Managed Agents), OpenAI, Azure, Ollama
TestingVitest (2,936 tests, 213 files)
ProtocolsMCP, A2A, HTTP, WebSocket, CLI

Development

git clone https://github.com/qualixar/qualixar-os.git
cd qualixar-os
npm install
npm run build
npm test              # 2,936 tests
npm run typecheck     # 0 errors

Research Ecosystem — 7 Peer-Reviewed Papers

Qualixar OS is not a standalone project. Every major component is backed by peer-reviewed research. To our knowledge, no other agent runtime ships with this level of academic rigor across its subsystems.

How the Research Maps to the Product

QOS ComponentPowered ByResearch
Judge PipelineAgentAssert — contract-based reliability testing, LLM-as-Judge evaluation, SPRT statistical certification, reliability index ThetaarXiv:2602.22302
SLM-Lite MemorySuperLocalMemory — 4-layer cognitive memory with information-geometric retrieval and privacy-preserving local storagearXiv:2604.04514, arXiv:2603.14588, arXiv:2603.02240
Quality EvaluationAgentAssay — standardized evaluation adapters for 10+ agent frameworksarXiv:2603.02601
Skill VerificationSkillFortify — 22 testing frameworks for agent skills, 100% precisionarXiv:2603.00195
Forge AI + TopologiesQualixar OS native — POMDP-based team design, 13 formally specified topologiesarXiv:2604.06392

All 7 Papers

#PaperFocusarXiv
1Qualixar OS — A Protocol-Unified Operating System for AI Agent OrchestrationTopology semantics, Forge AI, agent runtime architecture2604.06392
2SuperLocalMemory V3.3 — The Living BrainUnified memory + learning, pattern extraction, behavioral assertions2604.04514
3SuperLocalMemory V3 — Information-Geometric MemoryFisher-Rao distance, multi-channel retrieval, graph-enhanced recall2603.14588
4SuperLocalMemory V2 — Privacy-Preserving Local MemoryLocal-first cognitive memory, SQLite, semantic indexing2603.02240
5AgentAssert — Contract-Based Reliability Testing for AI AgentsContractSpec DSL, drift detection, SPRT certification, LLM-as-Judge2602.22302
6AgentAssay — Standardized Agent Evaluation10 framework adapters, unified evaluation protocol2603.02601
7SkillFortify — Testing Frameworks for Agent Skills22 test frameworks, mutation testing, 100% precision2603.00195

Cite

If you use Qualixar OS in your research, please cite:

@article{bhardwaj2026qualixaros,
  title={Qualixar OS: A Protocol-Unified Operating System for AI Agent Orchestration},
  author={Bhardwaj, Varun Pratap},
  year={2026},
  eprint={2604.06392},
  archivePrefix={arXiv},
  primaryClass={cs.AI},
  doi={10.5281/zenodo.19454219}
}

Documentation

TopicPath
Getting Starteddocs/getting-started.md
Dashboard (24 tabs)docs/dashboard/
CLI Reference (25 commands)docs/cli/
Protocols (MCP + A2A)docs/protocols/
Providers (15 providers)docs/providers/
Frameworks (4 native readers + adapters)docs/frameworks/
Claude CLI Integrationdocs/claude-cli/
IDE Integrationdocs/ide-integration/
Memory Systemdocs/memory/
Guidesdocs/guides/
API Referencedocs/reference/

License

Functional Source License, Version 1.1, ALv2 Future License (FSL-1.1-ALv2). See LICENSE.

This means you can use, modify, and redistribute Qualixar OS for any purpose except creating a competing product or service. After two years from each release, the code converts to Apache License 2.0.

For commercial licensing, see COMMERCIAL-LICENSE.md or contact varun.pratap.bhardwaj@gmail.com.

Copyright (c) 2026 Varun Pratap Bhardwaj / Qualixar.

Built By

Qualixar — AI Agent Reliability Engineering.

7 published papers. 6 live products. One mission: make AI agents trustworthy.

ProductWhat It DoesLinks
Qualixar OSAgent operating system — this repoGitHub · npm
SuperLocalMemoryCognitive memory for AI agentsGitHub · npm · superlocalmemory.com
AgentAssertContract-based reliability testingGitHub · PyPI · agentassert.com
AgentAssayStandardized agent evaluationGitHub · PyPI
SkillFortifyAgent skill testing frameworksGitHub · PyPI
SLM MeshP2P agent communicationGitHub · npm

⭐ Support This Project

If this project solves a real problem for you, please star the repo — it helps other developers discover Qualixar and signals that the AI agent reliability community is growing. Every star matters.

Star History Chart


Part of the Qualixar AI Agent Reliability Platform

Qualixar is building the open-source infrastructure for AI agent reliability engineering. Seven products, seven peer-reviewed papers, one coherent platform. Each tool solves one reliability pillar:

ProductPurposeInstallPaper
SuperLocalMemoryPersistent memory + learning for AI agentsnpx superlocalmemoryarXiv:2604.04514
Qualixar OSUniversal agent runtime (13 execution topologies)npx qualixar-osarXiv:2604.06392
SLM MeshP2P coordination across AI agent sessionsnpm i slm-mesh
SLM MCP HubFederate 430+ MCP tools through one gatewaypip install slm-mcp-hub
AgentAssayToken-efficient AI agent testingpip install agentassayarXiv:2603.02601
AgentAssertBehavioral contracts + drift detectionpip install agentassert-abcarXiv:2602.22302
SkillFortifyFormal verification for AI agent skillspip install skillfortifyarXiv:2603.00195

Zero cloud dependency. Local-first. EU AI Act compliant.

Start here → qualixar.com · All papers on Qualixar HuggingFace