README.md
June 11, 2026 Β· View on GitHub
Cognitive-Physical Decoupling β A Session-Centered Runtime for Embodied Intelligence
π’ Changelog
| Version | Date | Update |
|---|---|---|
| 2026-06-11 | Cleaned protocol files and docs; game scenario separated to general-game-agent branch; main branch now focused on sim & real | |
| 2026-06-5 | Optimize the user-friendly onboarding process; Communication Protocol Specification; More reasonable coding standards; Game Agent & Benchmarking ready | |
| 2026-05-25 | Strict separation of PolicySkillRuntime / BuiltinSkillRuntime; Game Agent & Benchmarking ready | |
| 2026-05-20 | Perception plugin system: SensorConfig / PerceptionConfig YAML + EnvironmentWriter auditable writeback | |
| 2026-05-18 | Session-Centered Runtime MVP: DummySimTarget + DummyAdapter + DummyClient serial pipeline | |
| 2026-04-29 | Hackathon baseline: plugin-based HAL, ReKep / SAM3 real-robot grasping & VLN full pipeline |
π€ Why PhyAgentOS?
Traditional "LLM-direct-to-hardware" approaches tightly couple reasoning to execution β switching robots means rewriting the entire pipeline. PhyAgentOS changes this through Cognitive-Physical Decoupling + Session-Centered Runtime:
| π | One Codebase, Any Hardware β Adding a new robot means implementing one Target Adapter (~100 lines); zero changes to the scheduling layer. |
| π‘οΈ | Three Safety Layers β Critic validation β Strict Preflight β Target-side SafetyGuard; mandatory for real-robot deployment. |
| π | Fully Auditable β State, actions, and perception results are written to Markdown + YAML files; every step is traceable and reproducible. |
| π | Zero-Friction Migration β The same Session protocol runs identically across sim and real targets. |
β² Session-Centered Runtime Architecture Overview
β¨ Core Features
| π | Session-Centered Runtime | WatchdogSupervisor β SessionRunner β SkillRuntime β TargetSessionHandle execution pipeline, replacing the legacy Driver-Center architecture |
| π― | Target-Configured | Three target kinds β debug / simulation / real_robot β registered in TARGETS.md, adapters attached on demand |
| π§© | Adapter + Bridge | TargetAdapter + PolicyAdapter + ActionBridge three-way decoupling with explicit observation/action contracts; AdapterPlan auto-composed, eliminating targetΓskill combinatorial explosion |
| β‘ | Dual Skill Runtimes | PolicySkillRuntime maintains policy closed-loop + BuiltinSkillRuntime manages agent interactive loop |
| π‘οΈ | Strict Preflight | Runtime validation checks (target / sensor / perception / adapter contract / action contract / tool); failures are rejected before execution starts |
| π | File Protocol Matrix | TARGETS.md Β· SKILLRUNTIME.md Β· SESSIONS.md Β· ENVIRONMENT.md Β· LESSONS.md + external YAML configs |
| π | Multi-Layer Safety | Critic validation β Preflight contract checks β Target-side SafetyGuard β Operator Override |
| π | Fleet Mode | Multi-robot coordination with shared + per-robot workspaces, priority-based serial scheduling |
π 5-Minute Quick Start
| 1 |
Install
|
| 2 |
Initialize Workspace
|
| 3 |
Start Agent
|
| 4 |
Optional: Connect Runtime Services
|
paos agent and paos gateway create the runtime workspace and start the
session watchdog automatically when runtime is enabled in config. Runtime
targets are declared in TARGETS.md, executable runtimes in SKILLRUNTIME.md,
and the Agent queues work by appending sessions to SESSIONS.md.
paos agent -m "run the configured LIBERO benchmark task"
ποΈ Protocol Files
| Context Loading | File | Owner | Purpose |
|---|---|---|---|
| Always loaded into the agent system prompt | AGENTS.md | Agent workspace | Project-level operating rules for the agent |
| Always loaded into the agent system prompt | SOUL.md | Agent workspace | Identity, high-level behavior, and assistant style |
| Always loaded into the agent system prompt | USER.md | Agent workspace | User preferences and durable profile notes |
| Always loaded into the agent system prompt | TOOLS.md | Agent workspace | Tool usage policy and available tool guidance |
| Always loaded into the agent system prompt | SKILLS.md | Agent workspace | Agent-facing skill discovery and loading rules |
| Loaded when present; filtered by enabled runtime targets where applicable | EMBODIED.md | Agent workspace | Human-readable target capability descriptions |
| Loaded when present as state, not bootstrap policy | ENVIRONMENT.md | Agent/runtime workspace | Current target and scene/environment state |
| Loaded when present as memory/state | LESSONS.md | Agent workspace | Operational lessons and failure notes |
| Loaded when present as task state | TASK.md | Agent workspace | Multi-step task decomposition and progress |
| Runtime protocol; read before scheduling sessions | RUNTIME.md | Runtime workspace | Instructions for writing valid runtime sessions |
| Runtime protocol; read before scheduling sessions | TARGETS.md | Runtime workspace | Enabled targets, endpoint/adapter/config references, supported skill runtimes |
| Runtime protocol; read before scheduling sessions | SKILLRUNTIME.md | Runtime workspace | Policy/builtin skill runtime registry and execution contracts |
| Runtime queue/state; written by Agent and watchdog | SESSIONS.md | Runtime workspace | Pending/running/completed execution sessions and results |
SKILLS.md is for agent capabilities and skill discovery. SKILLRUNTIME.md is
for runtime execution contracts; it is paired with TARGETS.md and SESSIONS.md.
π¦ Project Structure
PhyAgentOS/
β
βββ PhyAgentOS/agent/ # Track A β Planner / Critic / Memory
β
βββ PhyAgentOS/runtime/ # Track B β Execution Plane
β βββ watchdog/ # WatchdogSupervisor
β βββ sessions/ # SessionRunner / TargetSessionHandle
β βββ targets/ # RolloutTarget (debugΒ·simΒ·real)
β β βββ remote/libero/ # LIBERO benchmark TargetWS server + proxy
β βββ skillruntime/ # PolicySkillRuntime / BuiltinSkillRuntime
β βββ adapters/ # TargetAdapter / PolicyAdapter / Bridge
β β βββ libero/ # LIBERO target adapter
β β βββ openpi/ # OpenPI policy adapters
β βββ policy/openpi/ # OpenPI client + LeRobot pi0-family server
β βββ perception/ # Perception Runtime / EnvironmentWriter
β βββ preflight/ # RuntimeCompatibilityPreflight
β βββ schemas/ # Pydantic Schema
β
βββ configs/runtime/ # Sensor / Perception / Contract YAML
βββ scripts/ # Utility scripts
βββ workspace/ # Agent workspace; runtime files may share it by config
βββ docs/ # Documentation
βββ tests/ # Tests
π·οΈ Supported Targets
| Kind | Location | Examples | |
|---|---|---|---|
| π | debug | Local | echo / mock / dry-run β zero-hardware protocol pipeline validation |
| π§ͺ | simulation | Remote | RoboCasa, LIBERO β benchmark evaluation & batch experience mining |
| π€ | real_robot | Remote | Franka, Go2, XLeRobot, AgileX PIPER β real-world deployment |
All targets are registered in
TARGETS.md, identified bytarget_adapter://URI. More examples & demos β Project Website
π Documentation
| Document | Audience | Description |
|---|---|---|
| π Website | Everyone | Full docs, architecture details, demos |
| π User Manual | Users | Installation, deployment, and operation guide |
| π Dev Guide | Developers | Secondary development, hardware integration, plugin authoring |
π€ Contributing
PRs and Issues are welcome! Check our development roadmap here β Dev Plan.
Built on nanobot
Jointly developed by Sun Yat-sen University HCP Lab & Peng Cheng Laboratory
Β Β Β
Β Β Β
MIT License Β· Copyright Β© 2025-2026 PhyAgentOS