๐Ÿœ Phero

May 11, 2026 ยท View on GitHub

Phero

๐Ÿœ Phero

The chemical language of AI agents.

Phero is a modern Go framework for building multi-agent AI systems. Like ants in a colony, agents in Phero cooperate, communicate, and coordinate toward shared goals, each with specialized roles, working together through a clean, composable architecture.

Build Status GoDoc Go Report Card GitHub release

Why Phero?

  • ๐Ÿค Agent orchestration Multi-agent workflows with role specialization, coordination, and agent handoffs
  • ๐Ÿงฉ Composable primitives Small, focused packages that solve specific problems
  • ๐Ÿ”ง Tool-first design Built-in support for function tools, skills, RAG, and MCP
  • ๐ŸŽจ Developer-friendly Clean APIs, opt-in tracing, OpenAI-compatible + Anthropic support
  • ๐Ÿชถ Lightweight No heavy dependencies; just Go and your choice of LLM provider

Features

Core Capabilities

  • ๐Ÿค Agent orchestration Multi-agent workflows with role specialization, coordination, and runtime handoffs
  • ๐Ÿ”€ Agent handoffs Transfer control between agents at runtime; Result.HandoffAgent tells you where to route next
  • ๐ŸŒ A2A protocol Expose any agent as an HTTP A2A server, or call remote A2A agents as local tools
  • ๐Ÿ”€ NATS Agent Protocol Register agents as NATS micro services and discover/call them over pub/sub; wire-compatible with TypeScript and Python SDKs
  • ๐Ÿงฉ LLM abstraction Work with OpenAI-compatible endpoints (OpenAI, Ollama, etc.) and Anthropic
  • ๐Ÿ–ผ๏ธ Multimodal input Mix text and images with typed content parts (llm.Text, llm.ImageURL, llm.ImageFile)
  • ๐Ÿ”Š Audio I/O OpenAI backend supports speech-to-text and text-to-speech via llm.Transcriber and llm.SpeechSynthesizer
  • ๐Ÿงฑ LLM middleware Compose reusable cross-cutting behaviors around any backend with llm.Use(...)
  • ๐Ÿ› ๏ธ Function tools Expose Go functions as callable tools with automatic JSON Schema generation
  • ๐Ÿ“š RAG (Retrieval-Augmented Generation) Built-in vector storage and semantic search
  • ๐Ÿง  Skills system Define reusable agent capabilities in SKILL.md files
  • ๐Ÿ”Œ MCP support Integrate Model Context Protocol servers as agent tools
  • ๐Ÿงพ Memory management Conversational context storage for agents
  • ๐Ÿ” Tracing Typed lifecycle events with a colorized text tracer (trace/text) and an NDJSON file tracer (trace/jsonfile); per-run summary with token usage and latency breakdowns
  • ๐Ÿ›ก๏ธ Tool guardrails Bash tool blocklist, allowlist, timeout, and safe-mode options
  • โœ‚๏ธ Text splitting Recursive and Markdown-aware chunkers under textsplitter/recursive and textsplitter/markdown
  • ๐Ÿงฌ Embeddings Semantic search capabilities via OpenAI embeddings
  • ๐Ÿ—„๏ธ Vector stores Qdrant, PostgreSQL/pgvector, and Weaviate backends

Requirements

  • Go 1.25.5 or later
  • An LLM provider (OpenAI / Ollama / OpenAI-compatible endpoint, or Anthropic)

Quick Start

Start with the Simple Agent example to learn the basics in ~100 lines of code.

Then try:

Then explore the examples/ directory for more advanced patterns:

  • Multi-agent workflows
  • Multimodal and audio pipelines
  • RAG chatbots
  • Skills integration
  • MCP server connections

Some examples require extra services (e.g. Qdrant for vector search).

Architecture

Phero is organized into focused packages, each solving a specific problem:

๐Ÿค– Agent Layer

  • agent Core orchestration for LLM-based agents with tool execution, chat loops, and runtime handoffs
  • memory Conversational context management for multi-turn interactions (in-process, file-backed, RAG-backed, PostgreSQL-backed, or NATS JetStream KV-backed)

๐Ÿ’ฌ LLM Layer

  • llm Provider-agnostic chat interface with typed messages/content parts, function tools, JSON Schema utilities, audio interfaces, and LLM middleware composition
  • llm/openai OpenAI-compatible client (works with OpenAI, Ollama, and compatible endpoints)
  • llm/anthropic Anthropic API client

๐Ÿง  Knowledge Layer

  • embedding Embedding interface for semantic operations
  • embedding/openai OpenAI embeddings implementation
  • vectorstore Vector storage interface for similarity search
  • vectorstore/qdrant Qdrant vector database integration
  • vectorstore/psql PostgreSQL + pgvector integration
  • vectorstore/weaviate Weaviate vector database integration
  • textsplitter Text splitting interface and shared types
  • textsplitter/recursive Recursive character-based chunker
  • textsplitter/markdown Markdown-aware chunker (heading-first separators)
  • rag Complete RAG pipeline combining embeddings and vector stores

๐Ÿ”ง Tools & Integration

  • skill Parse SKILL.md files and expose them as agent capabilities
  • mcp Model Context Protocol adapter for external tool integration
  • a2a Agent-to-Agent (A2A) protocol โ€” expose agents as HTTP servers or call remote agents as tools
  • nats NATS Agent Protocol v0.3 โ€” register agents as NATS micro services; discover and call them over pub/sub
  • trace Typed observability events; trace/text for human-readable colorized output; trace/jsonfile for NDJSON file logging; trace.NewLLM for raw LLM call wrapping
  • tool/agent Create and run a sub-agent at runtime as a delegated tool
  • tool/file Filesystem tools (read, write, edit, glob, grep)
  • tool/bash Bash command execution with guardrails (blocklist, allowlist, timeout, safe mode) and background execution (RunInBackground, bash_output, kill_shell)
  • tool/human Structured user-interaction checkpoints; caller provides the interactor via WithInteractor
  • tool/skill Dispatcher-style SKILL.md loader tool that expands instructions in the main conversation

Examples

Comprehensive examples are included in the examples/ directory:

ExampleDescription
Simple AgentStart here! Minimal example showing one agent with one custom tool perfect for learning the basics
MultimodalSend text + image inputs to a vision-capable model using typed content parts
AudioEnd-to-end speech-to-text and text-to-speech using the OpenAI backend
LLM MiddlewareWrap an LLM with composable middleware for logging and other cross-cutting concerns
Conversational AgentREPL-style chatbot with short-term conversational memory and a simple built-in tool
Long-Term MemoryREPL-style chatbot with semantic long-term memory (RAG) backed by Qdrant
NATS MemoryPersistent chatbot backed by NATS JetStream KV; conversation survives process restarts and supports named sessions
HandoffOne agent hands work off to a specialist agent at runtime using the built-in handoff mechanism
A2A ServerExpose a Phero agent as an A2A-compliant HTTP server for cross-process agent calls
A2A ClientConnect to a remote A2A agent and use it as a local tool inside an orchestrator
A2A Multi-Agent NewsroomThree specialised agents (researcher, writer, editor) each running as an independent A2A server, coordinated by a local orchestrator
NATS AgentRegister a Phero agent as a NATS micro service and interact with it from an interactive client using the NATS Agent Protocol
NATS Multi-Agent NewsroomThree specialised agents running as NATS micro services, orchestrated via service discovery and Client.AsTool()
Debate CommitteeMulti-agent architecture where committee members debate independently and a judge synthesizes the final decision
Evaluator-OptimizerIterative generation loop where an optimizer proposes drafts and an evaluator critiques them until quality criteria are met
Human-in-the-LoopMulti-agent flow that pauses for explicit human approval/input before continuing
Multi-Agent WorkflowClassic Plan โ†’ Execute โ†’ Analyze โ†’ Critique pattern with specialized agent roles
Orchestrator-WorkersDynamic task decomposition where an orchestrator delegates sub-tasks to worker agents
Parallel ResearchFan-out/fan-in workflow that runs multiple specialist researchers in parallel and merges their findings
Prompt ChainingSequential multi-step prompting with a programmatic gate between stages
RAG ChatbotTerminal chatbot with semantic search over local documents using Qdrant
SkillUse the tool/skill dispatcher to load SKILL.md instructions into the current conversation
Social SimulationMulti-agent social simulation with persona-driven actors and emergent interactions
MCP IntegrationRun an MCP server as a subprocess and expose its tools to agents
Playwright MCPConnect browser automation tools through MCP and orchestrate them from an agent
Supervisor BlackboardSupervisor-worker pattern with a shared blackboard for coordination
TracingAttach a colorized tracer to an agent and inspect LLM requests, tool calls, and memory events in real time

Design Philosophy

Phero embraces several core principles:

  1. Composability over monoliths Each package does one thing well
  2. Interfaces over implementations Swap LLMs, vector stores, or embeddings easily
  3. Explicit over implicit No hidden magic; clear control flow
  4. Tools are first-class Function tools are the primary integration point
  5. Developer experience matters Clean APIs, helpful tracing, good error messages

Contributing

Contributions are welcome! Please feel free to submit issues, feature requests, or pull requests.

License

This project is licensed under the Apache License 2.0. See the LICENSE file for details.

Acknowledgments

Built with โค๏ธ by Simone Vellei.

Inspired by the collaborative intelligence of ant colonies where independent agents work together toward shared goals, recognizing one another and coordinating through clear protocols.

The ant is not just a mascot. It is the philosophy. ๐Ÿœ