README.md

April 28, 2026 · View on GitHub

Manifest

Reduce your AI costs

manifest-gh

beta   GitHub stars   Docker pulls   Docker image size   CI status   Codecov   license   Discord

mnfst%2Fmanifest | Trendshift

What is Manifest?

Manifest is a smart model router for agents and AI applications that redirects each query to the right model, saving up to 70% in AI costs.

  • 🔀 Routing based on complexity, specificity and custom HTTP headers
  • 🎛️ Mix your providers: API keys, Subscriptions, Local models, Custom providers
  • 📊 Track every single dollar, setup notifications and limits
  • 🚑 Fallback on different models when queries fails

Quick start

Cloud version

Go to app.manifest.build and follow the guide.

Self-hosted

Manifest ships as a Docker image. One command:

bash <(curl -sSL https://raw.githubusercontent.com/mnfst/manifest/main/docker/install.sh)

Open http://localhost:2099 and sign up — the first account you create becomes the admin. Full self-hosting guide: docker/DOCKER_README.md.

The legacy manifest npm package is deprecated and no longer published.

Providers

Manifest connects to 300+ models across 16 providers plus any custom provider (OpenAI/Anthropic compatible). Bring your own API key, reuse a paid subscription you already have, or run models locally — all routed through the same /auto endpoint.

ProviderAPI keySubscriptionFeatured models
OpenAI✅ ChatGPT Plus / Pro / Teamgpt-5, gpt-5-mini, o4, o4-mini
Anthropic✅ Claude Max / Proclaude-opus-4-7, claude-sonnet-4-6, claude-haiku-4-5
Googlegemini-2.5-pro, gemini-2.5-flash, gemini-2.0-flash
xAIgrok-4, grok-3, grok-code-fast
DeepSeekdeepseek-v3.2, deepseek-r1
Mistralmistral-large, codestral, magistral
Qwen (Alibaba)qwen3-max, qwen3-coder, qwq-32b
Moonshot (Kimi)kimi-k2, moonshot-v1-128k
MiniMax✅ MiniMax Coding Planminimax-m2, abab7-chat-preview
Z.ai (Zhipu)✅ GLM Coding Planglm-4.6, glm-4.5-air
OpenCode✅ Go subscriptionRoutes via OpenCode Go catalog
Ollama🖥️ Local✅ Ollama CloudAny GGUF model, port 11434
LM Studio🖥️ LocalAny GGUF model, port 1234
llama.cpp🖥️ LocalAny GGUF model, port 8080
OpenRouterRoutes to 300+ models across labs
GitHub Copilot✅ Copilot subscriptionOAuth, no API key needed
Custom (OpenAI/Anthropic-compatible)Any /v1/chat/completions or /v1/messages endpoint

License

MIT