Lemonade Endpoints Spec

June 12, 2026 ยท View on GitHub

The Lemonade HTTP service provides a wide array of standards-compliant and custom endpoints.

Our design philosophy is:

  1. Ensure that Lemonade works out-of-box with all popular local AI apps.
  2. Prioritize using a pre-existing standard for functionality whenever possible.
  3. Add sufficient custom functionality to enable developers to build highly polished experiences.

This spec details all supported endpoints. It is organized into pages that correspond to which organization (OpenAI, Ollama, Lemonade, etc.) defined the endpoints.

APIDescription
OpenAI-Compatible APIStart here for the main API surface used by most SDKs and clients.
Ollama-Compatible APIUse this if your client expects Ollama-style behavior and routes.
Anthropic-Compatible APIUse this for clients built around Anthropic's message format.
llama.cpp-Specific APIReference for llama.cpp-specific compatibility and conventions.
Lemonade-Specific APILocal-first API for managing lifecycle, configuration, backends, etc.