Decoupling Model Selection from Profile Binding

June 24, 2026 · View on GitHub

Problem

Profiles currently provide useful defaults (engine, tool policy, system prompt), but users need to choose model independently at runtime (task/agent/session) without being locked to profile model defaults.

Goals

Preserve profile-level behavior and policy defaults.
Let callers specify model independently from profile.
Keep explicit engine selection possible.
Support clear precedence and mismatch visibility.

Final Design

1) Canonical run contract now has top-level `model`

LemonCore.RunRequest now includes :model so model overrides can be passed explicitly (not only via meta).

2) Dedicated resolver module

Added LemonRouter.ModelSelection to centralize resolution:

Model precedence:
1. request-level explicit model
2. meta model (back-compat)
3. session policy model
4. profile model
5. config default model (LemonCore.Config.cached().agent.default_model, backed by [defaults])

SubmissionBuilder also stores the resolved history_model in submission metadata for routing-feedback history when that feature is enabled. Runtime modules should not read Application.get_env(:lemon_router, :default_model); model defaults come from Lemon config or policy stores.

Engine precedence:
1. resume token engine
2. explicit engine_id
3. model-implied engine (codex:*, claude:*, etc.)
4. profile default engine

3) Mismatch warnings (non-blocking)

If explicit engine conflicts with model-implied engine, the system:

keeps explicit engine (caller intent wins),
records warning in job.meta[:model_resolution_warning],
logs warning in orchestrator.

4) API/tool surface updates

agent control-plane method accepts model.
agent.inbox.send accepts model.
CodingAgent.Tools.Agent accepts model and forwards it through RunRequest.model.
CodingAgent.Tools.Task accepts model and thinking_level for internal-session subtasks.

Add strict mode to reject incompatible engine/model combinations.
Add capability checks (e.g., tool-calling requirements vs selected model).
Expose model-resolution diagnostics in run inspection APIs.