Customization Guide
May 8, 2026 · View on GitHub
Open SWE is designed to be forked and customized for your org. The core agent is assembled in a single function — get_agent() in agent/server.py — where you can swap out the sandbox, model, tools, and triggers.
# agent/server.py — the key lines
model_id = os.environ.get("LLM_MODEL_ID", DEFAULT_LLM_MODEL_ID)
model_kwargs = {"max_tokens": DEFAULT_LLM_MAX_TOKENS}
if model_id == DEFAULT_LLM_MODEL_ID:
model_kwargs["reasoning"] = DEFAULT_LLM_REASONING
return create_deep_agent(
model=make_model(model_id, **model_kwargs),
system_prompt=construct_system_prompt(...),
tools=[http_request, fetch_url, linear_comment, slack_thread_reply],
backend=sandbox_backend,
middleware=[
ToolErrorMiddleware(),
check_message_queue_before_model,
ensure_no_empty_msg,
notify_step_limit_reached,
],
)
1. Sandbox
By default, Open SWE runs each task in a LangSmith cloud sandbox — an isolated Linux environment where the agent clones the repo and executes commands. Sandbox creation and connection is handled in agent/integrations/langsmith.py.
Using a custom sandbox snapshot
Build a snapshot in LangSmith (UI or SandboxClient.create_snapshot) from your Docker image and point Open SWE at its UUID:
DEFAULT_SANDBOX_SNAPSHOT_ID="<snapshot-uuid>" # Required
DEFAULT_SANDBOX_SNAPSHOT_FS_CAPACITY_BYTES="34359738368" # Optional, default 32 GiB
DEFAULT_SANDBOX_VCPUS="4" # Optional, default 4
DEFAULT_SANDBOX_MEM_BYTES="16106127360" # Optional, default 15 GiB
DEFAULT_SANDBOX_IDLE_TTL_SECONDS="600" # Optional, default 600 (10 min); 0 disables
DEFAULT_SANDBOX_DELETE_AFTER_STOP_SECONDS="86400" # Optional, default 86400 (24 h); 0 disables
This is useful for pre-installing languages, frameworks, or internal tools that your repos depend on — reducing setup time per agent run. The default snapshot includes the GitHub CLI; agents invoke it as GH_TOKEN=dummy gh <command> and rely on the LangSmith proxy for the real credentials.
For LangSmith sandboxes, Open SWE configures two GitHub proxy rules whenever a sandbox is created or reattached to a run:
github.com/*.github.comreceive Basic auth for git-over-HTTPS operations.api.github.comreceives Bearer auth forghand REST API operations.
The proxy token is minted at runtime from the GitHub App installation credentials. Do not store GitHub access tokens as deployment environment variables.
Using a different sandbox provider
Set the SANDBOX_TYPE environment variable to switch providers. Each provider has a corresponding integration file in agent/integrations/ and a factory function registered in agent/utils/sandbox.py:
SANDBOX_TYPE | Integration file | Required env vars |
|---|---|---|
langsmith (default) | agent/integrations/langsmith.py | LANGSMITH_API_KEY_PROD, SANDBOX_TYPE="langsmith" |
daytona | agent/integrations/daytona.py | DAYTONA_API_KEY, SANDBOX_TYPE="daytona", optional DAYTONA_SANDBOX_SNAPSHOT |
runloop | agent/integrations/runloop.py | RUNLOOP_API_KEY, SANDBOX_TYPE="runloop" |
modal | agent/integrations/modal.py | Modal credentials, SANDBOX_TYPE="modal" |
local | agent/integrations/local.py | None (no isolation — development only), SANDBOX_TYPE="local" |
Warning:
localruns commands directly on your host with no sandboxing. Only use for local development with human-in-the-loop enabled.
Adding a new sandbox provider
- Create an integration file at
agent/integrations/my_provider.pywith a factory function matching this signature:
def create_my_provider_sandbox(sandbox_id: str | None = None):
"""Create or reconnect to a sandbox.
Args:
sandbox_id: Optional existing sandbox ID to reconnect to.
If None, creates a new sandbox.
Returns:
An object implementing SandboxBackendProtocol.
"""
...
- Register it in
agent/utils/sandbox.pyby importing your factory and adding it toSANDBOX_FACTORIES:
from agent.integrations.my_provider import create_my_provider_sandbox
SANDBOX_FACTORIES = {
...
"my_provider": create_my_provider_sandbox,
}
The factory must return an object implementing SandboxBackendProtocol from deepagents. See the existing integration files for reference.
Building a custom sandbox provider
If none of the built-in providers fit, you can build your own. The agent accepts any backend that implements SandboxBackendProtocol from deepagents. The protocol requires:
- File operations:
ls(),read(),write(),edit(),glob(),grep() - Shell execution:
execute(command, timeout=None) -> ExecuteResponse - Identity:
idproperty returning a unique sandbox identifier
The easiest approach is to extend BaseSandbox from deepagents.backends.sandbox — it implements all file operations by delegating to execute(), so you only need to implement the shell execution layer:
from deepagents.backends.sandbox import BaseSandbox
from deepagents.backends.protocol import ExecuteResponse
class MySandbox(BaseSandbox):
def __init__(self, connection):
self._conn = connection
@property
def id(self) -> str:
return self._conn.id
def execute(self, command: str, *, timeout: int | None = None) -> ExecuteResponse:
result = self._conn.run(command, timeout=timeout or 300)
return ExecuteResponse(
output=result.stdout + result.stderr,
exit_code=result.exit_code,
truncated=False,
)
See deepagents.backends.LangSmithSandbox and agent/integrations/langsmith.py for a full reference implementation.
2. Model
The model is configured in the get_agent() function in agent/server.py. By default it uses openai:gpt-5.5 with medium reasoning effort, but you can override the model with the LLM_MODEL_ID environment variable:
# Set the model via environment variable (uses provider:model format)
LLM_MODEL_ID="anthropic:claude-sonnet-4-6"
If LLM_MODEL_ID is not set, the default model (openai:gpt-5.5) is used.
max_tokens is a maximum completion/output token budget, not the model's total context window. For OpenAI reasoning models, this budget can include both internal reasoning tokens and final response tokens.
Switching models
Use the provider:model format:
# Anthropic
model=make_model("anthropic:claude-sonnet-4-6", temperature=0, max_tokens=16_000)
# OpenAI (uses Responses API by default)
model=make_model("openai:gpt-5.5", max_tokens=128_000, reasoning={"effort": "medium"})
# Google
model=make_model("google_genai:gemini-2.5-pro", temperature=0, max_tokens=16_000)
The make_model() helper in agent/utils/model.py wraps langchain.chat_models.init_chat_model. For OpenAI models, it automatically enables the Responses API. For full control, pass a pre-configured model instance directly:
from langchain_anthropic import ChatAnthropic
model = ChatAnthropic(model_name="claude-sonnet-4-6", temperature=0, max_tokens=16_000)
return create_deep_agent(
model=model,
...
)
Using different models per context
You can route to different models based on task complexity, repo, or trigger source:
async def get_agent(config: RunnableConfig) -> Pregel:
source = config["configurable"].get("source")
if source == "slack":
# Faster model for Slack Q&A
model = make_model("anthropic:claude-sonnet-4-6", temperature=0, max_tokens=16_000)
else:
# Full model for code changes from Linear
model = make_model("openai:gpt-5.5", max_tokens=128_000, reasoning={"effort": "medium"})
return create_deep_agent(model=model, ...)
3. Tools
Open SWE ships with a small set of custom tools on top of the built-in Deep Agents tools (file operations, shell execution, subagents, todos). GitHub operations are handled by GH_TOKEN=dummy gh inside the sandbox.
| Tool | File | Purpose |
|---|---|---|
fetch_url | agent/tools/fetch_url.py | Fetch web pages as markdown |
http_request | agent/tools/http_request.py | HTTP API calls |
linear_comment | agent/tools/linear_comment.py | Post comments on Linear tickets |
slack_thread_reply | agent/tools/slack_thread_reply.py | Reply in Slack threads |
Adding a tool
Create a new file in agent/tools/, define a function, and add it to the tools list.
Example — adding a Datadog search tool:
# agent/tools/datadog_search.py
import requests
from typing import Any
def datadog_search(query: str, time_range: str = "1h") -> dict[str, Any]:
"""Search Datadog logs for debugging context.
Args:
query: Datadog log query string
time_range: Time range to search (e.g. "1h", "24h", "7d")
Returns:
Dictionary with matching log entries
"""
# Your Datadog API integration here
...
Then register it in agent/server.py:
from .tools import fetch_url, http_request, linear_comment, slack_thread_reply
from .tools.datadog_search import datadog_search
return create_deep_agent(
...
tools=[
http_request, fetch_url,
linear_comment, slack_thread_reply,
datadog_search, # new tool
],
...
)
The agent will automatically see the tool's name, docstring, and parameter types — the docstring serves as the tool description, so write it clearly.
Removing tools
If you only use Linear (not Slack), remove slack_thread_reply from the tools list and vice versa. If you don't need web fetching, remove fetch_url.
Conditional tools
You can vary the toolset based on the trigger source:
base_tools = [http_request, fetch_url]
source = config["configurable"].get("source")
if source == "linear":
tools = [*base_tools, linear_comment]
elif source == "slack":
tools = [*base_tools, slack_thread_reply]
else:
tools = [*base_tools, linear_comment, slack_thread_reply]
return create_deep_agent(tools=tools, ...)
4. Triggers
Open SWE supports three invocation surfaces: Linear, Slack, and GitHub. Each is implemented as a webhook endpoint in agent/webapp.py. You can add, remove, or modify triggers independently.
Removing a trigger
If you don't use Linear, simply don't configure the Linear webhook and remove the env vars. Same for Slack. The webhook endpoints still exist but won't receive events.
To fully remove a trigger's code, delete the corresponding endpoint from agent/webapp.py:
- Linear:
linear_webhook()andprocess_linear_issue() - Slack:
slack_webhook()andprocess_slack_mention()
Default repository
Set the default GitHub org and repo used across all triggers (Slack, Linear, GitHub) when no repo is specified:
DEFAULT_REPO_OWNER="my-org" # Default GitHub org (used everywhere)
DEFAULT_REPO_NAME="my-repo" # Default GitHub repo (used everywhere)
These are used as the fallback when:
- A Slack message doesn't specify a repo (and no thread metadata exists)
- A Linear issue's team/project isn't in the
LINEAR_TEAM_TO_REPOmapping - A user writes
repo:namewithout an org prefix — the org defaults toDEFAULT_REPO_OWNER
Repository extraction from messages
Both Slack and Linear support specifying a target repo directly in the message or comment text. The shared utility extract_repo_from_text() in agent/utils/repo.py handles parsing these formats:
repo:owner/name— explicit org and reporepo owner/name— space syntax (same result)repo:name— repo name only; the org defaults toDEFAULT_REPO_OWNERhttps://github.com/owner/name— GitHub URL
Customizing Linear routing
The LINEAR_TEAM_TO_REPO dict in agent/utils/linear_team_repo_map.py maps Linear teams and projects to GitHub repos:
LINEAR_TEAM_TO_REPO = {
"Engineering": {
"projects": {
"backend": {"owner": "my-org", "name": "backend"},
"frontend": {"owner": "my-org", "name": "frontend"},
},
"default": {"owner": "my-org", "name": "monorepo"},
},
}
Users can also override the team/project mapping on a per-comment basis by including repo:owner/name in their @openswe comment. This takes priority over the mapping — the mapping is used as a fallback when no repo is specified in the comment. If the team/project isn't found in the mapping either, DEFAULT_REPO_OWNER/DEFAULT_REPO_NAME is used.
Customizing Slack routing
Slack uses DEFAULT_REPO_OWNER and DEFAULT_REPO_NAME as the fallback when no repo is specified in a message.
Users can override per-message with repo:owner/name syntax in their Slack message. A shorthand repo:name (without the org) is also supported — the org defaults to DEFAULT_REPO_OWNER.
Adding a new trigger
To add a new invocation surface (e.g. Jira, Discord, a custom API):
- Add a webhook endpoint in
agent/webapp.py:
@app.post("/webhooks/my-trigger")
async def my_trigger_webhook(request: Request, background_tasks: BackgroundTasks):
# Parse the incoming event
payload = await request.json()
# Extract task description and repo info
task_description = payload["description"]
repo_config = {"owner": "my-org", "name": "my-repo"}
# Create a LangGraph run
background_tasks.add_task(process_my_trigger, task_description, repo_config)
return {"status": "accepted"}
- Create a processing function that builds the prompt and starts an agent run:
async def process_my_trigger(task_description: str, repo_config: dict):
thread_id = generate_deterministic_id(task_description)
langgraph_client = get_client(url=LANGGRAPH_URL)
await langgraph_client.runs.create(
thread_id,
"agent",
input={"messages": [{"role": "user", "content": task_description}]},
config={"configurable": {
"repo": repo_config,
"source": "my-trigger",
"user_email": "user@example.com",
}},
if_not_exists="create",
)
- Add a communication tool (optional) so the agent can report back:
# agent/tools/my_trigger_reply.py
def my_trigger_reply(message: str) -> dict:
"""Post a reply to the triggering service."""
# Your API call here
...
The key fields in config.configurable are:
repo:{"owner": "...", "name": "..."}— which GitHub repo to work onsource: string identifying the trigger (used for auth routing and communication)user_email: the triggering user's email (for GitHub OAuth resolution)
5. System prompt
The system prompt is assembled in agent/prompt.py from modular sections. You can customize behavior by editing individual sections:
| Section | What it controls |
|---|---|
WORKING_ENV_SECTION | Sandbox paths and execution constraints |
TASK_EXECUTION_SECTION | Workflow steps (understand → implement → verify → submit) |
CODING_STANDARDS_SECTION | Code style, testing, and quality rules |
COMMIT_PR_SECTION | PR title/body format and commit conventions |
CODE_REVIEW_GUIDELINES_SECTION | How the agent reviews code changes |
COMMUNICATION_SECTION | Formatting and messaging guidelines |
Default prompt file
Open SWE supports a default_prompt.md file for org-level instructions that apply to every agent run, regardless of which repository is being worked on. This is the recommended way to set default repository preferences, org conventions, and shared guidelines.
The file is loaded at agent startup and injected into the system prompt between the task overview and repository setup sections.
Location: default_prompt.md in the project root.
Override: Set the DEFAULT_PROMPT_PATH environment variable to use a different file:
DEFAULT_PROMPT_PATH="/path/to/my-org-prompt.md"
Format: Write plain markdown. The content is injected as-is under a ### Custom Instructions heading in the system prompt. Example:
# Default Prompt
## Default Repository
When no repository is specified, work on the **my-app** repository under **my-org**.
## Organization Conventions
- Use conventional commits: feat:, fix:, chore:
- Always tag the requesting user when work is complete
Loading order: Default prompt → System prompt sections → AGENTS.md (per-repo). If the file is missing or empty, it is silently skipped — no error is raised.
When to use default_prompt.md vs AGENTS.md:
default_prompt.md | AGENTS.md | |
|---|---|---|
| Scope | All tasks, all repos | Single repository |
| Location | Open SWE project root | Target repo root |
| Use for | Default repo, org conventions | Repo-specific coding standards |
Using AGENTS.md
Drop an AGENTS.md file in the root of any repository to add repo-specific instructions. The agent reads it from the sandbox at startup and appends it to the system prompt. This is the easiest way to encode conventions per-repo without modifying Open SWE's code.
6. Middleware
Middleware hooks run around the agent loop. Open SWE includes:
| Middleware | Type | Purpose |
|---|---|---|
ToolErrorMiddleware | Tool error handler | Catches and formats tool errors |
check_message_queue_before_model | Before model | Injects follow-up messages that arrived mid-run |
ensure_no_empty_msg | Before model | Prevents empty messages from reaching the model |
notify_step_limit_reached | After agent | Posts a Slack reply when the agent hits the model-call limit |
There is intentionally no after-agent middleware that opens a PR for the agent. The agent is responsible for committing, pushing, opening/updating the draft PR, and replying in the source channel. If you want a deterministic backstop for your fork, add an @after_agent hook here.
Add custom middleware by appending to the middleware list in get_agent(). See the LangChain middleware docs for the @before_model and @after_agent decorators.
Example — adding a CI check after agent completion:
from langchain.agents.middleware import AgentState, after_agent
from langgraph.runtime import Runtime
@after_agent
async def run_ci_check(state: AgentState, runtime: Runtime):
"""Run CI checks after the agent finishes."""
# Trigger your CI pipeline here
...
Then add it to the middleware list:
middleware=[
ToolErrorMiddleware(),
check_message_queue_before_model,
ensure_no_empty_msg,
notify_step_limit_reached,
run_ci_check, # new middleware
],