CLAUDE.md

March 16, 2026 · View on GitHub

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

Commands

Testing and Development

Run smoke tests: uv run scripts/test_mcp_server.py server/zenml_server.py
Run analytics tests: uv run scripts/test_analytics.py --full-diagnostic
Run unit tests: uv run scripts/test_datetime_normalization.py
Format code: ./scripts/format.sh (uses ruff for linting/formatting + ty for type checking)
Run MCP server locally: uv run server/zenml_server.py
Type check only: uvx ty check (runs type checking without formatting)

Code Quality

Format + Type Check: bash scripts/format.sh (runs ruff + ty)
Type Check Only: uvx ty check (uses configuration from pyproject.toml)

Development Workflow

IMPORTANT: Always use feature branches and pull requests for changes.

ALSO IMPORTANT: Before opening a PR or making a large commit, always run /simplify to review changed code for reuse opportunities, quality issues, and efficiency improvements. Fix any issues it finds before committing.

Create a feature branch for any changes:

git checkout -b feature/your-feature-name

Make your changes and ensure tests pass:

uv run scripts/test_mcp_server.py server/zenml_server.py
docker build -t mcp-zenml:test .  # Verify Docker build works

Create a pull request - never commit directly to main:

git push -u origin feature/your-feature-name
gh pr create --fill

Wait for CI to pass before merging - PR tests include:
- MCP smoke tests (Python) — requires ZenML credentials
- Analytics pipeline tests
- Unit tests (datetime normalization, exception classification) — no credentials needed
- Docker build verification
- Type checking (ty)
- Security linting (zizmor)
After merge, trigger release if needed (see Release Process below)

Why this matters: Direct commits to main bypass CI checks and can result in broken releases (e.g., Docker images that fail to start). The PR workflow ensures all changes are validated before release.

PR and Commit Style

PR titles: Use plain English titles without conventional commit prefixes (e.g., "Improve error detection in smoke test" not "fix: improve error detection in smoke test")
Commit messages: Can use conventional commits for the commit history, but PR titles should be human-readable

Architecture

Core Components

The project is a Model Context Protocol (MCP) server that provides AI assistants with access to ZenML API functionality.

Main Server File: server/zenml_server.py

Uses FastMCP framework for MCP protocol implementation
Implements lazy initialization of ZenML client to avoid startup delays
Provides comprehensive exception handling with the @handle_exceptions decorator
Configures minimal logging to prevent JSON protocol interference

Analytics Module: server/zenml_mcp_analytics.py

Anonymous usage tracking via the ZenML Analytics Server (opt-out available)
Sends events to https://analytics.zenml.io/batch with Source-Context: mcp-zenml
Tracks tool usage, session duration, error rates, and MCP client info
Deterministic Docker user IDs (UUID5 from ZENML_STORE_URL hash when filesystem is ephemeral)
Synchronous shutdown flush for reliable delivery under SIGTERM
Session-wide properties via set_session_properties() / set_client_info_once()
Failure-safe: analytics errors never affect server functionality
Environment variables: ZENML_MCP_ANALYTICS_ENABLED, ZENML_MCP_ANALYTICS_DEV, ZENML_MCP_ANALYTICS_SHUTDOWN_TIMEOUT_S

Key Features:

Reads ZenML server configuration from environment variables (ZENML_STORE_URL, ZENML_STORE_API_KEY)
Provides MCP tools for accessing ZenML entities (users, stacks, pipelines, runs, etc.)
Supports triggering new pipeline runs via snapshots (preferred) or run templates (deprecated)
Includes automated CI/CD testing with GitHub Actions

Domain Model: Snapshots vs Run Templates

Historical context: ZenML underwent a significant evolution in its "runnable pipeline artifact" concepts:

2024-07-22: Run Templates introduced, pointing to "pipeline deployments"
2025-07-22: Pipeline Deployments renamed to Snapshots; Run Templates now reference snapshots via source_snapshot_id
Current: Run Template API marked deprecated=True; SDK methods emit deprecation warnings

What this means:

Snapshots = The core "frozen pipeline configuration" artifact (immutable, runnable, deployable)
Run Templates = A legacy wrapper that just references a snapshot (effectively a named pointer)

For contributors:

New development should be snapshot-first
Run template tools (get_run_template, list_run_templates) are kept for backward compatibility but include deprecation warnings
trigger_pipeline supports both snapshot_name_or_id (preferred) and template_id (deprecated)

MCP Tool Taxonomy

Tools are organized by entity type in server/zenml_server.py:

Category	Tools	Notes
Projects	`get_active_project`, `get_project`, `list_projects`	New in v1.2
Snapshots	`get_snapshot`, `list_snapshots`	Replaces run templates
Deployments	`get_deployment`, `list_deployments`, `get_deployment_logs`	New in v1.2
Tags	`get_tag`, `list_tags`	New in v1.2
Builds	`get_build`, `list_builds`	New in v1.2
Users	`get_user`, `list_users`, `get_active_user`
Stacks	`get_stack`, `list_stacks`
Components	`get_stack_component`, `list_stack_components`
Flavors	`get_flavor`, `list_flavors`
Pipelines	`list_pipelines`, `get_pipeline_details`
Runs	`get_pipeline_run`, `list_pipeline_runs`
Steps	`get_run_step`, `list_run_steps`, `get_step_logs`, `get_step_code`
Schedules	`get_schedule`, `list_schedules`
Services	`get_service`, `list_services`
Connectors	`get_service_connector`, `list_service_connectors`
Models	`get_model`, `list_models`, `get_model_version`, `list_model_versions`
Artifacts	`list_artifacts`
Secrets	`list_secrets`	Names only
Analysis	`stack_components_analysis`, `recent_runs_analysis`, `most_recent_runs`
Diagnostics	`diagnose_zenml_setup`	Works without ZenML SDK
Execution	`trigger_pipeline`	Prefer `snapshot_name_or_id`
Deprecated	`get_run_template`, `list_run_templates`	Use snapshot tools instead

When adding new tools:

Add the tool to server/zenml_server.py following existing patterns
Update README.md tool inventory
If the tool is safe (read-only, no required IDs), add to scripts/test_mcp_server.py safe_tools_to_test
Run smoke tests: uv run scripts/test_mcp_server.py server/zenml_server.py

Environment Setup

The server requires:

Python 3.12+
Dependencies managed via uv (preferred) or pip
ZenML server URL and API key configured as environment variables

Testing Infrastructure

PR Testing: GitHub Actions runs tests on every PR (smoke tests, unit tests, formatting, type checks)
Scheduled testing: Comprehensive smoke tests run every 3 days with automated issue creation on failures
Manual testing: Use the test scripts to verify MCP protocol functionality
CI/CD: Uses UV with caching for fast dependency installation
Important: When adding new test scripts, always wire them into .github/workflows/pr-test.yml so they run in CI. Tests that don't need ZenML credentials should run unconditionally (no if: env.ZENML_STORE_URL != '' guard).

Debugging with MCP Inspector

The MCP Inspector is an interactive debugging tool for testing MCP servers. It provides a web UI to call tools, inspect responses, and debug issues.

Quick start (using .env.local):

Copy the example file and add your credentials:

cp .env.local.example .env.local
# Edit .env.local with your ZENML_STORE_URL and ZENML_STORE_API_KEY

Run the inspector with credentials loaded from .env.local:

source .env.local && npx @modelcontextprotocol/inspector \
  -e ZENML_STORE_URL=$ZENML_STORE_URL \
  -e ZENML_STORE_API_KEY=$ZENML_STORE_API_KEY \
  -- uv run server/zenml_server.py

This opens a web UI (typically at http://localhost:6274) with your credentials pre-filled. Just click "Connect" and start testing!

Alternative: inline credentials (for one-off testing):

npx @modelcontextprotocol/inspector \
  -e ZENML_STORE_URL=https://your-server.zenml.io \
  -e ZENML_STORE_API_KEY=ZENKEY_... \
  -- uv run server/zenml_server.py

Key syntax notes:

-e key=value flags pass environment variables to the server subprocess
Place -e flags before the command (uv)
Use -- to separate inspector flags from server arguments

Without pre-filled env vars:

npx @modelcontextprotocol/inspector uv run server/zenml_server.py

Then manually add ZENML_STORE_URL and ZENML_STORE_API_KEY in the UI under Environment Variables before clicking Connect.

What you can test:

Tools tab: Call any MCP tool and see JSON request/response
Resources tab: Browse exposed resources (none currently)
Prompts tab: View prompt templates (none currently)
History: See all previous tool calls in the session

Testing MCP Apps with Docker + Cloudflare Tunnel

MCP Apps (interactive HTML UIs rendered in sandboxed iframes) require Streamable HTTP transport and a publicly reachable URL. Use Docker + Cloudflare tunnel for local testing.

Note: As of late January 2026, Claude Desktop and Claude.ai do not render MCP Apps (the tool calls work, but the interactive iframe UI does not appear). MCP Apps currently work with third-party clients that support the MCP Apps specification. This testing workflow is primarily useful for development validation.

1. Build the Docker image:

docker build --no-cache -t mcp-zenml:test .

2. Run the container:

docker run --rm -d --name mcp-zenml-test -p 8001:8001 \
  -e ZENML_STORE_URL=https://your-server.zenml.io \
  -e ZENML_STORE_API_KEY=ZENKEY_... \
  -e ZENML_ACTIVE_PROJECT_ID=your-project-id \
  mcp-zenml:test --transport streamable-http --host 0.0.0.0 --port 8001 --disable-dns-rebinding-protection

3. Start a Cloudflare tunnel:

npx cloudflared tunnel --url http://localhost:8001

This prints a public URL like https://random-words.trycloudflare.com.

4. Connect from an MCP client:

Add the tunnel URL with /mcp path as a Streamable HTTP MCP server: https://random-words.trycloudflare.com/mcp
Ask the assistant to use the app (e.g., "open the run activity chart")
If the app UI does not render (blank/no iframe), this is typically a client capability limitation rather than a server issue

Gotchas:

ZENML_ACTIVE_PROJECT_ID is required — without it, tools like list_pipeline_runs fail with "No project is currently set as active"
Port 8000 may be in use — the MCP Inspector or other services often occupy 8000; use 8001+ for Docker
Tunnel URL changes on restart — each npx cloudflared tunnel invocation gets a new random URL; update your MCP client configuration accordingly
Container logs are essential — run docker logs mcp-zenml-test to see server errors (they won't appear in the browser/iframe)
The Dockerfile copies server/ui/ automatically, so new MCP App HTML files are included in the build

Project Structure

server/ - Main MCP server implementation
- server/ui/ - MCP App HTML files (self-contained single-file apps)
scripts/ - Development and testing utilities
assets/ - Project assets and images
Root files include configuration for Desktop Extensions (DXT) support

Type Checking with ty

The project uses ty for static type checking - an extremely fast Python type checker from Astral (creators of uv and ruff).

Configuration: pyproject.toml under [tool.ty]

Python version: 3.12
Extra paths: server/ (allows import zenml_mcp_analytics to resolve)
Include patterns: server/**/*.py, scripts/**/*.py
Third-party imports: Ignored (since deps are installed on-the-fly via PEP 723)

Running type checks:

uvx ty check                    # Basic check
uvx ty check --output-format=github  # For CI (annotations)
bash scripts/format.sh          # Runs ruff + ty together

Suppressing false positives: Use # type: ignore[rule-name] or # ty: ignore[rule-name] comments when needed (prefer rule-specific suppressions).

CI Integration: Type checking runs as a separate job in PR tests (.github/workflows/pr-test.yml).

Note on third-party imports: Since this project uses PEP 723 inline script metadata for dependencies (installed on-the-fly by uv run), ty runs in isolation and can't see them. The unresolved-import = "ignore" setting handles this. First-party imports (like zenml_mcp_analytics) are still checked.

Important Implementation Details

Logging: Configured to use stderr and suppress ZenML internal logging to prevent JSON protocol conflicts
Error Handling: All tool functions wrapped with exception handling decorator
Lazy Loading: ZenML client initialized only when needed to improve startup performance
Environment Variables: Server configuration via ZENML_STORE_URL and ZENML_STORE_API_KEY
Type Hints: All public functions have type hints; type checking enforced in CI

Release Process

Triggering a Release

Releases are done via GitHub Actions:

gh workflow run release.yml --repo zenml-io/mcp-zenml -f version=X.Y.Z

This triggers:

Pre-release Tests: Runs smoke tests and Docker build verification as a gate
Release Orchestrator (release.yml): Bumps version files, creates tag, builds .mcpb bundle
Release Docker (release-docker.yml): Triggered by v*.*.* tag push, builds Docker image, publishes to MCP Registry

Note: The release will fail if tests don't pass. This prevents releasing broken builds.

Version Files

Four files must stay in sync (handled by scripts/bump_version.py):

VERSION - Source of truth
manifest.json - DXT/MCPB manifest
server.json - MCP Registry server definition
pyproject.toml - Project configuration (if present)

Debugging MCP Registry Schema Failures

The MCP Registry schema evolves frequently. If the "Publish to MCP Registry" step fails with a deprecated schema error:

Find the current schema version by checking the mcp-publisher source:

curl -s https://raw.githubusercontent.com/modelcontextprotocol/registry/main/pkg/model/constants.go | grep CurrentSchemaVersion

Verify the schema URL exists:

curl -sI "https://static.modelcontextprotocol.io/schemas/YYYY-MM-DD/server.schema.json" | head -1
# Should return HTTP/2 200

Update server.json with the new schema URL
Check the changelog for breaking changes: https://github.com/modelcontextprotocol/registry/blob/main/docs/reference/server-json/CHANGELOG.md

Common Schema Migration Issues

snake_case → camelCase (2025-09-16): Field names like registry_type became registryType
OCI identifier format (2025-12-11): Removed registryBaseUrl and separate version fields; use canonical identifier instead: docker.io/owner/image:version
Removed fields: status and privacy_policies are no longer valid

Release Cleanup

If a release fails partway through, clean up before retrying:

# Delete failed release and tag
gh release delete vX.Y.Z --repo zenml-io/mcp-zenml --yes
git push origin --delete vX.Y.Z

# Then re-trigger with the corrected code
gh workflow run release.yml --repo zenml-io/mcp-zenml -f version=X.Y.Z

Important: The release-docker.yml workflow checks out code at the tag, not from HEAD. If you push a fix to main, you must delete and recreate the tag for the fix to take effect.