Fully Local VMware Operations with Ollama
March 9, 2026 · View on GitHub
Run VMware infrastructure operations using a local LLM — no cloud API keys required.
Prerequisites
- Ollama installed: https://ollama.com
- vmware-aiops installed:
uv tool install vmware-aiops - VMware config ready:
~/.vmware-aiops/config.yaml+~/.vmware-aiops/.env
Step 1: Pull a local model
# Recommended: Qwen2.5-Coder 32B (best tool-calling accuracy, needs 24GB VRAM)
ollama pull qwen2.5-coder:32b
# Alternative: 14B (needs 10GB VRAM)
ollama pull qwen2.5-coder:14b
# Lightweight: 7B (needs 6GB VRAM, lower accuracy for complex operations)
ollama pull qwen2.5-coder:7b
Step 2: Choose your agent
Option A: Aider (simplest)
# Install aider
pip install aider-chat
# Run with Ollama + vmware-aiops conventions
aider --conventions codex-skill/AGENTS.md --model ollama/qwen2.5-coder:32b
Then ask in natural language:
> List all VMs on my lab ESXi
> Show active alarms on vcenter-prod
> What's the datastore usage?
Option B: Goose (Block)
Edit ~/.config/goose/config.yaml:
extensions:
vmware-aiops:
name: VMware AIops
cmd: vmware-aiops-mcp
enabled: true
type: stdio
timeout: 300
envs:
VMWARE_AIOPS_CONFIG: ~/.vmware-aiops/config.yaml
Then:
goose session
> Check health status of all my VMware hosts
Option C: Continue (VS Code)
Add to your Continue MCP config:
mcpServers:
- name: vmware-aiops
command: vmware-aiops-mcp
env:
VMWARE_AIOPS_CONFIG: ~/.vmware-aiops/config.yaml
Configure Ollama as your model provider in Continue settings.
Step 3: Read-only mode (recommended for production)
For production environments, use vmware-monitor instead:
uv tool install vmware-monitor
# Aider
aider --conventions codex-skill/AGENTS.md --model ollama/qwen2.5-coder:32b
# Or configure MCP with vmware-monitor-mcp instead of vmware-aiops-mcp
vmware-monitor has zero destructive operations in its codebase — safe to use with any model.
Model comparison for VMware operations
| Model | VRAM | Tool calling | Complex ops | Recommended for |
|---|---|---|---|---|
| Qwen2.5-Coder 32B | 24GB | ~90% | Good | Full operations |
| Qwen2.5-Coder 14B | 10GB | ~80% | Fair | Monitoring + simple ops |
| Qwen2.5-Coder 7B | 6GB | ~60% | Poor | Monitoring only |
| DeepSeek-Coder-V2 16B | 12GB | ~75% | Fair | Alternative to Qwen |
Troubleshooting
# Verify vmware-aiops config and connection
vmware-aiops inventory vms --target <your-target>
# Verify Ollama is running
ollama list
# Check MCP server works standalone
echo '{"jsonrpc":"2.0","method":"initialize","params":{"protocolVersion":"2024-11-05","capabilities":{},"clientInfo":{"name":"test","version":"1.0"}},"id":1}' | vmware-aiops-mcp