Zsh LLM CLI Autocomplete Plugin

April 22, 2026 · View on GitHub

Inline command completion for Zsh:

grey ghost text suggestions while you type
press Shift+. (the > key on a US layout) to accept — Tab stays normal Zsh completion
local inference with a base model + LoRA adapter

This project ships a ready-to-use runtime and also includes training utilities for custom adapters.

Demo

Download Trailer.mov (raw file) · Try the demo in your browser — same flow as the trailer; press Space to advance, ← to go back.

Inline shell command completion in Zsh (POSTDISPLAY + region_highlight)
Smart commit message completion from staged diff context
History-aware suggestions (prefix matching + workflow-aware tie-breaker)
Safety filtering for risky git push force flags unless explicitly typed
Unix-socket daemon for low-latency completion (~/.cache/zsh-autocomplete.sock)

All runtime inference is local on your machine.

git clone https://github.com/duoyuncloud/zsh-llm-cli-autocomplete-tool.git
cd zsh-llm-cli-autocomplete-tool
./install.sh
source ~/.zshrc

Then type in your shell; when a grey suggestion appears, press Shift+. (>) to accept it. For example:

Creates/uses venv
Installs runtime dependencies
Downloads pre-trained LoRA adapter from Hugging Face (duoyuncloud/zsh-autocomplete-lora)
Reads adapter metadata to detect the correct base model
Merges base + adapter into a local merged model cache
Adds plugin source lines into ~/.zshrc
Starts daemon: python -m model_completer.daemon

install.sh edits your shell config. If you want completions off without uninstalling, use ai-disable (see below).

The plugin gathers lightweight context (cwd, git info, scripts/targets, recent commands) and sends it to the daemon for completion.

After sourcing the plugin:

ai-help — list available ai-* commands and what they do
ai-setup — install/download/start helpers
ai-status — show daemon/model/enabled state
ai-enable / ai-disable — turn completions on or off (no need to comment out ~/.zshrc lines)
ai-restart — restart daemon
ai-debug — quick diagnostics

Commit intent is detected early (for partial prefixes like git c, git co, git com).

When commit intent is detected:

This repo uses both:

direct history prefix reuse (fast-path)
workflow-aware tie-breaker for ambiguous prefixes (example: after git commit, short ambiguous git ... inputs can prefer git push when transition confidence is strong)

This keeps strong personalized prefix matching while reducing repetitive wrong-next-command loops.

If completions do not appear:

ai-status
ai-debug
ai-restart

Training-related files are intentionally kept in this repository:

Typical flow (advanced):

pip install -r requirements-training.txt
./run_training.sh
./upload_to_huggingface.sh

You can override base/repo through environment variables in scripts:

MIT. See LICENSE.