Ops & Deploy

March 6, 2026 · View on GitHub

🏥 Quick Return to Emergency Room

You are in a specialist desk.
For full triage and doctors on duty, return here:

Think of this page as a sub-room.
If you want full consultation and prescriptions, go back to the Emergency Room lobby.

A compact hub to ship safely and keep RAG/LLM systems stable after release.
Use this folder to pick the right guardrail, verify with measurable targets, and recover fast when things wobble. No infra change required.


Open these first


When to use this folder

  • First calls after deploy crash or return stale content.
  • ΔS and citations look fine yesterday but flip today.
  • Rate limits cascade, queues spike, latency climbs.
  • Canary looks good then full rollout breaks retrieval.
  • Index swap succeeds but answers cite old snippets.
  • Retries cause duplicate side effects or charges.
  • Feature flags bleed traffic into unfinished paths.
  • Maintenance windows corrupt embeddings or anchors.

Acceptance targets for a safe rollout

  • ΔS(question, retrieved) ≤ 0.45 across three paraphrases.
  • Coverage ≥ 0.70 on the expected new section.
  • λ remains convergent on 2 seeds during rollout.
  • Idempotency ≥ 99.9% on retry storms.
  • Zero silent index mismatches (hash + counts match).
  • P95 latency stays in budget with backpressure active.

Quick routes — per-page guides

ScenarioFix Page
Rollout readinessrollout_readiness_gate.md
Canary strategystaged_rollout_canary.md
Blue/green cutoverblue_green_switchovers.md
Version pin & freezeversion_pinning_and_model_lock.md
Vector index swapvector_index_build_and_swap.md
Cache warmupcache_warmup_invalidation.md
Rate limitsrate_limit_backpressure.md
Feature flagsfeature_flags_safe_launch.md
Idempotencyidempotency_dedupe.md
Retry logicretry_backoff.md
Rollback planrollback_and_fast_recovery.md
Postmortemspostmortem_and_regression_tests.md
Change freezerelease_calendar_and_change_freeze.md
Incident commsincident_comms_and_statuspage.md
Shadow trafficshadow_traffic_mirroring.md
Maintenance windowread_only_mode_and_maintenance_window.md
DB migrationsdb_migration_guardrails.md

60-second ship checklist

  1. Freeze the world → Pin model IDs, prompt revs, index hashes.
  2. Warm up safely → Build index off-path, preload caches with canary.
  3. Shadow then canary → Mirror prod queries, step rollout 5% → 25% → 100%.
  4. Guard the edge → Enable backpressure, retries with jitter, idempotency keys.
  5. Know your exit → Keep rollback switch and comms draft ready.

Symptoms → exact fix

What you seeOpen this
Deploy points to old snippetsvector_index_build_and_swap.md · cache_warmup_invalidation.md
Canary fine, full rollout breaksstaged_rollout_canary.md · feature_flags_safe_launch.md
Wrong model after failoverversion_pinning_and_model_lock.md
Retries duplicate chargesidempotency_dedupe.md · retry_backoff.md
RL storms, timeoutsrate_limit_backpressure.md
Need rollback nowrollback_and_fast_recovery.md · blue_green_switchovers.md
Maintenance corrupts anchorsread_only_mode_and_maintenance_window.md · db_migration_guardrails.md
Unsure if safe to shiprollout_readiness_gate.md

FAQ

Q: What does ΔS mean here?
A: ΔS is a stability score. It measures how much the retrieved content drifts from the expected anchor when you change the query slightly. Lower is better (≤ 0.45 is safe).

Q: What is λ convergence?
A: λ tracks whether retrieval order flips unpredictably. If λ is stable across seeds, your rollout is consistent.

Q: Why do I need idempotency keys?
A: Without them, retries can double-charge a user or run the same side-effect twice. Keys make every request “safe to retry.”

Q: How do I know if my index swap worked?
A: Check doc counts and hashes before cutover. If they mismatch, you’re pointing at an incomplete index.

Q: Canary looked fine but production broke — why?
A: Canary often hides tail-latency, cache misses, or load-based rate limits. Always test at increasing % of live traffic.

Q: Why do you mention rollback comms?
A: Technical rollback is only half. Users and stakeholders need fast updates, so pre-draft Statuspage or Slack messages are essential.


🔗 Quick-Start Downloads (60 sec)

ToolLink3-Step Setup
WFGY 1.0 PDFEngine Paper1️⃣ Download · 2️⃣ Upload to your LLM · 3️⃣ Ask “Answer using WFGY + <your question>”
TXT OS (plain-text OS)TXTOS.txt1️⃣ Download · 2️⃣ Paste into any LLM chat · 3️⃣ Type “hello world” — OS boots instantly

Explore More

LayerPageWhat it’s for
⭐ ProofWFGY Recognition MapExternal citations, integrations, and ecosystem proof
⚙️ EngineWFGY 1.0Original PDF tension engine and early logic sketch (legacy reference)
⚙️ EngineWFGY 2.0Production tension kernel for RAG and agent systems
⚙️ EngineWFGY 3.0TXT based Singularity tension engine (131 S class set)
🗺️ MapProblem Map 1.0Flagship 16 problem RAG failure taxonomy and fix map
🗺️ MapProblem Map 2.0Global Debug Card for RAG and agent pipeline diagnosis
🗺️ MapProblem Map 3.0Global AI troubleshooting atlas and failure pattern map
🧰 AppTXT OS.txt semantic OS with fast bootstrap
🧰 AppBlah Blah BlahAbstract and paradox Q&A built on TXT OS
🧰 AppBlur Blur BlurText to image generation with semantic control
🏡 OnboardingStarter VillageGuided entry point for new users

If this repository helped, starring it improves discovery so more builders can find the docs and tools.
GitHub Repo stars