README.md

May 3, 2026 · View on GitHub

Note

This README was generated by SKILL, get the ZH version from here.

EMBEDDED KV WITH JSON QUERY AND SEMANTIC VECTOR SEARCH!

A Go embedded KV database with dot-notation JSON queries, inline vector embeddings, cosine top-K search, and AOF persistence

Use Cases
Features
Architecture
File Structure
Version History
License
Author
Stars

Use Cases

ToriiDB is a four-in-one embedded database — a single Go binary providing:

KV cache — Redis-style commands and data model
Document DB — MongoDB-style JSON field queries and infix expressions
Vector DB — OpenAI embeddings inlined on each key, cosine top-K semantic retrieval
Local persistence — AOF append log + JSON snapshots, with an in-memory parsed cache

No extra service, no secondary index engine, no separate vector store; embeddings live inline on each key and flow through the same AOF and compaction paths as the KV values. Aimed at Go projects that want to replace a Redis + MongoDB + Pinecone stack with a single import, not run one behind a network.

Best suited for single-process, single-host embedded scenarios with up to ~10k entries per database:

CLI tools — local config and state storage
Prototypes / MVPs — skip the DB setup and embed directly
Desktop apps and IoT devices — embedded storage, cross-compile friendly
LINE bots and Discord bots — conversation state and user data
AI personal assistants — memory, chat history, preferences, context cache, semantic recall
Small-scale RAG and semantic search — documents, FAQs, notes with inline embeddings
Single-host API servers — sessions, tokens, config, and other lightweight storage
Personal blogs and small CMS backends — up to a few thousand posts or users
Small-to-mid projects that don't warrant Redis / MongoDB / SQLite / a dedicated vector DB

Not suitable for: high-concurrency online services, datasets larger than memory, cross-machine access, multi-process writers, heavy queries that require index acceleration, or vector corpora beyond ~10k entries where linear scan becomes a bottleneck.

A MongoDB export utility is planned; the data model maps across natively.

Features

go get github.com/agenvoy/toriidb · Documentation

Full Architecture

graph TB
    Client[REPL / Embed API] --> Exec[Exec Router]
    Exec --> Core[core - db index + embedder]
    Core --> Store[Store - 16 DBs]
    Core --> Filter[Filter Engine]
    Core --> Vector[Vector Engine]
    Store --> Memory[In-Memory Map]
    Store --> AOF[AOF Append Log]
    Store --> Snapshot[MD5 JSON Snapshot]
    Filter --> Scan[Chunked Parallel Scan]
    Vector --> TopK[Top-K Cosine Scan]
    Vector --> EmbedCache[Embedding Cache]
    Vector --> OpenAI[OpenAI Embedding Client]
    Scan --> Memory
    TopK --> Memory

File Structure

ToriiDB/
├── cmd/
│   └── test/
│       └── main.go              # REPL entry point
├── core/
│   ├── openai/                  # text-embedding-3-small client (singleton)
│   ├── store/                   # storage engine and command impls
│   │   ├── vector.go            # float32 codec + cosine + isInternal
│   │   ├── vcache.go            # __torii:embed:* cache
│   │   ├── vsearch.go           # top-K min-heap cosine scan
│   │   ├── vsim.go              # VSim + VGet
│   │   └── filter/              # query expression and operators
│   └── utils/                   # shared helpers
├── .env                         # OPENAI_API_KEY (optional, for vector features)
├── go.mod
├── Makefile
└── README.md

Version History

Version	Date	Highlights
v0.5.0	2026-04-18	Semantic vector search — `SET ... VECTOR` inline embeddings, `VSEARCH` top-K cosine, `VSIM` / `VGET`, content-hash embedding cache under `__torii:embed:*`, AOF vector persistence
v0.4.4	2026-04-16	Parsed JSON cache in `Entry` to eliminate repeated `Unmarshal` on hot paths; split-lock read/write APIs
v0.4.3	2026-04-10	AOF compaction switched from line count to byte size with 1MB floor
v0.4.2	2026-04-10	Inline AOF compaction triggered when inflation ratio exceeds 2x
v0.4.1	2026-04-10	Shared `core` struct enabling `Session`, lazy DB replay, parallel compaction, custom storage path
v0.4.0	2026-04-09	NE operator, dedicated `filter` package with infix expression parser (AND/OR/NOT), concurrent slice scan
v0.3.0	2026-04-08	FIND / QUERY commands with EQ/GT/GE/LT/LE/LIKE operators, LIMIT clause, time-based sorting
v0.2.0	2026-04-08	Document field ops (GetField/SetField/DelField), KEYS glob matching, INCR with dot-notation nested fields
v0.1.0	2026-04-08	Initial release — REPL, KV commands (SET/GET/DEL/EXIST/TYPE), AOF persistence, TTL, 16 databases (DB 0-15)

License

This project is licensed under the MIT LICENSE.

README.md

Table of Contents

Use Cases

Features

Redis × Mongo × Vector Hybrid DX

Inline Per-Key Vector Embeddings

Content-Addressed Embedding Cache

Top-K Cosine Search

16 Databases with Lazy Replay

Dual Persistence

In-Place JSON Field Ops

Infix Query Expression

Auto-Chunked Parallel Scan

Split-Lock Parsed Cache

Independent Sessions

Size-Triggered Compaction

Reserved Internal Namespace

Automatic Type Detection

Architecture

File Structure

Version History

License

Author

邱敬幃 Pardn Chiu

Stars