โจ Detailed Features
February 10, 2026 ยท View on GitHub
๐ฅ Core Features
- ๐ Real-time Embeddings - Eliminate heavy embedding storage with dynamic computation using optimized ZMQ servers and highly optimized search paradigm (overlapping and batching) with highly optimized embedding engine
- ๐ง AST-Aware Code Chunking - Intelligent code chunking that preserves semantic boundaries (functions, classes, methods) for Python, Java, C#, and TypeScript files
- ๐ Scalable Architecture - Handles millions of documents on consumer hardware; the larger your dataset, the more LEANN can save
- ๐ฏ Graph Pruning - Advanced techniques to minimize the storage overhead of vector search to a limited footprint
- ๐๏ธ Pluggable Backends - HNSW/FAISS (default), with optional DiskANN for large-scale deployments
๐ ๏ธ Technical Highlights
- ๐ Recompute Mode - Highest accuracy scenarios while eliminating vector storage overhead
- โก Zero-copy Operations - Minimize IPC overhead by transferring distances instead of embeddings
- ๐ High-throughput Embedding Pipeline - Optimized batched processing for maximum efficiency
- ๐ฏ Two-level Search - Novel coarse-to-fine search overlap for accelerated query processing (optional)
- ๐พ Memory-mapped Indices - Fast startup with raw text mapping to reduce memory overhead
- ๐ MLX Support - Ultra-fast recompute/build with quantized embedding models, accelerating building and search (minimal example)
๐จ Developer Experience
- Simple Python API - Get started in minutes
- Extensible backend system - Easy to add new algorithms
- Comprehensive examples - From basic usage to production deployment