PHP
May 14, 2026 · View on GitHub
Pre-compiled tree-sitter grammars for 306 programming languages with PHP bindings via ext-php-rs.
Installation
composer require kreuzberg/tree-sitter-language-pack
Quick Start
See the language guide for php-specific usage.
Features
- 306 languages — pre-compiled tree-sitter grammars covering every major programming language and many minor ones.
- On-demand download + cache — parsers fetched at first use; subsequent runs hit the local cache.
- Code intelligence — extract functions, classes, imports, exports, symbols, docstrings, and diagnostics with one API.
- Syntax-aware chunking — semantic chunks for RAG/LLM pipelines.
- Polyglot bindings — Rust core with native bindings for Python, TypeScript, Go, Java, C#, Ruby, PHP, Elixir, and WebAssembly via alef.
Documentation
- Documentation -- Full docs and API reference
- GitHub Repository -- Source, issues, and discussions
Part of Kreuzberg.dev
- Kreuzberg — document intelligence: text, tables, metadata from 91+ formats with optional OCR.
- Kreuzberg Cloud — managed extraction API with SDKs, dashboards, and observability.
- kreuzcrawl — web crawling and scraping with HTML→Markdown and headless-Chrome fallback.
- html-to-markdown — fast, lossless HTML→Markdown engine.
- liter-llm — universal LLM API client with native bindings for 14 languages and 143 providers.
- alef — the polyglot binding generator that produces this README and all per-language bindings.
- Discord — community, roadmap, announcements.
Contributing
Contributions are welcome! See CONTRIBUTING.md for guidelines.
Join our Discord community for questions and discussion.
License
MIT -- see LICENSE for details.