Streaming

June 18, 2026 · View on GitHub

LLM4Free Logo LLM4Free Logo

LLM4Free

Formerly WebScout — Your All-in-One Python Toolkit for Web Search, AI Interaction, Digital Utilities, and More

Access diverse search engines, cutting-edge AI models, temporary communication tools, media utilities, developer helpers, and powerful CLI interfaces -- all through one unified library.

PyPI Version Monthly Downloads Total Downloads Python Version Ask DeepWiki


Table of Contents


Important

LLM4Free uses a single unified OpenAI-compatible interface for all providers. Every provider implements client.chat.completions.create(...) — identical to the OpenAI Python SDK.

Note

LLM4Free supports 40+ AI providers including: HeckAI, ChatGPT, Groq, DeepInfra, Nvidia, Sambanova, OpenRouter, HuggingFace, OllamaSwarm, and many more.

Telegram Group YouTube Buy Me A Coffee


Features

Search & AI

  • Multi-Engine Search -- DuckDuckGo, Bing, Brave, Yahoo, Mojeek, Wikipedia. (Search Docs)
  • 40+ AI Providers -- All OpenAI-compatible for easy switching. (Architecture)
  • AI-Powered Search -- Perplexity, IAsk, Monica, AyeSoul, WebPilotAI.
  • OpenAI-Compatible API Server -- Serve any LLM4Free provider via OpenAI endpoints. (Server Docs)
  • Unified Python Client -- Auto-failover chat and image generation. (Client Docs)

Media & Content

  • Text-to-Image -- PollinationsAI, Together, Miragic, MagicStudio. (TTI Docs)
  • Text-to-Speech -- ElevenLabs, Deepgram, OpenAI FM, Parler, Qwen, MurfAI, and more. (Model Registry)
  • Speech-to-Text -- ElevenLabs STT.
  • YouTube Toolkit -- Video downloads, transcription, API access. (Docs)
  • Weather Tools -- Detailed weather info with ASCII display. (Weather Docs)

Developer Tools

Privacy & Utilities

  • Temp Mail -- Disposable email via Emailnator, MailTM, TempMailIO.
  • Proxy Manager -- Automatic proxy rotation. (Architecture)
  • Awesome Prompts -- Curated system prompts for AI personas. (Prompts Docs)

Installation

pip (Standard)

pip install -U llm4free

# With API server support
pip install -U "llm4free[api]"

# With development tools
pip install -U "llm4free[dev]"
uv add llm4free

# Run without installing
uv run llm4free --help

# Install as global tool
uv tool install llm4free

Docker

docker pull OEvortex/llm4free:latest
docker run -it OEvortex/llm4free:latest

See docs/DOCKER.md for full Docker deployment options including compose profiles.


Quick Start

AI Chat (No API Key)

from llm4free.Provider.Openai_comp.heckai import HeckAI

client = HeckAI()
response = client.chat.completions.create(
    model="google/gemini-2.5-flash-preview",
    messages=[{"role": "user", "content": "Explain quantum computing in simple terms"}],
)
print(response.choices[0].message.content)
from llm4free import DuckDuckGoSearch

search = DuckDuckGoSearch()
results = search.text("best practices for API design", max_results=5)
for result in results:
    print(f"{result['title']}: {result['href']}")

Image Generation

from llm4free.Provider.TTI import PollinationsAI

gen = PollinationsAI()
path = gen.generate_image(prompt="A serene mountain landscape at sunset")
print(f"Saved to: {path}")

See docs/getting-started.md for the full quick-start guide.


Command Line Interface

LLM4Free provides a rich CLI powered by Rich with multi-engine support.

llm4free --help                       # List all commands
llm4free version                      # Show version
llm4free text -k "python programming" # DuckDuckGo search (default)
llm4free images -k "mountains"        # Image search
llm4free news -k "AI breakthrough" -t w  # News from last week
llm4free weather -l "New York"        # Weather info
llm4free translate -k "Hola" --to en  # Translation

Supported Engines

CategoryEngines
textddg, bing, brave, yahoo, mojeek, wikipedia
imagesddg, bing, brave, yahoo
videosddg, brave, yahoo
newsddg, bing, brave, yahoo
suggestionsddg, bing, brave, yahoo
weatherddg, yahoo
answersddg
translateddg
mapsddg
# Use a specific engine
llm4free text -k "climate change" -e bing
llm4free text -k "quantum physics" -e wikipedia

Full CLI reference: docs/cli.md


AI Chat Providers

All providers use the OpenAI-compatible interface (client.chat.completions.create(...)).

Free Providers (No Auth Required)

from llm4free.Provider.Openai_comp.heckai import HeckAI
from llm4free.Provider.Openai_comp.artingai import ArtingAI
from llm4free.Provider.Openai_comp.freeai import FreeAI

# HeckAI - multiple models
client = HeckAI()
response = client.chat.completions.create(
    model="google/gemini-2.5-flash-preview",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

# ArtingAI
client = ArtingAI()
response = client.chat.completions.create(
    model="gpt-5",
    messages=[{"role": "user", "content": "Hello!"}],
)

Authenticated Providers

from llm4free.Provider.Openai_comp.Auth.groq import Groq
from llm4free.Provider.Openai_comp.Auth.deepinfra import DeepInfra

groq = Groq(api_key="your-key")
response = groq.chat.completions.create(
    model="llama-3.3-70b-versatile",
    messages=[{"role": "user", "content": "Write a Python function to sort a list"}],
)
print(response.choices[0].message.content)

Streaming

from llm4free.Provider.Openai_comp.heckai import HeckAI

client = HeckAI()
stream = client.chat.completions.create(
    model="google/gemini-2.5-flash-preview",
    messages=[{"role": "user", "content": "Tell me a joke"}],
    stream=True,
)
for chunk in stream:
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")

See llm4free/llm/ for all available provider implementations.


Search Engines

from llm4free import DuckDuckGoSearch, BingSearch, YahooSearch, BraveSearch

# DuckDuckGo
ddg = DuckDuckGoSearch()
results = ddg.text("python frameworks", max_results=5)

# Bing
bing = BingSearch()
results = bing.text("climate change solutions")

# Brave
brave = BraveSearch()
results = brave.text("machine learning tutorials")

Search docs: docs/search.md


Text-to-Image

from llm4free.Provider.TTI import PollinationsAI, TogetherImage

# PollinationsAI
poll = PollinationsAI()
poll.generate_image(prompt="A cyberpunk city at night")

# Together AI
together = TogetherImage()
together.generate_image(prompt="A robot playing chess")

TTI docs: docs/getting-started.md#image-generation


Text-to-Speech

from llm4free.Provider.TTS import ElevenlabsTTS, ParlerTTS

tts = ElevenlabsTTS()
tts.text_to_speech("Hello, world!", voice="alloy")

TTS model registry: docs/models.md


OpenAI-Compatible API Server

Run a local FastAPI server that serves any LLM4Free provider through standard OpenAI endpoints.

# Start the server
llm4free-server

# Custom config
llm4free-server --port 8080 --host 0.0.0.0 --debug

Use with the OpenAI Python Client

from openai import OpenAI

client = OpenAI(api_key="dummy", base_url="http://localhost:8000/v1")

response = client.chat.completions.create(
    model="ChatGPT/gpt-4o",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

Docker Deployment

docker-compose up llm4free-api
docker-compose -f docker-compose.yml -f docker-compose.no-auth.yml up llm4free-api

Full server docs: docs/openai-api-server.md | Docker: docs/DOCKER.md


Python Client

The unified Client class provides auto-failover across providers with smart model resolution.

from llm4free.client import Client

client = Client(print_provider_info=True)

# Auto provider + model selection
resp = client.chat.completions.create(
    model="auto",
    messages=[{"role": "user", "content": "Summarize LLM4Free."}]
)
print(resp.choices[0].message.content)

# Streaming
stream = client.chat.completions.create(
    model="ChatGPT/gpt-4o-mini",
    messages=[{"role": "user", "content": "Write a limerick about Python."}],
    stream=True,
)
for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)

# Image generation
img = client.images.generate(prompt="A neon owl", model="auto", size="1024x1024")
print(img.data[0].url)

Client docs: docs/client.md


Tool Calling

LLM4Free has a built-in tool calling system that works with any provider.

from llm4free.Provider.Openai_comp.heckai import HeckAI
from llm4free.Provider.Openai_comp.base import Tool

def get_weather(city: str) -> str:
    return f"Weather in {city}: Sunny, 25C"

weather_tool = Tool(
    name="get_weather",
    description="Get current weather for a city.",
    parameters={"city": {"type": "string", "description": "City name."}},
    implementation=get_weather,
)

client = HeckAI(tools=[weather_tool])
response = client.chat.completions.create(
    model="google/gemini-2.5-flash-preview",
    messages=[{"role": "user", "content": "What is the weather in London?"}],
)
print(response.choices[0].message.content)

Tool calling docs: docs/tool-calling.md


Model Registry

Enumerate available models across all providers.

from llm4free import model

# All LLM models
all_models = model.llm.list()
print(f"Total: {len(all_models)}")

# Models by provider
summary = model.llm.summary()
for provider, count in summary.items():
    print(f"  {provider}: {count}")

# TTS voices
voices = model.tts.list()
print(f"Total voices: {len(voices)}")

Model registry docs: docs/models.md


Developer Tools

ToolDescriptionDocs
SwiftCLICLI framework with decoratorsdocs/swiftcli.md
ScoutHTML parser & web crawlerdocs/scout.md
LitPrinterStyled debug printingdocs/litprinter.md
LitAgentUser-agent rotationdocs/litagent.md
GitAPIGitHub data extractiondocs/gitapi.md
GGUFModel conversion & quantizationdocs/gguf.md
ZeroArtASCII art generatordocs/zeroart.md
WeatherWeather toolkitdocs/weather.md
Decorators@timeIt and @retrydocs/decorators.md
SanitizeStream sanitizationdocs/sanitize.md
PromptsSystem prompt managerdocs/awesome-prompts.md

Documentation

ResourceDescription
Getting StartedInstallation, first chat, web search, image generation
ArchitectureSystem design, layers, and data flows
CLI ReferenceAll CLI commands and options
Python ClientUnified client with auto-failover
API ServerOpenAI-compatible FastAPI server
Model RegistryEnumerate LLM, TTS, TTI models
Tool CallingFunction calling with any provider
Search DocsMulti-engine search API
ScoutHTML parser and crawler
Provider DevelopmentCreate custom providers
DeploymentProduction deployment guide
DockerDocker setup and compose profiles
InfernoLocal LLM server
TroubleshootingCommon issues and solutions
ContributingHow to contribute
Provider ModulesAll provider implementations
Docs HubFull documentation index

Contributing

See docs/contributing.md for guidelines.

  1. Fork the repository
  2. Create a feature branch
  3. Make changes with descriptive commits
  4. Submit a pull request

License

Apache-2.0. See LICENSE.md.


Made with by the LLM4Free team