AI Coding Tools: Where Pro-Grade Models Are Actually Free

December 5, 2025 · View on GitHub

Last updated: December 5, 2025 • PRs/issues welcome

Languages: Español • Português • 中文 • Français • 日本語 • हिन्दी • Deutsch

AI Coding Tools: Where Pro-Grade Models Are Actually Free

Many AI coding tools claim to be "free," but access to pro-grade models usually runs out fast, then you're downgraded. Each tool uses different limits (credits, tokens, requests), making comparison difficult. This list puts them side by side to show what you actually get for free.

TL;DR — Free Tiers for Pro‑Grade AI Coding

(tools with higher limits listed first)

Tool	Pro‑grade models	Free tier limit	Credit card
Qwen Code	Qwen3-Coder-480B	2,000 requests/day	No
Rovo Dev CLI	Claude Sonnet 4	5M tokens/day (beta)	No
Gemini CLI	Gemini 3 Pro, Gemini 2.5 Pro	Gemini 3 Pro (waitlist/paid), 100 req/day Gemini 2.5 Pro	No
Cursor	GPT-5.1-Codex-Max	Free until Dec 11, 2025 (77.9% SWE-bench)	No
Kilo Code	Claude Opus/Sonnet, Gemini 2.5 Pro, GPT‑4.1	Up to $25 signup credits (one‑time)	Yes
Warp	GPT‑5, Claude Opus 4.1, Claude Sonnet 4, Gemini 2.5 Pro	150 credits/month (first 2 months), then 75/month	No
Trae	Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, GPT‑4.1, GPT‑4o, Gemini 2.5 Pro	10 fast + 50 slow requests/month	No
Amazon Q Developer	Claude Sonnet 4	50 agentic requests/month	Yes
GitHub Copilot	GPT‑4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1	50 chat requests + 2,000 completions/month	No
Windsurf	OpenAI, Anthropic, Google, xAI	25 credits/month	Yes
Jules	Gemini 2.5 Pro	15 tasks/day	No
AWS Kiro	Claude 4 Sonnet, Claude 3.7 Sonnet	50 credits/month	No
Qoder	Qwen3-Coder-480B, Claude, GPT, Gemini	Free tier + 2-week Pro trial (1,000 credits)	No

Qualifying Pro‑Grade Models

Only models achieving >60% on SWE-bench Verified qualify as pro-grade for real-world coding tasks. Below is the current list

Model	SWE-bench Verified	Provider
Claude Opus 4.5	80.9%	Anthropic
GPT-5.1-Codex-Max	77.9%	OpenAI
Claude Sonnet 4.5	77.2% (82.0% w/ parallel)	Anthropic
Gemini 3 Pro	76.2%	Google
GPT-5	74.9%	OpenAI
Claude Opus 4.1	74.5%	Anthropic
Claude Sonnet 4	72.7% (80.2% w/ parallel)	Anthropic
GPT-5 mini	71.0%	OpenAI
Qwen3-Coder-480B	69.6% (interactive) / 67.0% (single)	Alibaba
Gemini 2.5 Pro	63.2%	Google

Contributing

If you spot an error, missing source link, or have updated quota/model information, please open an issue or pull request with a source. New tool contributions are welcomed! See CONTRIBUTING.md for detailed guidelines.

Disclaimer

No affiliation with any vendor. All trademarks belong to their owners. Information is for research; accuracy not guaranteed; limits/pricing change frequently.

1. AI-coding Tools with Free Access to Pro-Grade Models
2. API Providers for AI Coding Tools
3. Tools with Paid Tiers with Pro-Grade Models
4. Tools with Free Access to Basic Models
5. Local Models
Comparison Notes
Related Resources

1. AI-coding Tools with Free Access to Pro-Grade Models

(ordered from most generous to least)

Qwen Code

Qwen3-Coder-480B access

2,000 requests/day free tier via Qwen OAuth
60 requests/minute rate limit
Command-line AI workflow tool (adapted from Gemini CLI)
One-click browser authentication
No credit card required

**** GitHub | Documentation

Rovo Dev CLI

Claude Sonnet 4 access during beta

5M tokens/day free tier (20M on first day only)
Claude Sonnet 4 model (confirmed via testing)
No credit card required during beta
Token limits reset at midnight UTC
Note: Upgrade to Jira Standard/Premium/Enterprise for 20M tokens/day

**** Documentation | Token Limits

Gemini CLI

Gemini 3 Pro and Gemini 2.5 Pro access

Gemini 3 Pro now available (Dec 4, 2025) for Google AI Ultra subscribers and paid API users
Gemini 3 Pro: 76.2% SWE-bench Verified — Google's best coding model
100 requests/day limit for Gemini 2.5 Pro (free tier fallback)
250 requests/day limit for Gemini 2.5 Flash
No credit card required for free tier
Waitlist for Gemini 3 Pro access for Google AI Pro, Gemini Code Assist standard, and free tier users
Enable via /settings → Preview features → true

**** Rate Limits | Pricing | Gemini 3 Pro Announcement

Kilo Code

Claude Opus/Sonnet, Gemini 2.5 Pro, GPT-4.1 access

Up to $25 signup credits (one-time bonus)
Open source VS Code extension
Pay-as-you-go with no markup on model pricing
Credit card required to claim full bonus credits
Supports bringing your own API keys

**** GitHub | Documentation | Pricing

Warp

GPT‑5, Claude Opus 4.1, Claude Sonnet 4, Gemini 2.5 Pro access

150 AI credits/month (first 2 months), then 75 AI credits/month
Multiple providers (OpenAI GPT‑5, Claude Opus 4.1, Claude Sonnet 4, Gemini 2.5 Pro)
No credit card required for basic signup
New pricing structure announced Oct 30, 2025: Single Build plan ($20/mo) with 1,500 credits

**** Pricing

Amazon Q Developer

Claude Sonnet 4 access

50 agentic requests/month limit (multi-turn conversations)
Latest Claude models (AWS-hosted)
Credit card required
Must upgrade to Pro for continued access
Perpetual free tier

**** Pricing

GitHub Copilot

Agent Mode with GPT‑4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1

50 chat requests + 2,000 completions/month limit
Agent Mode with autonomous multi-step coding
Multiple providers (GPT-4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1)
No credit card required
Limited to basic features after quota

**** Plans Details | Agent Mode

Trae

Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, Claude 3.5 Sonnet, GPT‑4.1, GPT‑4o, Gemini 2.5 Pro access

10 fast requests + 50 slow requests/month for premium models
1,000 slow requests/month for advanced models
5,000 auto-completions/month
VS Code-based IDE with AI integration
Multiple premium models including Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, GPT‑4.1
No credit card required for free tier
Pro Plan: $10/mo (600 fast + unlimited slow requests)

**** Pricing | Documentation

Windsurf

OpenAI, Anthropic, Google, xAI model access

25 prompt credits/month limit
Multiple providers (OpenAI, Claude, Gemini, xAI)
Credit card required
Can purchase add-on credits to continue

**** Pricing

Jules

Gemini 2.5 Pro access

15 tasks/day free tier
3 concurrent tasks
Gemini 2.5 Pro model
Gmail account required (18+ years)
Task limits reset on rolling 24-hour window
No credit card required
Pro tier ($19.99/mo): 100 tasks/day (5x limits)

**** Usage Limits | Documentation

AWS Kiro

Claude 4 Sonnet, Claude 3.7 Sonnet access

50 credits/month (Free tier)
Claude 4 Sonnet and Claude 3.7 Sonnet models (AWS-hosted)
No credit card required
14-day welcome bonus: 500 credits
Paid tiers: Pro ($20/mo - 1,000 credits), Pro+ ($40/mo - 2,000 credits), Power ($200/mo - 10,000 credits)

**** Pricing | Introduction Blog

Qoder

Qwen3-Coder-480B, Claude, GPT, Gemini models

Free tier: Unlimited completions/edits + limited chat/agent requests + 2-week Pro trial (1,000 credits)
AI-powered IDE from Alibaba
Available for Windows and macOS
Primarily uses Qwen3-Coder-480B (Alibaba's flagship coding model)
Also supports Claude, GPT-4, Gemini models
Agent Mode and Quest Mode for autonomous coding
No credit card required (free tier)
Paid tiers: Pro ($20/mo - 2,000 credits), Pro+ ($60/mo - 6,000 credits)

**** Homepage | Pricing

Limits change fast. If you see a mistake, a newer quota/model, or want to add a new tool, open an issue or PR with a source. See CONTRIBUTING.md for guidelines.

2. API Providers for AI Coding Tools

(ordered from most generous to least)

These services provide API access to coding-optimized models that integrate with popular AI coding tools like Cursor, Continue.dev, Cline, and others. They don't provide standalone coding tools but offer the AI backend for existing tools.

OpenRouter

Qwen3-Coder-480B via OpenRouter

50 requests/day free tier (1,000/day if purchased $10+ credits)
Additional free models: Qwen3-30B-A3B, Qwen3-235B-A22B, Gemini Flash
OpenAI-compatible API for all major IDEs
No credit card required for free models
20 requests/minute rate limit for free tier
Works with Continue.dev, Cline, Cursor, etc.

**** Free Models | Qwen3-Coder API

Cerebras

Qwen3-235B and Llama 3.1 access

Free tier: 1M tokens/day
No credit card required
Rate limit: 30 requests/minute, 8,192 token context
Models: Qwen3-235B, Llama 3.1 70B (Note: Qwen3-Coder-480B deprecated Nov 5, 2025)
OpenAI-compatible API (works with Cursor, Continue.dev, Cline, RooCode, etc.)
Ultra-fast inference: 2,000 tokens/second (40x faster than typical providers)
Paid tiers: Developer ($10+ self-serve), Enterprise (custom pricing)

**** Pricing | API Docs | Integration Guides

3. Tools with Paid Tiers with Pro-Grade Models

Rovo Dev CLI

Jira Standard ($7.53/user/mo): 20M tokens/day

Jira Premium ($15.25/user/mo): 20M tokens/day
Jira Enterprise (custom): 20M tokens/day
4x increase from free tier (5M → 20M tokens/day)
Same Claude-based model as free tier
Token limits reset at midnight UTC

**** Documentation | Token Limits | Jira Pricing

Claude Code

Pro ($20/mo or $17/mo annually): Sonnet 4 access with more usage than free tier

Max 5x ($100/mo): ~225 messages/5 hours — 140–280h Sonnet 4 + 15–35h Opus 4.5 weekly
Max 20x ($200/mo): ~900 messages/5 hours — 240–480h Sonnet 4 + 24–40h Opus 4.5 weekly
Extended thinking modes: "think" (~4K tokens), "megathink" (~10K), "ultrathink" (~32K)
Ultrathink enables complex refactors, system architecture, and deep debugging
Opus 4.5 consumes ~5x more resources than Sonnet 4
Usage limits reset weekly with 5-hour rolling windows
Works with Opus 4.5, Sonnet 4.5, and Haiku 4.5 models

**** Pricing | Claude Code Guide

Amazon Q Developer

Pro ($19/mo): Increased limits for agentic requests

Usage may be adjusted based on regional factors and usage patterns

**** Pricing

Warp

Build ($20/mo): 1,500 AI credits/month

Reload Credits available (up to 50% cheaper than old overage rates, roll over for 12 months)
Bring Your Own API Key (BYOK) option available
New pricing effective immediately for new customers (Oct 30, 2025)
Existing monthly subscribers transition on first renewal after Dec 1, 2025
Enterprise tier: Custom pricing

**** Pricing

GitHub Copilot

Pro ($10/mo): 300 premium requests + unlimited completions/month

Pro+ ($39/mo): 1,500 premium requests + unlimited completions/month
Business ($19/user/mo): 300 premium requests + unlimited completions/user/month
Enterprise ($39/user/mo): 1,000 premium requests + unlimited completions/user/month
GPT-5.1-Codex-Max now available in public preview (Dec 4, 2025) for Pro, Pro+, Business, Enterprise
Access to multiple models (GPT-5.1-Codex-Max, GPT-4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1)
Overage billing available at $0.04/request

**** Plans Details | GPT-5.1-Codex-Max Preview

Trae

Pro ($10/mo): 600 fast requests + unlimited slow requests for premium models

Unlimited slow requests for advanced models
Zero rate limits and faster access to premium models
Extra packages available: $3-$12 for additional fast requests
Multiple premium models: Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, Claude 3.5 Sonnet, Gemini 2.5 Pro, GPT‑4.1, GPT‑4o
VS Code-based IDE with full AI integration
First month available for $3

**** Pricing | Documentation

Windsurf

Pro ($15/mo): 500 prompt credits/month

Teams ($30/user/mo): 500 prompt credits/user/month
Enterprise ($60+/user/mo): 1,000 prompt credits/user/month

**** Pricing

Lovable

Pro ($25/mo): 150 credits/month (5 daily credits)

Teams ($30/mo): Higher limits (undisclosed)

**** Messaging Limits

Bolt.new

$20/mo: 10M tokens/month

$200/mo: 120M tokens/month

**** Token Documentation

Cursor

Hobby (Free): Limited Agent requests + Limited Tab completions + 1-week Pro trial

Pro ($20/mo or $16/mo annually): Extended Agent limits + Unlimited Tab completions + Background Agents + Maximum context windows
Pro+ ($60/mo): 3x usage on all OpenAI, Claude, Gemini models
Ultra ($200/mo): 20x usage on all OpenAI, Claude, Gemini models + Priority access to new features
Teams ($40/user/mo): Pro features + Centralized billing + Usage analytics + SAML/OIDC SSO
Enterprise (Custom): Everything in Teams + Pooled usage + SCIM + AI code tracking API + Audit logs
GPT-5.1-Codex-Max free for all users until Dec 11, 2025 (77.9% SWE-bench Verified)
One-week Pro trial available (free tier)
Free tier now uses token-based usage tracking (not request-based)
Free models: Cursor Small, Deepseek v3, Gemini 2.5 Flash, GPT-4o mini (500/day limit), Grok 3 Mini Beta
Paid tiers: Access to OpenAI, Claude, Gemini models including GPT-5.1-Codex-Max
Note: Claude models removed from free tier ~June 2025
AI-powered code editor with autonomous coding capabilities

**** Pricing | GPT-5.1-Codex-Max Announcement

OpenAI Codex CLI

Free with ChatGPT Plus ($20/mo): 30–150 messages/5 hours for coding tasks

ChatGPT Pro ($200/mo): 300–1,500 messages/5 hours — highest usage limits
Pay-as-you-go API: GPT-5.1-Codex-Max at $1.25/$10 per million tokens (input/output)
Free OSS mode: Access to open-source models only (via --oss flag)
GPT-5.1-Codex-Max (Nov 19, 2025): 77.9% SWE-bench Verified — now default model
First model with "compaction" for multi-million token sessions (24+ hour tasks)
30% fewer thinking tokens than previous GPT-5.1-Codex
Also available in GitHub Copilot (Pro, Pro+, Business, Enterprise)
Windows support now included
Cross-platform: macOS 12+, Ubuntu 20.04+, Windows 11 via WSL2

**** GitHub Repo | GPT-5.1-Codex-Max Announcement

Codeium

Pro ($10/mo): Unlimited usage with advanced context awareness

Claude 3.5 Sonnet, GPT-4o access
Enhanced context window and personalization
Teams ($12/user/mo): Pro features + team management
Enterprise (Custom): On-premise deployment, custom models

**** Pricing

Tabnine

Pro ($12/mo): Enhanced AI completions and chat

Enterprise ($39/user/mo): Multiple LLMs, private deployment
Models: Claude 3.5 Sonnet, GPT-4o, Llama 3.3 70B, proprietary models
600+ programming languages supported
On-premises and air-gapped deployment options
Bring your own fine-tuned models

**** Pricing

JetBrains AI Assistant

AI Pro ($15/mo): Increased cloud quota + unlimited local models

AI Ultimate ($25/mo): Maximum cloud quota + advanced features
Free tier: Unlimited code completion + local models + limited cloud quota
30-day Pro trial included
All Products Pack includes AI Pro
Offline mode with local models via Ollama/LM Studio

**** AI Pricing

Jules

Pro ($19.99/mo via Google AI Pro): 100 tasks/day

5x higher limits than free tier (15 tasks/day → 100 tasks/day)
5x concurrent tasks (3 → 15 concurrent)
Higher access to latest models
Ultra (via Google AI Ultra): 300 tasks/day
20x higher limits than free tier
60 concurrent tasks
Priority access to latest models
Gmail account required (18+ years)

**** Usage Limits | Google AI Plans

SuperMaven

Pro ($10/mo): 1M token context window + chat credits

Alternative: $99/year
Chat interface with GPT-4o, Claude 3.5 Sonnet, GPT-4
Team ($10/user/mo): Pro features + team management
Note: Merged with Cursor IDE in November 2024

**** Pricing

Know better pricing or limits? Share a link in an issue or PR to help keep this updated. See CONTRIBUTING.md for guidelines.

4. Tools with Free Access to Basic Models

(unspecified/basic models)

Bolt.new

Unspecified models

1M tokens/month limit
Specific model not publicly specified
Credit card required

**** Token Documentation

Lovable

Unspecified models

5 daily credits, max 30 per month (free)
Models not publicly enumerated
Credit card required

**** Messaging Limits

v0.dev

Proprietary models (not frontier)

GPT-5 access requires v0 Premium subscription
$5 in credits/month limit
Uses proprietary models with varied routing
Credit card required

**** Updated Pricing Blog

Codeium

Unlimited free usage of basic AI coding assistance

Individual plan: Free forever with unlimited code completions, AI chat, commands
70+ programming languages supported
IDE integrations: VS Code, JetBrains, Vim/Neovim, Jupyter
No credit card required
Limited context awareness (expanded in paid tiers)
Base model only (Llama 3.1 70B), pro-graded models require subscription

**** Pricing | Documentation

Tabnine

Free tier with limited features

Basic AI code completions and chat (limited)
Local processing available
Context heavily limited in free tier
Performance dialed down to save resources
600+ programming languages supported

**** Pricing

JetBrains AI Assistant

AI Free tier included with IDEs

Unlimited code completion and local model support
Limited quota for cloud-based features
30-day AI Pro trial
Chat, code generation, commit messages with local models

**** AI Features

SuperMaven

Free tier with basic features

Basic code suggestions
7-day data retention limit
Credit card required for registration
1M token context window (impressive for free tier)

**** Pricing

Continue.dev

Free open-source extension with flexible model support

Free VS Code and JetBrains extension
Full support for local models via Ollama, LM Studio
Solo tier: Private/team/public visibility options
Supports 200+ models (requires your own API keys for cloud models)
Community hub for custom AI assistants
No vendor lock-in or usage limits for local models

**** GitHub | Model Hub

Know the official limits or models? Share a link in an issue or PR to update the information. See CONTRIBUTING.md for guidelines.

5. Local Models

Running open-weight frontier models locally provides unlimited coding assistance without API costs or usage limits. Popular tools for local deployment include Cline (VS Code extension with Plan/Act modes and MCP support), Aider (command-line assistant with built-in Git integration), and Continue.dev (open-source VS Code extension supporting 200+ models). All work seamlessly with Ollama to run frontier models like Devstral (24B parameters, optimized for agentic coding), Qwen3-Coder, DeepSeek Coder V2, Codestral, and GLM-4.5.

Note: Frontier models require substantial RAM/VRAM. In particular, for Qwen3‑Coder‑480B the Ollama‑friendly GGUF is ~150GB, and practical local inference can require ~150GB of unified memory (RAM+VRAM), which makes it hard on typical laptops; the 30B quant commonly needs ~18GB. See the Unsloth Qwen3‑Coder local guide for details (docs) and Simon Willison's article on running GLM‑4.5 AIR on his laptop to build Space Invaders for a practical example.

Comparison Notes

Goal: Compare AI coding tools by their access to pro-grade models and free tier limits.
What qualifies a model as "pro-grade"? Models must achieve ≥60% on SWE-bench Verified, demonstrating real-world software engineering capability. Current qualifying models: Claude Opus 4.5 (80.9%), GPT-5.1-Codex-Max (77.9%), Claude Sonnet 4.5 (77.2%), Gemini 3 Pro (76.2%), GPT-5 (74.9%), Claude Opus 4.1 (74.5%), Claude Sonnet 4 (72.7%), GPT-5 mini (71.0%), Qwen3-Coder-480B (69.6%), and Gemini 2.5 Pro (63.2%).
Different limit types: Tools use various quota systems - requests, tokens, credits, chats - making direct comparison challenging. Check documentation for specifics.
Real-world usage: Actual consumption varies dramatically based on coding style, task complexity, and tool implementation.

Coding with AI - Practical techniques and resources for coding with LLMs
Free LLM API Resources - Comprehensive list of free LLM APIs for building custom integrations