AI Coding Tools: Where Pro-Grade Models Are Actually Free

December 5, 2025 · View on GitHub

Last updated: December 5, 2025 • PRs/issues welcome

Languages: EspañolPortuguês中文Français日本語हिन्दीDeutsch

AI Coding Tools: Where Pro-Grade Models Are Actually Free

Many AI coding tools claim to be "free," but access to pro-grade models usually runs out fast, then you're downgraded. Each tool uses different limits (credits, tokens, requests), making comparison difficult. This list puts them side by side to show what you actually get for free.

TL;DR — Free Tiers for Pro‑Grade AI Coding

(tools with higher limits listed first)

ToolPro‑grade modelsFree tier limitCredit card
Qwen CodeQwen3-Coder-480B2,000 requests/dayNo
Rovo Dev CLIClaude Sonnet 45M tokens/day (beta)No
Gemini CLIGemini 3 Pro, Gemini 2.5 ProGemini 3 Pro (waitlist/paid), 100 req/day Gemini 2.5 ProNo
CursorGPT-5.1-Codex-MaxFree until Dec 11, 2025 (77.9% SWE-bench)No
Kilo CodeClaude Opus/Sonnet, Gemini 2.5 Pro, GPT‑4.1Up to $25 signup credits (one‑time)Yes
WarpGPT‑5, Claude Opus 4.1, Claude Sonnet 4, Gemini 2.5 Pro150 credits/month (first 2 months), then 75/monthNo
TraeClaude 4 Sonnet (Beta), Claude 3.7 Sonnet, GPT‑4.1, GPT‑4o, Gemini 2.5 Pro10 fast + 50 slow requests/monthNo
Amazon Q DeveloperClaude Sonnet 450 agentic requests/monthYes
GitHub CopilotGPT‑4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 150 chat requests + 2,000 completions/monthNo
WindsurfOpenAI, Anthropic, Google, xAI25 credits/monthYes
JulesGemini 2.5 Pro15 tasks/dayNo
AWS KiroClaude 4 Sonnet, Claude 3.7 Sonnet50 credits/monthNo
QoderQwen3-Coder-480B, Claude, GPT, GeminiFree tier + 2-week Pro trial (1,000 credits)No

Qualifying Pro‑Grade Models

Only models achieving >60% on SWE-bench Verified qualify as pro-grade for real-world coding tasks. Below is the current list

ModelSWE-bench VerifiedProvider
Claude Opus 4.580.9%Anthropic
GPT-5.1-Codex-Max77.9%OpenAI
Claude Sonnet 4.577.2% (82.0% w/ parallel)Anthropic
Gemini 3 Pro76.2%Google
GPT-574.9%OpenAI
Claude Opus 4.174.5%Anthropic
Claude Sonnet 472.7% (80.2% w/ parallel)Anthropic
GPT-5 mini71.0%OpenAI
Qwen3-Coder-480B69.6% (interactive) / 67.0% (single)Alibaba
Gemini 2.5 Pro63.2%Google

Contributing

If you spot an error, missing source link, or have updated quota/model information, please open an issue or pull request with a source. New tool contributions are welcomed! See CONTRIBUTING.md for detailed guidelines.

Disclaimer

No affiliation with any vendor. All trademarks belong to their owners. Information is for research; accuracy not guaranteed; limits/pricing change frequently.

Contents

1. AI-coding Tools with Free Access to Pro-Grade Models

(ordered from most generous to least)

Qwen Code

Qwen3-Coder-480B access

  • 2,000 requests/day free tier via Qwen OAuth
  • 60 requests/minute rate limit
  • Command-line AI workflow tool (adapted from Gemini CLI)
  • One-click browser authentication
  • No credit card required

**** GitHub | Documentation


Rovo Dev CLI

Claude Sonnet 4 access during beta

  • 5M tokens/day free tier (20M on first day only)
  • Claude Sonnet 4 model (confirmed via testing)
  • No credit card required during beta
  • Token limits reset at midnight UTC
  • Note: Upgrade to Jira Standard/Premium/Enterprise for 20M tokens/day

**** Documentation | Token Limits


Gemini CLI

Gemini 3 Pro and Gemini 2.5 Pro access

  • Gemini 3 Pro now available (Dec 4, 2025) for Google AI Ultra subscribers and paid API users
  • Gemini 3 Pro: 76.2% SWE-bench Verified — Google's best coding model
  • 100 requests/day limit for Gemini 2.5 Pro (free tier fallback)
  • 250 requests/day limit for Gemini 2.5 Flash
  • No credit card required for free tier
  • Waitlist for Gemini 3 Pro access for Google AI Pro, Gemini Code Assist standard, and free tier users
  • Enable via /settings → Preview features → true

**** Rate Limits | Pricing | Gemini 3 Pro Announcement


Kilo Code

Claude Opus/Sonnet, Gemini 2.5 Pro, GPT-4.1 access

  • Up to $25 signup credits (one-time bonus)
  • Open source VS Code extension
  • Pay-as-you-go with no markup on model pricing
  • Credit card required to claim full bonus credits
  • Supports bringing your own API keys

**** GitHub | Documentation | Pricing


Warp

GPT‑5, Claude Opus 4.1, Claude Sonnet 4, Gemini 2.5 Pro access

  • 150 AI credits/month (first 2 months), then 75 AI credits/month
  • Multiple providers (OpenAI GPT‑5, Claude Opus 4.1, Claude Sonnet 4, Gemini 2.5 Pro)
  • No credit card required for basic signup
  • New pricing structure announced Oct 30, 2025: Single Build plan ($20/mo) with 1,500 credits

**** Pricing


Amazon Q Developer

Claude Sonnet 4 access

  • 50 agentic requests/month limit (multi-turn conversations)
  • Latest Claude models (AWS-hosted)
  • Credit card required
  • Must upgrade to Pro for continued access
  • Perpetual free tier

**** Pricing


GitHub Copilot

Agent Mode with GPT‑4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1

  • 50 chat requests + 2,000 completions/month limit
  • Agent Mode with autonomous multi-step coding
  • Multiple providers (GPT-4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1)
  • No credit card required
  • Limited to basic features after quota

**** Plans Details | Agent Mode


Trae

Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, Claude 3.5 Sonnet, GPT‑4.1, GPT‑4o, Gemini 2.5 Pro access

  • 10 fast requests + 50 slow requests/month for premium models
  • 1,000 slow requests/month for advanced models
  • 5,000 auto-completions/month
  • VS Code-based IDE with AI integration
  • Multiple premium models including Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, GPT‑4.1
  • No credit card required for free tier
  • Pro Plan: $10/mo (600 fast + unlimited slow requests)

**** Pricing | Documentation


Windsurf

OpenAI, Anthropic, Google, xAI model access

  • 25 prompt credits/month limit
  • Multiple providers (OpenAI, Claude, Gemini, xAI)
  • Credit card required
  • Can purchase add-on credits to continue

**** Pricing


Jules

Gemini 2.5 Pro access

  • 15 tasks/day free tier
  • 3 concurrent tasks
  • Gemini 2.5 Pro model
  • Gmail account required (18+ years)
  • Task limits reset on rolling 24-hour window
  • No credit card required
  • Pro tier ($19.99/mo): 100 tasks/day (5x limits)

**** Usage Limits | Documentation


AWS Kiro

Claude 4 Sonnet, Claude 3.7 Sonnet access

  • 50 credits/month (Free tier)
  • Claude 4 Sonnet and Claude 3.7 Sonnet models (AWS-hosted)
  • No credit card required
  • 14-day welcome bonus: 500 credits
  • Paid tiers: Pro ($20/mo - 1,000 credits), Pro+ ($40/mo - 2,000 credits), Power ($200/mo - 10,000 credits)

**** Pricing | Introduction Blog


Qoder

Qwen3-Coder-480B, Claude, GPT, Gemini models

  • Free tier: Unlimited completions/edits + limited chat/agent requests + 2-week Pro trial (1,000 credits)
  • AI-powered IDE from Alibaba
  • Available for Windows and macOS
  • Primarily uses Qwen3-Coder-480B (Alibaba's flagship coding model)
  • Also supports Claude, GPT-4, Gemini models
  • Agent Mode and Quest Mode for autonomous coding
  • No credit card required (free tier)
  • Paid tiers: Pro ($20/mo - 2,000 credits), Pro+ ($60/mo - 6,000 credits)

**** Homepage | Pricing

Limits change fast. If you see a mistake, a newer quota/model, or want to add a new tool, open an issue or PR with a source. See CONTRIBUTING.md for guidelines.


2. API Providers for AI Coding Tools

(ordered from most generous to least)

These services provide API access to coding-optimized models that integrate with popular AI coding tools like Cursor, Continue.dev, Cline, and others. They don't provide standalone coding tools but offer the AI backend for existing tools.

OpenRouter

Qwen3-Coder-480B via OpenRouter

  • 50 requests/day free tier (1,000/day if purchased $10+ credits)
  • Additional free models: Qwen3-30B-A3B, Qwen3-235B-A22B, Gemini Flash
  • OpenAI-compatible API for all major IDEs
  • No credit card required for free models
  • 20 requests/minute rate limit for free tier
  • Works with Continue.dev, Cline, Cursor, etc.

**** Free Models | Qwen3-Coder API


Cerebras

Qwen3-235B and Llama 3.1 access

  • Free tier: 1M tokens/day
  • No credit card required
  • Rate limit: 30 requests/minute, 8,192 token context
  • Models: Qwen3-235B, Llama 3.1 70B (Note: Qwen3-Coder-480B deprecated Nov 5, 2025)
  • OpenAI-compatible API (works with Cursor, Continue.dev, Cline, RooCode, etc.)
  • Ultra-fast inference: 2,000 tokens/second (40x faster than typical providers)
  • Paid tiers: Developer ($10+ self-serve), Enterprise (custom pricing)

**** Pricing | API Docs | Integration Guides


3. Tools with Paid Tiers with Pro-Grade Models

Rovo Dev CLI

Jira Standard ($7.53/user/mo): 20M tokens/day

  • Jira Premium ($15.25/user/mo): 20M tokens/day
  • Jira Enterprise (custom): 20M tokens/day
  • 4x increase from free tier (5M → 20M tokens/day)
  • Same Claude-based model as free tier
  • Token limits reset at midnight UTC

**** Documentation | Token Limits | Jira Pricing


Claude Code

Pro ($20/mo or $17/mo annually): Sonnet 4 access with more usage than free tier

  • Max 5x ($100/mo): ~225 messages/5 hours — 140–280h Sonnet 4 + 15–35h Opus 4.5 weekly
  • Max 20x ($200/mo): ~900 messages/5 hours — 240–480h Sonnet 4 + 24–40h Opus 4.5 weekly
  • Extended thinking modes: "think" (~4K tokens), "megathink" (~10K), "ultrathink" (~32K)
  • Ultrathink enables complex refactors, system architecture, and deep debugging
  • Opus 4.5 consumes ~5x more resources than Sonnet 4
  • Usage limits reset weekly with 5-hour rolling windows
  • Works with Opus 4.5, Sonnet 4.5, and Haiku 4.5 models

**** Pricing | Claude Code Guide


Amazon Q Developer

Pro ($19/mo): Increased limits for agentic requests

  • Usage may be adjusted based on regional factors and usage patterns

**** Pricing


Warp

Build ($20/mo): 1,500 AI credits/month

  • Reload Credits available (up to 50% cheaper than old overage rates, roll over for 12 months)
  • Bring Your Own API Key (BYOK) option available
  • New pricing effective immediately for new customers (Oct 30, 2025)
  • Existing monthly subscribers transition on first renewal after Dec 1, 2025
  • Enterprise tier: Custom pricing

**** Pricing


GitHub Copilot

Pro ($10/mo): 300 premium requests + unlimited completions/month

  • Pro+ ($39/mo): 1,500 premium requests + unlimited completions/month
  • Business ($19/user/mo): 300 premium requests + unlimited completions/user/month
  • Enterprise ($39/user/mo): 1,000 premium requests + unlimited completions/user/month
  • GPT-5.1-Codex-Max now available in public preview (Dec 4, 2025) for Pro, Pro+, Business, Enterprise
  • Access to multiple models (GPT-5.1-Codex-Max, GPT-4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1)
  • Overage billing available at $0.04/request

**** Plans Details | GPT-5.1-Codex-Max Preview


Trae

Pro ($10/mo): 600 fast requests + unlimited slow requests for premium models

  • Unlimited slow requests for advanced models
  • Zero rate limits and faster access to premium models
  • Extra packages available: $3-$12 for additional fast requests
  • Multiple premium models: Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, Claude 3.5 Sonnet, Gemini 2.5 Pro, GPT‑4.1, GPT‑4o
  • VS Code-based IDE with full AI integration
  • First month available for $3

**** Pricing | Documentation


Windsurf

Pro ($15/mo): 500 prompt credits/month

  • Teams ($30/user/mo): 500 prompt credits/user/month
  • Enterprise ($60+/user/mo): 1,000 prompt credits/user/month

**** Pricing


Lovable

Pro ($25/mo): 150 credits/month (5 daily credits)

  • Teams ($30/mo): Higher limits (undisclosed)

**** Messaging Limits


Bolt.new

$20/mo: 10M tokens/month

  • $200/mo: 120M tokens/month

**** Token Documentation


Cursor

Hobby (Free): Limited Agent requests + Limited Tab completions + 1-week Pro trial

  • Pro ($20/mo or $16/mo annually): Extended Agent limits + Unlimited Tab completions + Background Agents + Maximum context windows
  • Pro+ ($60/mo): 3x usage on all OpenAI, Claude, Gemini models
  • Ultra ($200/mo): 20x usage on all OpenAI, Claude, Gemini models + Priority access to new features
  • Teams ($40/user/mo): Pro features + Centralized billing + Usage analytics + SAML/OIDC SSO
  • Enterprise (Custom): Everything in Teams + Pooled usage + SCIM + AI code tracking API + Audit logs
  • GPT-5.1-Codex-Max free for all users until Dec 11, 2025 (77.9% SWE-bench Verified)
  • One-week Pro trial available (free tier)
  • Free tier now uses token-based usage tracking (not request-based)
  • Free models: Cursor Small, Deepseek v3, Gemini 2.5 Flash, GPT-4o mini (500/day limit), Grok 3 Mini Beta
  • Paid tiers: Access to OpenAI, Claude, Gemini models including GPT-5.1-Codex-Max
  • Note: Claude models removed from free tier ~June 2025
  • AI-powered code editor with autonomous coding capabilities

**** Pricing | GPT-5.1-Codex-Max Announcement


OpenAI Codex CLI

Free with ChatGPT Plus ($20/mo): 30–150 messages/5 hours for coding tasks

  • ChatGPT Pro ($200/mo): 300–1,500 messages/5 hours — highest usage limits
  • Pay-as-you-go API: GPT-5.1-Codex-Max at $1.25/$10 per million tokens (input/output)
  • Free OSS mode: Access to open-source models only (via --oss flag)
  • GPT-5.1-Codex-Max (Nov 19, 2025): 77.9% SWE-bench Verified — now default model
  • First model with "compaction" for multi-million token sessions (24+ hour tasks)
  • 30% fewer thinking tokens than previous GPT-5.1-Codex
  • Also available in GitHub Copilot (Pro, Pro+, Business, Enterprise)
  • Windows support now included
  • Cross-platform: macOS 12+, Ubuntu 20.04+, Windows 11 via WSL2

**** GitHub Repo | GPT-5.1-Codex-Max Announcement


Codeium

Pro ($10/mo): Unlimited usage with advanced context awareness

  • Claude 3.5 Sonnet, GPT-4o access
  • Enhanced context window and personalization
  • Teams ($12/user/mo): Pro features + team management
  • Enterprise (Custom): On-premise deployment, custom models

**** Pricing


Tabnine

Pro ($12/mo): Enhanced AI completions and chat

  • Enterprise ($39/user/mo): Multiple LLMs, private deployment
  • Models: Claude 3.5 Sonnet, GPT-4o, Llama 3.3 70B, proprietary models
  • 600+ programming languages supported
  • On-premises and air-gapped deployment options
  • Bring your own fine-tuned models

**** Pricing


JetBrains AI Assistant

AI Pro ($15/mo): Increased cloud quota + unlimited local models

  • AI Ultimate ($25/mo): Maximum cloud quota + advanced features
  • Free tier: Unlimited code completion + local models + limited cloud quota
  • 30-day Pro trial included
  • All Products Pack includes AI Pro
  • Offline mode with local models via Ollama/LM Studio

**** AI Pricing


Jules

Pro ($19.99/mo via Google AI Pro): 100 tasks/day

  • 5x higher limits than free tier (15 tasks/day → 100 tasks/day)
  • 5x concurrent tasks (3 → 15 concurrent)
  • Higher access to latest models
  • Ultra (via Google AI Ultra): 300 tasks/day
  • 20x higher limits than free tier
  • 60 concurrent tasks
  • Priority access to latest models
  • Gmail account required (18+ years)

**** Usage Limits | Google AI Plans


SuperMaven

Pro ($10/mo): 1M token context window + chat credits

  • Alternative: $99/year
  • Chat interface with GPT-4o, Claude 3.5 Sonnet, GPT-4
  • Team ($10/user/mo): Pro features + team management
  • Note: Merged with Cursor IDE in November 2024

**** Pricing

Know better pricing or limits? Share a link in an issue or PR to help keep this updated. See CONTRIBUTING.md for guidelines.


4. Tools with Free Access to Basic Models

(unspecified/basic models)

Bolt.new

Unspecified models

  • 1M tokens/month limit
  • Specific model not publicly specified
  • Credit card required

**** Token Documentation


Lovable

Unspecified models

  • 5 daily credits, max 30 per month (free)
  • Models not publicly enumerated
  • Credit card required

**** Messaging Limits


v0.dev

Proprietary models (not frontier)

  • GPT-5 access requires v0 Premium subscription
  • $5 in credits/month limit
  • Uses proprietary models with varied routing
  • Credit card required

**** Updated Pricing Blog


Codeium

Unlimited free usage of basic AI coding assistance

  • Individual plan: Free forever with unlimited code completions, AI chat, commands
  • 70+ programming languages supported
  • IDE integrations: VS Code, JetBrains, Vim/Neovim, Jupyter
  • No credit card required
  • Limited context awareness (expanded in paid tiers)
  • Base model only (Llama 3.1 70B), pro-graded models require subscription

**** Pricing | Documentation


Tabnine

Free tier with limited features

  • Basic AI code completions and chat (limited)
  • Local processing available
  • Context heavily limited in free tier
  • Performance dialed down to save resources
  • 600+ programming languages supported

**** Pricing


JetBrains AI Assistant

AI Free tier included with IDEs

  • Unlimited code completion and local model support
  • Limited quota for cloud-based features
  • 30-day AI Pro trial
  • Chat, code generation, commit messages with local models

**** AI Features


SuperMaven

Free tier with basic features

  • Basic code suggestions
  • 7-day data retention limit
  • Credit card required for registration
  • 1M token context window (impressive for free tier)

**** Pricing


Continue.dev

Free open-source extension with flexible model support

  • Free VS Code and JetBrains extension
  • Full support for local models via Ollama, LM Studio
  • Solo tier: Private/team/public visibility options
  • Supports 200+ models (requires your own API keys for cloud models)
  • Community hub for custom AI assistants
  • No vendor lock-in or usage limits for local models

**** GitHub | Model Hub

Know the official limits or models? Share a link in an issue or PR to update the information. See CONTRIBUTING.md for guidelines.


5. Local Models

Running open-weight frontier models locally provides unlimited coding assistance without API costs or usage limits. Popular tools for local deployment include Cline (VS Code extension with Plan/Act modes and MCP support), Aider (command-line assistant with built-in Git integration), and Continue.dev (open-source VS Code extension supporting 200+ models). All work seamlessly with Ollama to run frontier models like Devstral (24B parameters, optimized for agentic coding), Qwen3-Coder, DeepSeek Coder V2, Codestral, and GLM-4.5.

Note: Frontier models require substantial RAM/VRAM. In particular, for Qwen3‑Coder‑480B the Ollama‑friendly GGUF is ~150GB, and practical local inference can require ~150GB of unified memory (RAM+VRAM), which makes it hard on typical laptops; the 30B quant commonly needs ~18GB. See the Unsloth Qwen3‑Coder local guide for details (docs) and Simon Willison's article on running GLM‑4.5 AIR on his laptop to build Space Invaders for a practical example.


Comparison Notes

  • Goal: Compare AI coding tools by their access to pro-grade models and free tier limits.
  • What qualifies a model as "pro-grade"? Models must achieve ≥60% on SWE-bench Verified, demonstrating real-world software engineering capability. Current qualifying models: Claude Opus 4.5 (80.9%), GPT-5.1-Codex-Max (77.9%), Claude Sonnet 4.5 (77.2%), Gemini 3 Pro (76.2%), GPT-5 (74.9%), Claude Opus 4.1 (74.5%), Claude Sonnet 4 (72.7%), GPT-5 mini (71.0%), Qwen3-Coder-480B (69.6%), and Gemini 2.5 Pro (63.2%).
  • Different limit types: Tools use various quota systems - requests, tokens, credits, chats - making direct comparison challenging. Check documentation for specifics.
  • Real-world usage: Actual consumption varies dramatically based on coding style, task complexity, and tool implementation.