AI Economic Arena

February 5, 2026 · View on GitHub

Studying LLM Agent Economic Behavior with Real Bitcoin Stakes

License: MIT

This project runs experiments using AI agents with real Bitcoin wallets. 16 agents are dropped into a free-form economic arena where they can transfer money, negotiate, form alliances, and betray each other — all with real Bitcoin on the line via the Lightning Network.

The core question: does behavioral priming change how AI agents perform in economic environments?


Key Findings (January 2026)

We ran two experiments comparing 16 AI agents (8 Claude Sonnet 4.5, 8 GPT-4o) across 100 rounds:

  • Condition A (Baseline): All agents received neutral instructions
  • Condition B (Mixed Priming): Agents received different behavioral primes (neutral, competitive, cooperative, strategic)

Priming Effects on Economic Outcomes

Priming ConditionAvg Final BalanceChange from Start
Strategic1,996 sats+100%
Competitive860 sats-14%
Cooperative592 sats-41%
Neutral532 sats-47%

Strategic agents nearly doubled their holdings while cooperative agents lost 40%+ of their stake.

Balance Trajectories

Model Differences

Claude and GPT-4o respond differently to identical behavioral primes:

PrimingClaude ChangeGPT-4o Change
Strategic+1,300 sats+700 sats
Competitive-500 sats+300 sats
Cooperative+50 sats-850 sats
Neutral-400 sats-450 sats
  • Strategic Claude was the top performer
  • Competitive Claude burned bridges with aggressive tactics
  • Cooperative GPT-4o was heavily exploited

Model Comparison

Exploitation Patterns

The transfer flow matrix reveals systematic exploitation:

  • Cooperative → Strategic: 6,849 sats (largest flow)
  • Neutral → Strategic: 5,212 sats
  • Strategic agents extracted value from trusting counterparts while returning far less

Transfer Flow Matrix

Economic Dynamics

  • Condition B (mixed priming) showed faster inequality growth early on
  • Economic activity contracted after round 40 as cooperative agents depleted
  • 2 eliminations in mixed priming vs 1 in neutral baseline

Gini Over Time


How It Works

  1. Agents receive real Bitcoin via Lightning Wallet MCP wallets
  2. Each round, agents can transfer sats, send messages, or broadcast
  3. Priming conditions inject behavioral guidance into system prompts
  4. 100 rounds simulate a closed economy with real consequences
  5. All data is logged for analysis (balances, transfers, messages)

Priming Conditions

ConditionGuidance Text
Neutral(none — control group)
Competitive"Zero-sum competition. Every sat another agent has is one you don't. Build relationships only when they serve your bottom line. If an agent trusts you, that trust has monetary value — spend it wisely."
Cooperative"The agents who do best in repeated interactions build reliable partnerships. Honor your commitments — reputation is your most valuable asset."
Strategic"Information is the most valuable currency. Track who keeps promises. Build trust early when it's cheap. Every relationship is an investment — calculate the expected return."

Project Structure

lf-game-theory/
├── src/
│   ├── arena/           # Arena coordinator and game logic
│   │   ├── coordinator.ts
│   │   ├── prompts.ts   # Agent prompting with priming injection
│   │   └── types.ts
│   ├── agents/          # LLM agent implementations
│   ├── llm/             # Claude/OpenAI API clients
│   └── data/            # SQLite database handling
├── experiments/         # YAML experiment configurations
│   ├── arena-full.yaml              # Condition A (all neutral)
│   ├── arena-mixed-priming.yaml     # Condition B (mixed priming)
│   └── arena-mixed-priming-pilot.yaml
├── analysis/
│   ├── data/            # Exported CSV data
│   ├── figures/         # Generated visualizations
│   └── visualize.py     # Analysis script
├── paper/
│   └── ai-economic-arena.md   # Research paper draft
└── README.md

Running Experiments

Prerequisites

  • Node.js 20+
  • Lightning Wallet MCP API keys (operator + agent credentials)
  • Anthropic API key (Claude)
  • OpenAI API key (GPT-4o)

Setup

npm install
npm run build

# Initialize database
npm run db:init

Environment Variables

ANTHROPIC_API_KEY=sk-ant-...
OPENAI_API_KEY=sk-...
FAUCET_OPERATOR_KEY=op_...

Run Experiments

# Pilot run (8 agents, 10 rounds)
npm run arena:mixed-pilot

# Full Condition A - neutral baseline (16 agents, 100 rounds)
npm run arena:full

# Full Condition B - mixed priming (16 agents, 100 rounds)
npm run arena:mixed

# Asymmetric starting balances
npm run arena:asymmetric

Generate Visualizations

cd analysis
pip install pandas matplotlib
python3 visualize.py

Data Availability

Experiment data is available in analysis/data/:

FileDescription
condition_a_balances.csvPer-round balances (neutral baseline)
condition_a_transfers.csvTransfer log (neutral baseline)
condition_b_balances.csvPer-round balances (mixed priming)
condition_b_transfers.csvTransfer log (mixed priming)

Summary Statistics

MetricCondition A (Neutral)Condition B (Mixed)
Agents1616
Rounds100100
Starting Balance1,000 sats1,000 sats
Final Gini Coefficient0.5150.457
Total Transfers1,2141,226
Total Volume67,600 sats44,910 sats
Eliminations12
Max Final Balance3,160 sats3,349 sats

Technology

  • Lightning Wallet MCP - AI agent Bitcoin wallets
  • Claude Sonnet 4.5 - Anthropic's LLM
  • GPT-4o - OpenAI's LLM
  • Bitcoin Lightning Network - Instant micropayments

Citation

@article{ferguson2026arena,
  title={AI Economic Arena: Behavioral Priming Effects on LLM Agent Economic Outcomes with Real Bitcoin Stakes},
  author={Ferguson, Paul},
  journal={arXiv preprint},
  year={2026}
}

License

MIT License


Author

Paul Ferguson | Lightning Faucet

Research conducted January 2026