README.md

March 12, 2026 ยท View on GitHub

ScienceClaw โ€” AI Research Gateway

A self-evolving AI research colleague for scientists.

Stars 285 Skills 28+ Disciplines Zero Hallucination License


Why ScienceClaw?

General-purpose AI assistants are built for everyone. ScienceClaw is built for researchers.

The core idea is simple: an AI that does real scientific work โ€” searching literature, querying databases, running analyses โ€” and gets better at it the more you use it. It remembers your research context across sessions, adapts its skills to your field, and never fabricates a citation.

ScienceClaw is built on the OpenClaw engine, but redesigned from the ground up for academic research.

ScienceClaw vs Standard AI


๐Ÿงฌ Core 1: Self-Evolving Skills

This is ScienceClaw's most important feature.

Most AI tools ship with a fixed set of capabilities. ScienceClaw's skills evolve with you. Every time you complete a research task, the system learns:

Skill Self-Evolution Cycle

What this means in practice:

  • Week 1: You study immunology. ScienceClaw learns that PubMed + Semantic Scholar works best for your queries, that you prefer forest plots over tables, and that you always need PMID + DOI in citations.
  • Week 4: The system has created specialized skills for your subfield โ€” optimized search templates, preferred statistical methods, database priority chains tuned to immunology literature.
  • Month 3: ScienceClaw handles your domain like a trained research assistant. It knows which databases to hit first, which journals matter, and how you like your output formatted.

Compared to standard OpenClaw: OpenClaw ships with ~54 general-purpose skills that don't change. ScienceClaw starts with 285 skills and grows from there โ€” the agent writes new SKILL.md files at runtime without any redeployment.


๐Ÿง  Core 2: Research Memory That Persists

Standard AI assistants forget everything when the conversation ends. ScienceClaw doesn't.

Four-Layer Research Memory

What this enables:

  • "Continue the literature review we started last Tuesday" โ€” it remembers where you left off
  • "Use the same search strategy that worked for the BRCA2 project" โ€” it retrieves past patterns
  • Cross-session knowledge accumulation โ€” findings from project A can inform project B
  • Smart context pruning โ€” when the context window fills up, it preserves statistical results, effect sizes, and key citations while compacting intermediate steps

Compared to standard OpenClaw: OpenClaw has a basic memory plugin. ScienceClaw adds temporal decay weighting, LanceDB vector storage, and cross-session research pattern retrieval โ€” specifically designed for long-running academic work.


โฑ๏ธ Core 3: Built for Long-Duration Research

A real literature review takes hours, not seconds. Most AI tools time out after a few minutes. ScienceClaw is engineered for extended research sessions:

CapabilityStandard OpenClawScienceClaw
Agent timeout600s (10 min)3600s (1 hour+)
Session persistenceEnds with conversationHeartbeat keeps sessions alive across interruptions
Research depthSingle-pass responseMulti-phase protocol with mandatory depth thresholds
Minimum effortNo guaranteeQuick=5, Survey=30, Review=60, Systematic=100+ tool calls
Early stoppingCommonAnti-premature-conclusion checklist blocks shallow answers
Context managementBasic truncationSmart compaction preserves key findings when context fills up

The persistence protocol enforces real research depth. Before ScienceClaw concludes any task, it must verify:

  • โœ… Searched at least 3 different databases/sources
  • โœ… Retrieved full metadata (not just titles)
  • โœ… Cross-referenced findings across sources
  • โœ… Checked for contradictory evidence
  • โœ… Verified key statistics against primary sources
  • โœ… Organized results into a structured output file
  • โœ… Met the minimum tool-call threshold for the task type

If any box is unchecked, it keeps working instead of giving you a half-baked answer.

Compared to standard OpenClaw: OpenClaw's default 10-minute timeout is fine for sending messages and setting reminders. ScienceClaw's 1-hour sessions with heartbeat monitoring and mandatory depth enforcement are built for real academic research.


๐Ÿšซ Core 4: Zero Hallucination

This is the highest-priority rule in the entire system. It's non-negotiable.

The problem: General AI assistants routinely fabricate citations โ€” inventing DOIs, making up author names, citing papers that don't exist. In scientific work, this is catastrophic.

ScienceClaw's approach:

EVERY citation must come from a tool result in the CURRENT conversation.

If a database didn't return it โ†’ you can't cite it.
If you're not sure โ†’ say "not verified" explicitly.
If you can't find evidence โ†’ say so. Don't guess.

No "I think." No "probably." No hallucinated PMIDs.

This is enforced at the protocol level in SCIENCE.md โ€” the 629-line research protocol that governs all agent behavior. It's not a suggestion. It's a hard rule that applies before any other instruction.

Compared to standard OpenClaw: OpenClaw has no special hallucination controls. ScienceClaw's SCIENCE.md protocol treats every factual claim as requiring evidence โ€” the same standard you'd apply to a manuscript under peer review.


๐ŸŒ Core 5: All of Science, Not Just Biomedicine

ScienceClaw covers natural sciences AND social sciences across dozens of disciplines:

Scientific Discipline Coverage

๐Ÿ“‹ Full discipline & database list

Natural Sciences

DomainKey Skills & Databases
BiomedicinePubMed, UniProt, KEGG, PDB, ClinicalTrials, gnomAD, scanpy, biopython
ChemistryPubChem, ChEMBL, RDKit, drug-discovery, molecular-dynamics
GenomicsNCBI Entrez, Ensembl, ClinVar, GEO, phylogenetics
Materials ScienceMaterials Project, pymatgen, materials-screening
Physicsastropy, quantum-computing, physics-solver, simulation
Environmental ScienceCopernicus climate data, geospatial analysis, GIS tools
Food ScienceSpecialized analysis pipelines

Social Sciences

DomainKey Skills & Databases
EconomicsWorld Bank, SSRN, census data, econometrics
Political SciencePolicy analysis, legislative data
PsychologyExperimental design, statistical testing, meta-analysis
LinguisticsspaCy, NLTK, NLP analysis
EducationResearch methodology, assessment analysis
SociologyNetwork analysis, survey methods

Cross-Disciplinary Tools

CategoryCapabilities
StatisticsSciPy, statsmodels, scikit-learn, effect sizes, confidence intervals, multiple comparison corrections
Visualizationmatplotlib, plotly, seaborn, publication-quality figures
WritingLaTeX papers, systematic reviews (PRISMA), grant proposals, patent drafting
MathematicsSymPy symbolic computation, numerical methods, optimization

285 skills total โ€” and growing, because the self-evolution system creates new ones as you work.

Compared to standard OpenClaw: OpenClaw has no scientific database integrations. No PubMed, no UniProt, no arXiv, no World Bank. ScienceClaw connects to 25+ academic databases with structured API query skills across all major scientific disciplines.


Quick Start

# Clone
git clone https://github.com/beita6969/ScienceClaw.git
cd ScienceClaw

# One-click setup (installs everything: Node, Python, MCP servers, skills)
chmod +x setup.sh && ./setup.sh

# Or manual install
pnpm install && npx openclaw onboard

Enable Research Features

The setup.sh script automatically configures everything. For manual setup, edit ~/.openclaw/openclaw.json:

{
  "gateway": { "mode": "local" },
  "plugins": {
    "slots": { "memory": "memory-core" },
    "entries": {
      "memory-core": { "enabled": true },
      "memory-lancedb": { "enabled": true }
    }
  },
  "agents": {
    "defaults": {
      "heartbeat": { "interval": 1800 }
    }
  }
}

Project Structure

ScienceClaw/
โ”œโ”€โ”€ setup.sh                # ๐Ÿฆž One-click setup (run this first!)
โ”œโ”€โ”€ SCIENCE.md              # 629-line research protocol (the brain)
โ”œโ”€โ”€ skills/                 # 285 skill definitions (and growing)
โ”‚   โ”œโ”€โ”€ skill-evolution/    # Self-improving skill system
โ”‚   โ”œโ”€โ”€ research-reflection/# Post-task learning & evaluation
โ”‚   โ”œโ”€โ”€ skill-creator/      # Runtime skill generation
โ”‚   โ””โ”€โ”€ ...
โ”œโ”€โ”€ src/                    # Core engine
โ”‚   โ”œโ”€โ”€ memory/             # 4-layer memory (temporal decay, LanceDB)
โ”‚   โ”œโ”€โ”€ agents/             # Agent orchestration & persistence
โ”‚   โ””โ”€โ”€ skills/             # Skill loading & execution
โ”œโ”€โ”€ ui/                     # Web-based research gateway UI
โ”œโ”€โ”€ extensions/             # Plugin system
โ”œโ”€โ”€ deploy/                 # Docker, Fly.io, Podman configs
โ”œโ”€โ”€ config/                 # Vitest, build, lint configs
โ””โ”€โ”€ docs/                   # Documentation

Contact Us

๐Ÿ“ง mingdazhang@ieee.org

License

MIT โ€” see LICENSE.