Sample Business Database

April 10, 2026

Looking for a realistic sample database for SQL practice, software testing, or demo data? This is a modern alternative to AdventureWorks and Northwind — with balanced accounting, real tax compliance, and 3 years of simulated business operations.

Realistic, relational business datasets generated by simulating SMEs day-by-day over 3 financial years. Every transaction flows through double-entry accounting. Not random data — simulated business operations.

Free to use for development, testing, training, and evaluation.

Browse all datasets →

Full datasets: 42 tables, 39K–83K rows, CSV + SQL + Parquet + SQLite — mindweave.tech/datasets | Kaggle (AU sample)


Three Variants

| Variant | Company | Compliance | Directory |
|---|---|---|---|
| Australian | Outback Outdoor Supplies Pty Ltd | ATO PAYG, GST 10%, BAS, Super 11.5% | ./ (root) |
| US | Summit Outdoor Gear LLC | IRS federal tax, FICA, ~7.5% sales tax | ./us/ |
| UK | Peak District Outdoor Supplies Ltd | HMRC PAYE, NI 10%, VAT 20%, Pension 3%+5% | ./uk/ |

Each variant has the same 42-table structure with 44 foreign key relationships, but with genuinely different tax rules, salary ranges, payroll frequency, financial year, and locale-specific data.
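
To make the rate differences concrete, here is a small sketch that applies the headline consumption-tax rates quoted for the three variants (GST 10%, roughly 7.5% sales tax, VAT 20%) to a net amount. The names `CONSUMPTION_TAX_RATES` and `add_tax` are hypothetical, and treating the approximate US figure as a single flat rate is an assumption made for illustration; the datasets themselves may apply rates per product or jurisdiction.

```python
from decimal import Decimal, ROUND_HALF_UP

# Headline rates quoted for each variant in this README.
# The US figure is approximate ("~7.5%") and is treated as a flat rate here.
CONSUMPTION_TAX_RATES = {
    "AU": Decimal("0.10"),   # GST
    "US": Decimal("0.075"),  # sales tax (approximate, assumed flat)
    "UK": Decimal("0.20"),   # VAT
}


def add_tax(net_amount: str, variant: str) -> Decimal:
    """Return the tax-inclusive amount for a net amount, rounded to cents."""
    rate = CONSUMPTION_TAX_RATES[variant]
    gross = Decimal(net_amount) * (Decimal("1") + rate)
    return gross.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    for variant in CONSUMPTION_TAX_RATES:
        print(variant, add_tax("100.00", variant))
```

Using `Decimal` rather than floats keeps the arithmetic exact to the cent, which matters when totals are later reconciled against ledger entries.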


Quick Start

# Clone and explore
git clone https://github.com/MindweaveTech/sme-sim-sample.git
cd sme-sim-sample

# Australian data (root directory)
sqlite3 :memory: <<'SQL'
.mode csv
.import companies.csv companies
.import sales_orders_sample.csv sales_orders
SELECT name, country FROM companies;
SELECT count(*) as total_sales FROM sales_orders;
SQL

# US data
sqlite3 :memory: <<'SQL'
.mode csv
.import us/companies.csv companies
.import us/sales_orders_sample.csv sales_orders
SELECT name, country FROM companies;
SELECT count(*) as total_sales FROM sales_orders;
SQL

# UK data
sqlite3 :memory: <<'SQL'
.mode csv
.import uk/companies.csv companies
.import uk/sales_orders_sample.csv sales_orders
SELECT name, country FROM companies;
SELECT count(*) as total_sales FROM sales_orders;
SQL
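
If you would rather load a whole variant directory at once from Python, something like the following sketch may help. The function name `load_variant` is hypothetical. It assumes Python 3.9+, that every CSV has a header row, and that storing every column as TEXT is acceptable for exploration. As in the commands above, a trailing `_sample` is stripped from the table name.

```python
import csv
import sqlite3
from pathlib import Path


def load_variant(directory: str) -> sqlite3.Connection:
    """Load every *.csv file in `directory` into an in-memory SQLite database.

    Table names come from the file names, with a trailing "_sample" removed.
    All columns are created as TEXT (an assumption made for simplicity).
    """
    conn = sqlite3.connect(":memory:")
    for path in sorted(Path(directory).glob("*.csv")):
        table = path.stem.removesuffix("_sample")
        with path.open(newline="", encoding="utf-8") as handle:
            reader = csv.reader(handle)
            header = next(reader, None)
            if not header:
                continue  # skip empty files
            columns = ", ".join(f'"{name}" TEXT' for name in header)
            placeholders = ", ".join("?" for _ in header)
            conn.execute(f'CREATE TABLE "{table}" ({columns})')
            conn.executemany(
                f'INSERT INTO "{table}" VALUES ({placeholders})', reader
            )
    conn.commit()
    return conn
```

For example, `load_variant("us").execute("SELECT name, country FROM companies").fetchall()` would run the same first query as the US block above, assuming the script is started from the repository root.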

What makes this different from AdventureWorks / Northwind / Faker

| Feature | AdventureWorks | Northwind | Faker/Mockaroo | sme-sim |
|---|---|---|---|---|
| Cross-domain traceability | Partial | No | No | Full: sale > invoice > payment > bank > journal entry |
| Double-entry accounting | No | No | No | Yes: debits always equal credits |
| Tax compliance | US-only | None | None | AU + US + UK |
| Temporal realism | Static | Static | Random | Simulated: seasonal patterns, staff turnover, payment behaviour |
| Relational integrity | Good | Basic | None | 44 FK relationships, all enforced |
| Still maintained | Last updated 2014 | Last updated ~2000 | N/A | 2025 |
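
The "debits always equal credits" property is easy to verify with a grouped query. The sketch below runs against a toy table rather than the real files: the table name `journal_lines` and the columns `entry_id`, `account`, `debit_cents`, and `credit_cents` are assumptions for illustration, so adjust them to the actual journal schema before using the query on the dataset.

```python
import sqlite3

# Toy data with hypothetical column names; the real schema may differ.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE journal_lines ("
    " entry_id INTEGER, account TEXT,"
    " debit_cents INTEGER, credit_cents INTEGER)"
)
conn.executemany(
    "INSERT INTO journal_lines VALUES (?, ?, ?, ?)",
    [
        # A 100.00 sale plus 10% GST, received into the bank account.
        (1, "Bank", 11000, 0),
        (1, "Sales revenue", 0, 10000),
        (1, "GST collected", 0, 1000),
        # A stock purchase on credit.
        (2, "Inventory", 5000, 0),
        (2, "Accounts payable", 0, 5000),
    ],
)

# Entries whose debit total differs from their credit total.
UNBALANCED = """
SELECT entry_id, SUM(debit_cents) AS debits, SUM(credit_cents) AS credits
FROM journal_lines
GROUP BY entry_id
HAVING SUM(debit_cents) <> SUM(credit_cents)
"""


def unbalanced_entries(connection: sqlite3.Connection) -> list:
    """Return (entry_id, debits, credits) for every entry that does not balance."""
    return connection.execute(UNBALANCED).fetchall()


if __name__ == "__main__":
    print(unbalanced_entries(conn))
```

An empty result means every entry balances. Amounts are stored as integer cents in this sketch to avoid floating-point drift in the sums.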

Sample contents per variant

Each directory contains full reference tables plus sampled transaction tables (~200 rows each from the full dataset).

Reference tables: companies, departments, chart_of_accounts, products, customers, _row_counts

Sampled tables: sales_orders_sample, journal_entries_sample, employees_sample
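
Because the transaction tables in this repo are samples, it can be worth checking after loading that sampled rows still resolve against the reference tables. The anti-join below is one way to count orphans. The helper name `orphan_count` and the `customer_id` and `order_id` columns in the toy data are assumptions, not the documented schema.

```python
import sqlite3


def orphan_count(conn, child, fk_column, parent, pk_column):
    """Count rows in `child` whose `fk_column` has no match in `parent`."""
    sql = (
        f'SELECT COUNT(*) FROM "{child}" AS c '
        f'LEFT JOIN "{parent}" AS p ON c."{fk_column}" = p."{pk_column}" '
        f'WHERE c."{fk_column}" IS NOT NULL AND p."{pk_column}" IS NULL'
    )
    return conn.execute(sql).fetchone()[0]


# Toy data with hypothetical column names; the real schema may differ.
demo = sqlite3.connect(":memory:")
demo.executescript(
    """
    CREATE TABLE customers (customer_id TEXT);
    CREATE TABLE sales_orders (order_id TEXT, customer_id TEXT);
    INSERT INTO customers VALUES ('C1'), ('C2');
    INSERT INTO sales_orders VALUES ('S1', 'C1'), ('S2', 'C2'), ('S3', 'C9');
    """
)

if __name__ == "__main__":
    print(orphan_count(demo, "sales_orders", "customer_id",
                       "customers", "customer_id"))
```

In the toy data, order `S3` points at a customer that does not exist, so the count is 1. Against the sample files, a count of 0 would indicate that the sampled rows all resolve to a reference row.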

Who this is for

  • Developers building ERP, accounting, or business software
  • QA teams testing complex business workflows across modules
  • Consultants running demos and training without exposing client data
  • Data engineers building ETL pipelines against a realistic source
  • Students studying business systems and accounting
  • AI/ML teams needing realistic business data for training

Full datasets & products

View all products at mindweave.tech/datasets.

Australian

| Product | Tables | Rows | Price |
|---|---|---|---|
| Free Sample (this repo) | 26 | ~2,800 | Free |
| Complete SME Dataset | 42 | 83,000+ | $49 |
| Multi-Company Bundle | 126 | 246,000+ | $99 |
| Enterprise Pack | 210 | 400,000+ | $199 |

US

| Product | Tables | Rows | Price |
|---|---|---|---|
| US Complete | 42 | 78,000+ | $49 |
| US Multi-Company | 126 | 230,000+ | $99 |

UK

| Product | Tables | Rows | Price |
|---|---|---|---|
| UK Complete | 42 | 39,000+ | $49 |
| UK Multi-Company | 126 | 124,000+ | $99 |

Industry Verticals

| Product | Industry | Rows | Price |
|---|---|---|---|
| AU Restaurant & Pub | Hospitality | 226,000+ | $79 |
| US Restaurant & Bar | Hospitality | 259,000+ | $79 |
| UK Pub & Kitchen | Hospitality | 180,000+ | $79 |
| AU Consulting Firm | Professional Services | 143,000+ | $79 |
| US Consulting Firm | Professional Services | 156,000+ | $79 |
| UK Advisory Firm | Professional Services | 109,000+ | $79 |

Also available on Kaggle and Hugging Face.

Keywords

Sample database, test data, dummy data, synthetic data, AdventureWorks alternative, Northwind alternative, SQL practice database, business dataset, accounting test data, ERP test data, double-entry accounting, mock data, demo database.

License

This sample is free to use for development, testing, training, and evaluation. Full datasets are commercially licensed — see mindweave.tech/datasets for terms.


Built by Mindweave Technologies | Dev.to: AdventureWorks Is Dead | Dev.to: 5 Reasons AdventureWorks Is Terrible