Workstation Benchmark Report

February 20, 2026 · View on GitHub

Date: 20-02-2026 System: Intel Core i7-12700F / AMD Radeon RX 7700 XT / 64 GB DDR5 / Ubuntu 25.10

Open-source benchmarks of Daniel Rosehill's desktop workstation, with full system inventory and hardware photos.

System Summary

This is a mid-tower desktop workstation built around Intel's 12th Gen Alder Lake platform. It combines a 12-core/20-thread hybrid CPU with an AMD RDNA 3 GPU, 64 GB of DDR5 memory, and a 5-drive all-SSD btrfs RAID5 storage pool totalling 4.5 TiB. The system runs Ubuntu 25.10 with KDE Plasma 6 on Wayland, and is used for AI/ML workloads (ROCm), software development, technical communications, and media processing. All four GPU display outputs are occupied across three 1080p monitors and one mini display.

System Overview

ComponentDetails
CPUIntel Core i7-12700F — 12 cores / 20 threads (8P + 4E), 4.9 GHz turbo
GPUAMD Radeon RX 7700 XT (Sapphire Pulse) — 12 GB GDDR6, amdgpu + Mesa 25.2.8
RAM64 GB DDR5-4800 (4x16 GB Kingston)
Storage5-drive btrfs RAID5 pool (~4.5 TiB) — see details below
MotherboardMSI PRO B760M-A WIFI (LGA 1700, Intel B760)
PSUSeasonic Focus GX-850 (850W, 80+ Gold, fully modular)
CPU CoolerThermalright Peerless Assassin 120 SE V3 (250W+ rated)
CaseBe Quiet Pure Base 500 (mid-tower)
Displays3x 1920x1080 + 1x 1024x600 (all 4 GPU outputs used)
OSUbuntu 25.10 "Questing Quokka", KDE Plasma 6.4.5, Wayland
Kernel6.14.0-15-generic (PREEMPT_DYNAMIC)
NetworkingRealtek RTL8125 2.5GbE (wired) + Intel Wi-Fi 6E
AudioPipeWire 1.4.7, FiiO K11 USB DAC

Storage & Btrfs RAID

All 5 SSDs are pooled into a single btrfs filesystem spanning the NVMe and 4 SATA drives:

DeviceModelCapacityInterface
nvme0n1p3Crucial CT1000P3SSD8930.0 GBNVMe PCIe Gen 4
sdaSanDisk SSD PLUS 1000GB931.5 GBSATA 3.2
sdbKingston SA400S37894.3 GBSATA
sdcSanDisk SSD PLUS931.5 GBSATA
sddSanDisk SSD PLUS931.5 GBSATA

RAID Configuration

ProfileLevelTotalUsed
DataRAID52.86 TiB2.73 TiB (95.6%)
MetadataRAID163.0 GiB55.8 GiB (88.5%)
SystemRAID132.0 MiB208 KiB
  • Total capacity: 4.51 TiB across 5 devices
  • Usable (estimated): ~798 GiB free
  • Data ratio: 1.25 (RAID5 — one drive of parity, 80% usable)
  • Metadata ratio: 2.00 (RAID1 — mirrored across 2 devices)
  • Mount options: rw,noatime,compress=zstd:3,ssd,discard=async,space_cache=v2
  • Health: Zero I/O errors on all 5 devices

Subvolume Layout

SubvolumeMount Point
@/ (root)
@home/home
@snapshots/.snapshots

Benchmark Results

CPU & Memory

CPU and Memory Benchmarks

  • CPU (single-thread): 1,399 events/sec (sysbench)
  • CPU (multi-thread, 20T): 15,677 events/sec (sysbench) — 11.2x scaling
  • CPU (stress-ng): 27,029 bogo ops/s across all methods
  • Memory (single-thread): 34,644 MiB/s
  • Memory (multi-thread, 20T): 128,309 MiB/s

Disk I/O (fio)

Disk I/O Benchmarks

TestBandwidthIOPS
Sequential Read (1M)9,636 MiB/s9,635
Sequential Write (1M)8,483 MiB/s8,483
Random Read (4K)12.7 GiB/s3,335,000
Random Write (4K)5,335 MiB/s1,366,000

Note: btrfs internally converts O_DIRECT to buffered I/O in many code paths, so even with direct=1 and drop_caches, fio results reflect kernel buffer + zstd compression performance rather than raw disk throughput. See the btrfs scrub rate below for actual disk speed.

Btrfs RAID5 Filesystem Benchmarks

TestResultNotes
Scrub throughput (initial)490 MiB/sBurst rate at 5s
Scrub throughput (sustained)135 MiB/sStable rate at 60s, with parity verification
Scrub errors0Clean across all 5 devices
Snapshot creation0.680sReadonly snapshot of root subvolume
Snapshot deletion0.015sInstant (no-commit)

The sustained scrub rate of ~135 MiB/s represents real physical read throughput across the RAID5 array with parity computation — this is the true disk performance figure. Snapshot operations are near-instant due to btrfs copy-on-write.

GPU (OpenGL)

GPU Benchmarks

glmark2 Score: 9,809

Top scenes: texture 13,361 FPS, build (VBO) 12,409 FPS, shading 12,353 FPS.

GPU Compute (ROCm 6.4.0)

ROCm compute benchmarks using PyTorch 2.9.1+rocm6.3 on the AMD Radeon RX 7700 XT (gfx1101, 54 CUs / 27 WGPs, 12 GB GDDR6, 192-bit bus).

ROCm Compute Summary

GEMM Throughput (Matrix Multiply)

GEMM Throughput

SizeFP32 (TFLOPS)FP16 (TFLOPS)
1024x10248.1732.63
2048x20488.2047.65
4096x40968.9248.53
8192x81928.4148.78

Peak observed: 8.9 TFLOPS FP32, 48.8 TFLOPS FP16. The ~5.5x FP16/FP32 ratio reflects RDNA 3's packed FP16 execution (2x rate) combined with reduced memory pressure.

ResNet-50 Inference

ResNet-50 Inference

Batch SizeFP32 (img/s)FP16 (img/s)
1264353
87821,554
327261,668

Transformer Layer (Single Layer Forward Pass)

Transformer Layer Latency

ConfigurationFP32 (ms)FP16 (ms)FP16 + Flash Attn (ms)
BERT-base (B=1, L=512)2.760.520.38
BERT-base (B=8, L=512)25.152.472.15
LLaMA-7B-like (B=1, L=2048)472.3129.5427.76

Flash attention is experimental on gfx1101 (enabled via TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1). Performance gains are modest (~6-27%) but expected to improve with future ROCm releases.

Conv2D Throughput

Conv2D Latency

ConfigurationTime (ms/iter)
ResNet first layer (B=32, 3->64, 224x224, k=7)2.18
ResNet mid layer (B=32, 256->256, 56x56, k=3)3.58
ResNet first layer (B=1)0.08
ResNet deep layer (B=8, 512->512, 28x28, k=3)0.92

GPU Memory Bandwidth

GPU Memory Bandwidth

TestBandwidth
VRAM D2D copy (64 MB)299.7 GB/s
VRAM D2D copy (256 MB)304.1 GB/s
VRAM D2D copy (1 GB)305.9 GB/s
PCIe H->D (pinned)12.2–12.8 GB/s
PCIe D->H (pinned)11.5–12.3 GB/s

Theoretical peak VRAM bandwidth is 432 GB/s (2248 MHz effective x 192-bit). The measured ~306 GB/s (71% efficiency) through a simple copy kernel is typical for a single-pass kernel without vectorized loads.

PCIe bandwidth of ~12 GB/s reflects PCIe 4.0 x16 operation (theoretical max ~25 GB/s bidirectional, ~13 GB/s unidirectional after protocol overhead).

Summary

Summary

Tools Used

ToolVersionPurpose
sysbench1.0.20CPU and memory benchmarks
stress-ngsystemCPU stress / bogo-ops
fio3.39Disk I/O benchmarks
glmark2-wayland2023.01OpenGL GPU benchmark
btrfs scrubkernel 6.14RAID5 real-disk throughput
PyTorch2.9.1+rocm6.3ROCm GPU compute (GEMM, inference, transformer)
ROCm6.4.0AMD GPU compute stack
HIP6.4.0GPU memory bandwidth (hipMemcpy, kernel copy)
Typst0.13.1Report generation (PDF)

Files

  • report.pdf — Full benchmark report with photos and raw data appendices
  • report.typ — Typst source
  • pics/ — Hardware photos
  • charts/ — Extracted chart pages

Test Conditions

  • Normal desktop workload (KDE Plasma, browser, terminal running)
  • Kernel: 6.14.0-15-generic (PREEMPT_DYNAMIC)
  • CPU governor: default (schedutil)
  • No special tuning applied (no CPU pinning, no governor override, no drop_caches)
  • ROCm benchmarks run with HSA_OVERRIDE_GFX_VERSION=11.0.1 (standard for gfx1101 GPUs)
  • Flash attention benchmarks used TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1