Runtime and accuracy metrics for all release models

March 5, 2026 ยท View on GitHub

Setup

The runtime and accuracy reported in this page are generated using n2-standard-96 GCP instances which has the following configuration:

GCP instance type: n2-standard-96
CPUs: 96-core (vCPU)
Memory: 384GiB
GPUs: 0

Details of metrics can be found here:

Sample sheet contains details of the input files used to generate this report.

Note: Each model type uses different coverages.

Accuracy

Tumor-normal accuracy

Model typesampletypetotal.truthtotal.querytpfpfnprecisionrecallf1_score
wgsHCC1395SNVs39447380703765341717940.9890460.9545210.971477
wgsHCC1395indels1626172715142131120.8766650.9311190.903072
wesHCC1395SNVs11591104109410650.9909420.9439170.966858
wesHCC1395indels484542360.9333330.8750.903226
pacbioHCC1395SNVs394473851737001151624460.9606410.9379930.949182
pacbioHCC1395indels1626162013143063120.8111110.8081180.809612
ontHCC1395SNVs39447315823060597788420.9690650.7758510.861761
ontHCC1395indels1626126210392235870.8232960.6389910.719529
ffpe-wgsHCC1395SNVs394473414532250189571970.9445010.8175530.876454
ffpe-wgsHCC1395indels1626161212673453590.785980.7792130.782582
ffpe-wesHCC1395SNVs1159990956342030.9656570.8248490.889716
ffpe-wesHCC1395indels484740780.8510640.8333330.842105

Tumor-only accuracy

Model typesampletypetotal.truthtotal.querytpfpfnprecisionrecallf1_score
wgs-tumor-onlyHCC1395SNVs394474462435663896137840.7991890.9040740.848402
wgs-tumor-onlyHCC1395indels16263772137923932470.3655890.8480930.51093
wes-tumor-onlyHCC1395SNVs1159987948392110.9604860.8179470.883504
wes-tumor-onlyHCC1395indels4855411470.7454550.8541670.796117
pacbio-tumor-onlyHCC1395SNVs3944756068372631880521840.6646040.9446350.780254
pacbio-tumor-onlyHCC1395indels1626222312329913940.5542060.7576880.640166
ont-tumor-onlyHCC1395SNVs3944751417305532086488940.594220.7745330.6725
ont-tumor-onlyHCC1395indels1626208893311556930.4468390.5738010.502423
ffpe-wgs-tumor-onlyHCC1395SNVs394473778430686709887610.8121430.7779050.794655
ffpe-wgs-tumor-onlyHCC1395indels1626194911907594360.610570.7318570.665734
ffpe-wes-tumor-onlyHCC1395SNVs115912259213042380.7518370.7946510.772651
ffpe-wes-tumor-onlyHCC1395indels48106406680.3773580.8333330.519481

Runtime

Each case study was run 5x times and the runtimes were averaged.

Model typesamplemean runtime
wgsHCC13952h 14m 41s
wesHCC139510m 47s
pacbioHCC13953h 36m 18s
ontHCC13954h 6m 46s
ffpe-wgsHCC13955h 49m
ffpe-wesHCC139517m 40s
wgs-tumor-onlyHCC13951h 30m 30s
wes-tumor-onlyHCC13955m 58s
pacbio-tumor-onlyHCC13952h 23m 52s
ont-tumor-onlyHCC13952h 39m 56s
ffpe-wes-tumor-onlyHCC13957m 9s
ffpe-wgs-tumor-onlyHCC13952h 8m 54s

How to reproduce the metrics on this page

For simplicity and consistency, we report runtime with a CPU instance with 96 CPUs This is NOT the fastest or cheapest configuration.

Use gcloud compute ssh to log in to the newly created instance.

Download and run any of the following case study scripts:

# Get the script.
curl -O https://raw.githubusercontent.com/google/deepvariant/r1.10/scripts/inference_deepsomatic.sh

# WGS
bash inference_deepsomatic.sh --model_preset WGS

# WES
bash inference_deepsomatic.sh --model_preset WES

# PACBIO
bash inference_deepsomatic.sh --model_preset PACBIO

# ONT
bash inference_deepsomatic.sh --model_preset ONT

# FFPE_WGS
bash inference_deepsomatic.sh --model_preset FFPE_WGS

# FFPE_WES
bash inference_deepsomatic.sh --model_preset FFPE_WES

# WGS_TUMOR_ONLY
bash inference_deepsomatic.sh --model_preset WGS_TUMOR_ONLY --use_default_pon_filtering

# WES_TUMOR_ONLY
bash inference_deepsomatic.sh --model_preset WES_TUMOR_ONLY --use_default_pon_filtering

# PACBIO_TUMOR_ONLY
bash inference_deepsomatic.sh --model_preset PACBIO_TUMOR_ONLY --use_default_pon_filtering

# ONT_TUMOR_ONLY
bash inference_deepsomatic.sh --model_preset ONT_TUMOR_ONLY --use_default_pon_filtering

# FFPE_WGS_TUMOR_ONLY
bash inference_deepsomatic.sh --model_preset FFPE_WGS_TUMOR_ONLY --use_default_pon_filtering

# FFPE_WES_TUMOR_ONLY
bash inference_deepsomatic.sh --model_preset FFPE_WES_TUMOR_ONLY --use_default_pon_filtering

Runtime metrics are taken from the resulting log after each stage of DeepSomatic.

The accuracy metrics came from the som.py extension of hap.py program.