BENCHMARKS.md

October 6, 2023 ยท View on GitHub

This file contains benchmark AUC performance for predicting each pathlogy on 20% of each dataset.

To reproduce these results use the script scripts/model_calibrate.py. This script will take a long time so it will cache the results so you can work with them after the compute is done. The script also calculates the calibration for a model but those outputs can be ignored for this benchmarking. Below is an example of running different models on the PadChest pc dataset and writing the output as markdown.

python model_calibrate.py pc resnet50-res512-all -mdtable
python model_calibrate.py pc chexpert -mdtable
python model_calibrate.py pc jfhealthcare -mdtable
python model_calibrate.py pc densenet121-res224-all -mdtable

Results updated: 07/23/2023

NIH ChestX-ray14

Model NameAtelectasisCardiomegalyConsolidationEdemaEffusionEmphysemaFibrosisHerniaInfiltrationMassNodulePleural_ThickeningPneumoniaPneumothorax
XRV-ResNet-resnet50-res512-all0.780.900.780.890.860.880.760.920.690.820.760.760.710.84
XRV-DenseNet121-densenet121-res224-all0.760.880.770.850.850.730.720.910.680.800.690.740.710.75
jfhealthcare-DenseNet1210.760.850.780.870.87---------
CheXpert-DenseNet121-ensemble0.800.880.790.880.87---------

Google

Model Name# ParamsLung OpacityFractureNodule or massPneumothorax
XRV-ResNet-resnet50-res512-all23,538,6420.70.88-0.92
XRV-DenseNet121-densenet121-res224-all6,966,0340.920.74-0.85

RSNA

Model Name# ParamsLung OpacityPneumonia
XRV-ResNet-resnet50-res512-all23,538,6420.850.87
XRV-DenseNet121-densenet121-res224-all6,966,0340.880.86

SIIM

Model Name# ParamsPneumothorax
XRV-ResNet-resnet50-res512-all23,538,6420.91
XRV-DenseNet121-densenet121-res224-all6,966,0340.79

PadChest

Model Name# ParamsAtelectasisCardiomegalyConsolidationEdemaEffusionEmphysemaFibrosisFractureHerniaInfiltrationMassNodulePleural_ThickeningPneumoniaPneumothorax
XRV-ResNet-resnet50-res512-all23,538,6420.800.940.880.970.950.860.960.860.950.850.850.760.850.810.87
XRV-DenseNet121-densenet121-res224-all6,966,0340.770.930.880.970.950.870.940.700.960.850.850.690.790.820.81
jfhealthcare-DenseNet12112,525,3010.780.890.820.940.96----------
CheXpert-DenseNet121-ensemble0.820.920.880.970.97----------

VinBrain

Model Name# ParamsAtelectasisCardiomegalyConsolidationEffusionInfiltrationLung OpacityPleural_ThickeningPneumothorax
XRV-ResNet-resnet50-res512-all23,538,6420.600.850.910.850.820.710.790.69
XRV-DenseNet121-densenet121-res224-all6,966,0340.670.900.930.870.860.850.840.93
jfhealthcare-DenseNet12112,525,3010.790.810.950.92----
CheXpert-DenseNet121-ensemble0.740.890.970.93----

CheXpert

Model Name# ParamsAtelectasisCardiomegalyConsolidationEdemaEnlarged CardiomediastinumFractureLung LesionLung OpacityEffusionPleural OtherPneumoniaPneumothoraxSupport Devices
XRV-ResNet-resnet50-res512-all23,538,6420.630.840.740.790.50.580.500.710.81-0.670.61-
XRV-DenseNet121-densenet121-res224-all6,966,0340.910.910.900.920.780.740.840.870.94-0.840.85-
jfhealthcare-DenseNet12112,525,3010.910.890.910.90----0.95----
CheXpert-DenseNet121-ensemble0.930.910.910.92----0.96----

MIMIC-CXR (CheXpert labels)

Model NameAtelectasisCardiomegalyConsolidationEdemaEnlarged CardiomediastinumFractureLung LesionLung OpacityEffusionPleural OtherPneumoniaPneumothoraxSupport Devices
XRV-DenseNet121-densenet121-res224-all0.880.880.910.920.840.740.820.860.92-0.820.81-
XRV-ResNet-resnet50-res512-all0.770.780.880.870.500.660.50.770.9-0.770.72-
jfhealthcare-DenseNet1210.900.820.930.93----0.94----
CheXpert-DenseNet121-ensemble0.880.840.930.93----0.94----