Pretrained models

November 13, 2024 · View on GitHub

To load pretrained model you need to define model_name (size), dataset it was trained on, train_type whether it is pretrain-only (ptn), large-margin finetuning (ft_lm), finetuning with cutmix+mixup augmentations (ft_mix).

import torch

model_name='M' # ~b3-b4 size
train_type='ft_mix'
dataset='vb2+vox2+cnc'

model = torch.hub.load('IDRnD/ReDimNet', 'ReDimNet', 
                       model_name=model_name, 
                       train_type=train_type, 
                       dataset=dataset)

All models configurations with corresponding metrics can be found in following table:

Model name (size)Train datasetTrain typeVox1-O EER(%)Vox1-E EER(%)Vox1-H EER(%)SITW EER(%)VOICES EER(%)CN-Celeb EER(%)
Mvb2+vox2+cncft_mix0.8350.7451.2841.2032.7037.474
Mvb2ptn1.3191.1282.0001.4824.1169.012
Svb2+vox2+cncft_mix0.9360.8741.5101.3102.7748.043
Svb2ptn1.5421.4082.5051.7813.9879.592
b0vox2ft_lm1.161.252.20---
b0vox2ptn------
b1vox2ft_lm0.850.971.73---
b1vox2ptn------
b2vox2ft_lm0.570.761.32---
b2vox2ptn------
b3vox2ft_lm0.500.731.33---
b3vox2ptn------
b4vox2ft_lm0.510.681.26---
b4vox2ptn------
b5vox2ft_lm0.430.611.08---
b5vox2ptn------
b6vox2ft_lm0.400.551.05---
b6vox2ptn------

Paper metrics

ModelParamsGMACsLMAS-NormVox1-O EER(%)Vox1-E EER(%)Vox1-H EER(%)
⬦ ReDimNet-B01.0M0.431.161.252.20
⬥ ReDimNet-B01.071.182.01
NeXt-TDNN-l (C=128,B=3)1.6M0.29*1.101.242.12
NeXt-TDNN (C=128,B=3)1.9M0.35*1.031.171.98
⬦ ReDimNet-B12.2M0.540.850.971.73
⬥ ReDimNet-B10.730.891.57
ECAPA (C=512)6.4M1.050.941.212.20
NeXt-TDNN-l (C=256,B=3)6.0M1.13*0.811.041.86
CAM++7.2M1.150.710.851.66
NeXt-TDNN (C=256,B=3)7.1M1.35*0.791.041.82
⬦ ReDimNet-B24.7M0.900.570.761.32
⬥ ReDimNet-B20.520.741.27
ECAPA (C=1024)14.9M2.670.981.132.09
DF-ResNet564.5M2.660.961.091.99
Gemini DF-ResNet604.1M2.50*0.941.051.80
⬦ ReDimNet-B33.0M3.000.500.731.33
⬥ ReDimNet-B30.470.691.23
ResNet346.6M4.550.820.931.68
Gemini DF-ResNet1146.5M5.000.690.861.49
⬦ ReDimNet-B46.3M4.800.510.681.26
⬥ ReDimNet-B40.440.641.17
Gemini DF-ResNet1839.2M8.250.600.811.44
DF-ResNet23312.3M11.170.580.761.44
⬦ ReDimNet-B59.2M9.870.430.611.08
⬥ ReDimNet-B50.390.591.05
ResNet29323.8M28.100.530.711.30
ECAPA227.1M187.00*0.440.621.15
⬦ ReDimNet-B615.0M20.270.400.551.05
⬥ ReDimNet-B60.370.531.00

* - means values have been estimated.