Training for libfacedetection in PyTorch

January 19, 2024 · View on GitHub

It is the training program for libfacedetection. The source code is based on MMDetection. Some data processing functions from SCRFD modifications.

Visualization of our network architecture: [netron].

Training for libfacedetection in PyTorch

Installation

Create conda environment. e.g.

conda create -n yunet python=3.8
conda activate yunet

Install PyTorch == v1.8.2 (LTS) following official instruction. e.g.
On GPU platforms (cu11.1):

# LINUX:
conda install pytorch torchvision torchaudio cudatoolkit=11.1 -c pytorch-lts -c nvidia
# WINDOWS:
conda install pytorch torchvision torchaudio cudatoolkit=11.1 -c pytorch-lts -c conda-forge

On GPU platforms (cu10.2):

conda install pytorch torchvision torchaudio cudatoolkit=10.2 -c pytorch-lts

Install MMCV >= v1.3.17 but <=1.6.0 following official instruction. e.g.

# cu11.1
pip install mmcv-full==1.3.17 -f https://download.openmmlab.com/mmcv/dist/cu111/torch1.8.0/index.html
# cu10.2
pip install mmcv-full==1.3.17 -f https://download.openmmlab.com/mmcv/dist/cu102/torch1.8.0/index.html

Clone this repository. We will call the cloned directory as $TRAIN_ROOT.

git clone https://github.com/ShiqiYu/libfacedetection.train.git
cd libfacedetection.train

Install dependencies.

python setup.py develop
pip install -r requirements.txt

Note:

Codes are based on Python 3+.
If meet error "ModuleNotFoundError: No module named 'torch.ao'", you can Ctrl + click to origin line and replace torch.ao to torch

Preparation

Download the WIDER Face dataset and its evaluation tools.

Extract zip files under $TRAIN_ROOT/data/widerface as follows:

$ tree data/widerface
data/widerface
├── wider_face_split
├── WIDER_test
├── WIDER_train
├── WIDER_val
└── labelv2
      ├── train
      │   └── labelv2.txt
      └── val
          ├── gt
          └── labelv2.txt

NOTE:
The labelv2 comes from SCRFD.

Training

Following MMdetection training processing.

CUDA_VISIBLE_DEVICES=0,1 bash tools/dist_train.sh ./configs/yunet_n.py 2 12345

Detection

python tools/detect_image.py ./configs/yunet_n.py ./weights/yunet_n.pth ./image.jpg

Evaluation on WIDER Face

python tools/test_widerface.py ./configs/yunet_n.py ./weights/yunet_n.pth --mode 2

Performance on WIDER Face (Val): confidence_threshold=0.02, nms_threshold=0.45, in origin size:

AP_easy=0.892, AP_medium=0.883, AP_hard=0.811

Export CPP source code

The following bash code can export a CPP file for project libfacedetection

python tools/yunet2cpp.py ./configs/yunet_n.py ./weights/yunet_n.pth

Export to onnx model

Export to onnx model for libfacedetection/example/opencv_dnn.

python tools/yunet2onnx.py ./configs/yunet_n.py ./weights/yunet_n.pth

Compare ONNX model with other works

Inference on exported ONNX models using ONNXRuntime:

python tools/compare_inference.py ./onnx/yunet_n.onnx --mode AUTO --eval --score_thresh 0.02 --nms_thresh 0.45

Some similar approaches(e.g. SCRFD, Yolo5face, retinaface) to inference are also supported.

With Intel i7-12700K and input_size = origin size, score_thresh = 0.02, nms_thresh = 0.45, some results are list as follow:

Model	AP_easy	AP_medium	AP_hard	#Params	Params Ratio	MFlops (320x320)	FPS(320x320)
SCRFD0.5(ICLR2022)	0.892	0.885	0.819	631,410	8.32x	184	284
Retinaface0.5(CVPR2020)	0.907	0.883	0.742	426,608	5.62X	245	235
YuNet_n(Ours)	0.892	0.883	0.811	75,856	1.00x	149	456
YuNet_s(Ours)	0.887	0.871	0.768	54,608	0.72x	96	537

The compared models can be downloaded from Google Drive.

Citation

We published a paper for the main idea of this repository:

@article{yunet,
  title={YuNet: A Tiny Millisecond-level Face Detector},
  author={Wu, Wei and Peng, Hanyang and Yu, Shiqi},
  journal={Machine Intelligence Research},
  pages={1--10},
  year={2023},
  doi={10.1007/s11633-023-1423-y},
  publisher={Springer}
}

The paper can be open accessed at https://link.springer.com/article/10.1007/s11633-023-1423-y.

The loss used in training is EIoU, a novel extended IoU. More details can be found in:

@article{eiou,
 author={Peng, Hanyang and Yu, Shiqi},
 journal={IEEE Transactions on Image Processing},
 title={A Systematic IoU-Related Method: Beyond Simplified Regression for Better Localization},
 year={2021},
 volume={30},
 pages={5032-5044},
 doi={10.1109/TIP.2021.3077144}
}

The paper can be open accessed at https://ieeexplore.ieee.org/document/9429909.

We also published a paper on face detection to evaluate different methods.

@article{facedetect-yu,
  author={Feng, Yuantao and Yu, Shiqi and Peng, Hanyang and Li, Yan-Ran and Zhang, Jianguo},
  journal={IEEE Transactions on Biometrics, Behavior, and Identity Science}, 
  title={Detect Faces Efficiently: A Survey and Evaluations}, 
  year={2022},
  volume={4},
  number={1},
  pages={1-18},
  doi={10.1109/TBIOM.2021.3120412}
}

The paper can be open accessed at https://ieeexplore.ieee.org/document/9580485