README.md

March 27, 2024 ยท View on GitHub

6D Rotation Representation for Unconstrained Head Pose Estimation

Note

  • The default train dataset is 300W-LP
  • The default test dataset is AFLW2000

Installation

conda create -n PyTorch python=3.11
conda activate PyTorch
conda install pytorch torchvision torchaudio pytorch-cuda=11.8 -c pytorch -c nvidia
pip install opencv-python==4.5.5.64
pip install scipy
pip install tqdm
pip install timm

Train

  • Configure your dataset path in main.py for training
  • Download IMAGENET pretrained weights
  • Run python main.py --train for Single-GPU training
  • Run bash main.sh $ --train for Multi-GPU training, $ is number of GPUs

Test

  • Configure your dataset path in main.py for testing
  • Run python main.py --test for testing

Demo

  • Configure your video path in main.py for visualizing the demo
  • Run python main.py --demo for demo

Results

BackboneEpochsPitchYawRollMAEParameters(M)FLOPS (B)Throughput (images/s)
RepNet-A2904.783.683.253.9025.495.11322
RepNet-B1G2*304.913.633.373.9741.368.8792

* means that the results are from original repo, see reference

Alt Text

Reference