PolyNet: A Pursuit of Structural Diversity in Very Deep Networks

July 23, 2017 ยท View on GitHub

By Xingcheng Zhang, Zhizhong Li, Chen Change Loy, Dahua Lin

Multimedia Laboratory, The Chinese University of Hong Kong

Homepage pdf

BibTeX

@article{zhang2016polynet,
  title={Polynet: A pursuit of structural diversity in very deep networks},
  author={Zhang, Xingcheng and Li, Zhizhong and Loy, Chen Change and Lin, Dahua},
  journal={arXiv preprint arXiv:1611.05725},
  year={2016}
}

The Very Deep PolyNet

PolyNet

  • Visualization

  • Models

    NOTE: The model is trained using our own deep learning framework Parrots. The caffe model is converted from the Parrots model, and the proto file is based on Yuanjun's fork of Caffe.

Results

modeltraining speed* (#imgs/second)single-crop val top-1single-crop val top-5single-crop test top-5multi-crop val top-1multi-crop val top-5
ResNet-152-22.166.16-19.384.49
ResNet-152^279 ( 8 GPUs)20.935.545.5018.503.97
ResNet-269^245 (16 GPUs)19.784.894.8217.543.55
ResNet-500^248 (32 GPUs)19.664.784.7017.593.63
Inception-v4-20.05.0-1 7.73.8
Inception-ResNet-v2-19.94.9-17.83.7
Inception-ResNet-v2314 ( 8 GPUs)20.055.055.1118.413.98
Very Deep Inception-ResNet278 (32 GPUs)19.104.484.4617.393.56
Very Deep PolyNet290 (32 GPUs)18.714.254.3317.363.45

^ The ResNet models are trained by Tong Xiao;

* Training speed is measured on Parrots using NVIDIA TITAN X Graphics Cards.