UniConvNet: Expanding Effective Receptive Field while Maintaining Asymptotically Gaussian Distribution for ConvNets of Any Scale

November 21, 2025 · View on GitHub

Official PyTorch implementation of UniConvNet, from the following paper:

UniConvNet: Expanding Effective Receptive Field while Maintaining Asymptotically Gaussian Distribution for ConvNets of Any Scale.
ICCV 2025.
Yuhao Wang, Wei Xi
Xi'an Jiaotong University
[arXiv]


We propose UniConvNet, a pure ConvNet model constructed entirely from standard ConvNet modules. UniConvNet performs well on both lightweight and large-scale models.

Catalog

  • ImageNet-1K Training Code
  • ImageNet-22K Pre-training Code
  • ImageNet-1K Fine-tuning Code
  • Downstream Transfer (Detection, Segmentation) Code (Coming soon ...)

Results and Pre-trained Models

ImageNet-1K trained models

nameresolutionacc@1#paramsFLOPsmodel(hugging face)model(baidu)
UniConvNet-A224x22477.03.4M0.589Gmodelmodel
UniConvNet-P0224x22479.15.2M0.932Gmodelmodel
UniConvNet-P1224x22479.66.1M0.895Gmodelmodel
UniConvNet-P2224x22480.57.6M1.25Gmodelmodel
UniConvNet-N0224x22481.610.2M1.65Gmodelmodel
UniConvNet-N1224x22482.213.1M1.88Gmodelmodel
UniConvNet-N2224x22482.715.0M2.47Gmodelmodel
UniConvNet-N3224x22483.219.7M3.37Gmodelmodel
UniConvNet-T224x22484.230.3M5.1Gmodelmodel
UniConvNet-T384x38485.430.3M15.0Gmodelmodel
UniConvNet-S224x22484.550.0M8.48Gmodelmodel
UniConvNet-S384x38485.750.0M24.9Gmodelmodel
UniConvNet-B224x22485.097.6M15.9Gmodelmodel
UniConvNet-B384x38485.997.6M46.6Gmodelmodel

ImageNet-22K trained models

nameresolutionacc@1#paramsFLOPs22k model
(hugging face)
22k model (baidu)1k model
(hugging face)
22k model (baidu)
ConvNeXt-L384x38488.2201.8M100.1Gmodelmodelmodelmodel
ConvNeXt-XL384x38488.4226.7M115.2Gmodelmodelmodelmodel

Installation

Please check INSTALL.md for installation instructions.

Evaluation

We give an example evaluation command for a ImageNet-1K pre-trained UniConvNet-A:

Single-GPU

python main.py --model UniConvNet_A --eval true \
--resume https://huggingface.co/ai-modelwithcode/UniConvNet/resolve/main/uniconvnet_a_1k_224.pth \
--input_size 224 --drop_path 0.05 \
--data_path /path/to/imagenet-1k

Multi-GPU

python -m torch.distributed.launch --nproc_per_node=8 main.py \
--model UniConvNet_A --eval true \
--resume https://huggingface.co/ai-modelwithcode/UniConvNet/resolve/main/uniconvnet_a_1k_224.pth \
--input_size 224 --drop_path 0.05 \
--data_path /path/to/imagenet-1k

This should give

* Acc@1 77.030 Acc@5 93.364 loss 0.983
  • For evaluating other model variants, change --model, --resume, --input_size accordingly. You can get the url to pre-trained models from the tables above.
  • Setting model-specific --drop_path is not strictly required in evaluation, as the DropPath module in timm behaves the same during evaluation; but it is required in training. See TRAINING.md or our paper for the values used for different models.

Training

See TRAINING.md for training and fine-tuning instructions.

Acknowledgement

This repository is built using the timm library, ConvNeXt and InternImage repositories.

License

This project is released under the MIT license. Please see the LICENSE file for more information.

Citation

If you find this repository helpful, please consider citing:

@InProceedings{Wang_2025_ICCV,
    author    = {Wang, Yuhao and Xi, Wei},
    title     = {UniConvNet: Expanding Effective Receptive Field while Maintaining Asymptotically Gaussian Distribution for ConvNets of Any Scale},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2025},
    pages     = {20922-20933}
}