PyTorch implementation of Procrustes ResNet (ProcResNet), proposed in:

Zaeemzadeh, Alireza, Nazanin Rahnavard, and Mubarak Shah. "Norm-Preservation: Why Residual Networks Can Become Extremely Deep?" IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020.
Tested on:
- Python 3.9.2
- cuda 11.2
- torch 1.8.1
- torchvision 0.9.1
- numpy 1.20.1
Sample commands:

```bash
python main.py --model_file 'models/procresnet.py' --model_name 'ProcResNet166' --regul_freq 0.5 --batchsize 128 --training_epoch 300 --lr_decay_epoch 150 225 --initial_lr 0.1 --dataset 'cifar10'
python main.py --model_file 'models/procresnet.py' --model_name 'ProcResNet274' --dataset 'cifar10'
python main.py --model_file 'models/resnet.py' --model_name 'ResNet272' --dataset 'cifar10'
python main.py --model_file 'models/procresnet.py' --model_name 'ProcResNet274' --dataset 'cifar100'
python main.py --model_file 'models/resnet.py' --model_name 'ResNet272' --dataset 'cifar100'
```
`regul_freq` is a number in the range [0, 1] that determines how often regularization is performed (default: 0.5).
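One way a fractional frequency like this can gate a per-step hook is to fire whenever the accumulated frequency crosses an integer. This is a minimal sketch of that idea; `should_regularize` is a hypothetical helper, and the repository's actual scheduling logic may differ.

```python
def should_regularize(step: int, regul_freq: float) -> bool:
    """Return True on roughly the fraction `regul_freq` of training steps.

    Fires whenever step * regul_freq crosses an integer, so regul_freq=0.5
    triggers on every other step and regul_freq=1.0 on every step.
    """
    if regul_freq <= 0:
        return False
    return int(step * regul_freq) != int((step - 1) * regul_freq)


# Illustrative use in a training loop (model/optimizer elided):
# for step, batch in enumerate(loader, start=1):
#     loss.backward(); optimizer.step()
#     if should_regularize(step, regul_freq=0.5):
#         model.regularize_convs()
```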
The `ProcResNet` class has a method called `regularize_convs`, which is called after each gradient-descent update to enforce norm-preservation on the transition blocks. See the `regularize_convs` function in `models/procresnet.py` for details.
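To give intuition for what enforcing norm-preservation means, here is a hedged sketch for the plain matrix case: the orthogonal Procrustes solution replaces a weight matrix's singular values with ones, yielding the nearest norm-preserving operator. The actual `regularize_convs` works on convolution kernels and may differ in detail; `project_orthonormal` is an illustrative name, not the repository's API.

```python
import numpy as np

def project_orthonormal(w: np.ndarray) -> np.ndarray:
    """Project w onto the nearest matrix with orthonormal columns.

    SVD gives w = U S V^T; dropping S (setting all singular values to 1)
    yields U V^T, the orthogonal Procrustes solution.
    """
    u, _, vt = np.linalg.svd(w, full_matrices=False)
    return u @ vt

rng = np.random.default_rng(0)
w = rng.standard_normal((8, 4))
w_proj = project_orthonormal(w)

# The projected matrix has orthonormal columns, so it preserves norms:
x = rng.standard_normal(4)
print(np.allclose(np.linalg.norm(w_proj @ x), np.linalg.norm(x)))  # True
```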
Figure: gradient norm ratio for ResNet (top) and ProcResNet (bottom).
| Model Name | Depth | #Params | Error (%) |
|---|---|---|---|
| ResNet | 272 | 2.82M | 4.73 |
| ResNet | 632 | 6.52M | 4.59 |
| ResNet | 1001 | 10.32M | 4.52 |
| ProcResNet | 274 | 2.83M | 4.20 |
| ProcResNet | 634 | 6.53M | 3.78 |
| ProcResNet | 1003 | 10.33M | 3.84 |
If you find this work useful, please use the following BibTeX entry:

```bibtex
@article{zaeemzadeh2018norm,
  title   = {Norm-Preservation: Why Residual Networks Can Become Extremely Deep?},
  author  = {Zaeemzadeh, Alireza and Rahnavard, Nazanin and Shah, Mubarak},
  journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year    = {2020}
}
```