FSG: Fast and Slow Gradient Approximation for Binary Neural Network Optimization

Code for the accepted paper "Fast and Slow Gradient Approximation for Binary Neural Network Optimization" (AAAI 2025).

How to use it

Dependencies

pip install -r requirements.txt

Prepare pre-trained model

Please follow the MetaQuant tutorial to train your own pre-trained model.

Alternatively, you can use the default pre-trained models we provide, uploaded under Results/model-dataset/model-dataset-pretrain.pth

Quick Start

The following command runs FSG on ResNet44 using the CIFAR10 dataset, with dorefa as the forward quantization method and Adam as the optimizer.

The Fast-net is Multi-FC and the Slow-net is Mamba.

The resulting model is quantized to 1 bit, {+1, -1}, in all layers (conv, fc).
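As a rough illustration of what 1-bit weights mean, the sketch below maps real-valued weights to {+1, -1} by sign. It is not the repository's dorefa quantizer, which may also apply scaling. The function name `binarize` and the choice to map zero to +1 are assumptions made for this example only.

```python
def binarize(weights):
    """Map each real-valued weight to +1.0 or -1.0 according to its sign.

    Assumption for this sketch: a weight of exactly zero maps to +1.0.
    """
    return [1.0 if w >= 0 else -1.0 for w in weights]


# Hypothetical full-precision weights from a single layer.
weights = [0.42, -0.07, 0.0, -1.3]
binary = binarize(weights)
print(binary)  # [1.0, -1.0, 1.0, -1.0]
```

Because the sign function has zero gradient almost everywhere, training such a model needs an approximate gradient in the backward pass, which is the role FSG plays.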

The initial learning rate is 1e-3 and is multiplied by 0.1 every 30 epochs (1e-3 -> 1e-4 -> 1e-5):

CUDA_VISIBLE_DEVICES='0' python meta-quantize.py -m ResNet44 -d CIFAR10 -q dorefa -bw 1 -o adam -meta MetaFastAndSlow -hidden 100 -lr 1e-3 -n 100
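The step schedule described above can be written as a one-line rule. The sketch below is only an illustration of that rule, not code taken from meta-quantize.py. The function name `step_lr` and its parameter names are hypothetical. It follows the same decay rule as PyTorch's `StepLR` scheduler with `step_size=30` and `gamma=0.1`.

```python
def step_lr(epoch, base_lr=1e-3, gamma=0.1, step_size=30):
    """Learning rate at a given epoch: multiply by gamma once per step_size epochs."""
    return base_lr * gamma ** (epoch // step_size)


# Print the rate just before and just after each decay boundary.
for epoch in (0, 29, 30, 59, 60):
    print(f"epoch {epoch:3d}: lr = {step_lr(epoch):.0e}")
```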

The following command runs FSG on ResNet56 using the CIFAR100 dataset, with dorefa as the forward quantization method and Adam as the optimizer:

CUDA_VISIBLE_DEVICES='0' python meta-quantize.py -m ResNet56 -d CIFAR100 -q dorefa -bw 1 -o adam -meta MetaFastAndSlow -hidden 100 -lr 1e-3 -n 100
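The two commands above differ only in the model and dataset arguments. If you want to script several runs, a small helper along the following lines can assemble the command string. This is a sketch that uses only the flags shown above. The helper name `build_command` is hypothetical and not part of this repository.

```python
import shlex


def build_command(model, dataset, gpu="0"):
    """Assemble the meta-quantize.py command line for one model/dataset pair."""
    args = [
        "python", "meta-quantize.py",
        "-m", model,
        "-d", dataset,
        "-q", "dorefa",
        "-bw", "1",
        "-o", "adam",
        "-meta", "MetaFastAndSlow",
        "-hidden", "100",
        "-lr", "1e-3",
        "-n", "100",
    ]
    # Prefix the GPU selection as an environment variable assignment.
    return f"CUDA_VISIBLE_DEVICES={shlex.quote(gpu)} " + shlex.join(args)


# The two model/dataset pairs used in this Quick Start.
for model, dataset in [("ResNet44", "CIFAR10"), ("ResNet56", "CIFAR100")]:
    print(build_command(model, dataset))
```

The printed lines can be pasted into a shell or written to a script such as run.sh.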

Support

Open an issue if you find a bug, and email me with any questions about the paper.

Citation

Please cite the paper if this work helps you.

Acknowledgements

Our project builds on code from the following repository. We sincerely thank its authors for offering a useful public code base.

MetaQuant
