This is the open-source tool accompanying the paper "Now that I can see, I can improve: Enabling data-driven finetuning of CNNs on the edge", published at the EDLCV workshop at CVPR 2020. If you use this tool in a publication, we would appreciate it if you cited it as follows:
@misc{rajagopal2020i,
    title={Now that I can see, I can improve: Enabling data-driven finetuning of CNNs on the edge},
    author={Aditya Rajagopal and Christos-Savvas Bouganis},
    year={2020},
    eprint={2006.08554},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}
This README walks through the settings in the config files that allow you to perform automated pruning and retraining of CNNs. The pruners/ folder here contains all the details on pruning CNNs on the CIFAR dataset and was used to collect the results for the paper. For an updated pruners folder that supports ImageNet pruning as well as a wider variety of networks, please check out https://github.com/ICIdsl/pruners.git; the updated pruners repository also works as a standalone library that can be used in your own projects. The adapt.yml file describes a conda environment that satisfies all the dependencies required for this project.
git clone https://github.com/ICIdsl/pytorch_training.git
cd pytorch_training
git submodule update --init src/ar4414/pruning
As you go down the config file, within the pruning_hyperparameters section, options are evaluated top down: the first option set to True takes priority and overrides any True or False values that follow it. This README covers the main pruning framework; details of the available scripts are in the README in the scripts folder, and the plotting options are described in the README in the plotting folder.
- Get_Gops : True
- Logs : path to the json file with the timestamps of all the logs
- Inference_Gops :
- if True, inference GOps are stored in a gops.json file inside the folder containing the logs
- if False, training GOps are stored in a gops.json file inside the folder containing the logs
- inference GOps are reported per minibatch; training GOps are minibatch independent, as they are stored per epoch
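For example, to collect per-minibatch inference GOps for a set of previously logged runs, the relevant options could look like the sketch below. The Key : Value layout and the bracketed section header simply mirror the names used in this README, and the log path is a placeholder; adapt both to the layout of your own config file.

    [pruning_hyperparameters]
    Get_Gops : True
    Logs : /path/to/log_timestamps.json    # json holding the timestamps of the runs to analyse
    Inference_Gops : True                  # False would store per-epoch training GOps instead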
This performs inference with the network trained on the entire dataset, but using only the images in the chosen subset. No retraining is performed; it provides data on the performance obtained if the original network is deployed directly.
- Unpruned_Test_Acc: True
- stores into the logs json the dict {'test_top1':#, 'gops':inference_gops} per network and subset
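A minimal sketch for this check, using the option names above (the section header and the log path are placeholders for your own config):

    [pruning_hyperparameters]
    Unpruned_Test_Acc : True
    Logs : /path/to/log_timestamps.json    # logs json into which the results are stored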
This looks at the performance obtained if pruning is performed in a data-agnostic manner, i.e. the network is pruned at epoch 0 and retrained for the same epoch budget as the data-aware case. For each pruning percentage, inference is performed on the various subsets and recorded in the logs.json file. The prerequisite is that the training above must already have been performed and the json file updated with the logs of training on the entire dataset (gen_data_agnostic_l1_prune.py in the scripts folder generates scripts for this training).
- Pruned_Test_Acc : True
- Trained_on :
- only the default, 'entire_dataset', works at the moment
- this parameter specifies which subset/dataset the pruned models were retrained on after pruning
- Logs : path to the json file that holds the timestamps for logs from runs where pruning was data agnostic
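A sketch of the corresponding options (the section header and path are placeholders; only the default value of Trained_on is currently supported):

    [pruning_hyperparameters]
    Pruned_Test_Acc : True
    Trained_on : entire_dataset
    Logs : /path/to/data_agnostic_log_timestamps.json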
- No_Finetune_Channels_Pruned : True
- Pretrained : path to the model as originally trained, before any finetuning
- the location currently defaults to '{path to pruning dir}/logs/{arch}/{dataset}/baseline/pre_ft_pp_{pruning percentage}.pth.tar'
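A sketch of this mode, using the default Pretrained location quoted above (the section header is a placeholder and the braced fields must be filled in for your setup):

    [pruning_hyperparameters]
    No_Finetune_Channels_Pruned : True
    Pretrained : {path to pruning dir}/logs/{arch}/{dataset}/baseline/pre_ft_pp_{pruning percentage}.pth.tar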
Multiple configurations are possible. The base parameters are as follows:
- Prune_Filters : True
- Pruning_Perc : percentage of the network (by memory) to be pruned
- Finetune_Budget : number of epochs of retraining to be performed after pruning
- Prune_After : epoch at which pruning should be performed
- Finetune : True
- Static : True
- currently no other mode exists for finetuning
- static refers to setting a pre-defined and fixed number of epochs for finetuning and retraining
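A sketch of a standard prune-then-finetune run combining the base parameters with this mode (the section header and the numeric values are illustrative only):

    [pruning_hyperparameters]
    Prune_Filters : True
    Pruning_Perc : 50          # prune 50% of the network by memory
    Prune_After : 5            # prune at epoch 5
    Finetune_Budget : 25       # retrain for 25 epochs after pruning
    Finetune : True
    Static : True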
- Finetune : False
- Retrain : True
- prunes the model without any finetuning and retrains it from scratch on the dataset specified by the Sub_Name and Sub_Classes parameters
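A sketch of this retrain-from-scratch mode (the section header is a placeholder, and the Sub_Name / Sub_Classes values are left as placeholders since their exact format is not covered here):

    [pruning_hyperparameters]
    Prune_Filters : True
    Pruning_Perc : 50
    Finetune : False
    Retrain : True
    Sub_Name : <subset name>           # placeholder
    Sub_Classes : <subset classes>     # placeholder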
- Finetune : False
- Retrain : False
- will prune the model specified in Pretrained to the pruning percentage specified by Pruning_Perc and report inference statistics
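A sketch of the prune-only mode, which prunes a pretrained model and reports inference statistics without any finetuning or retraining (section header, path and percentage are illustrative):

    [pruning_hyperparameters]
    Prune_Filters : True
    Pruning_Perc : 50
    Finetune : False
    Retrain : False
    Pretrained : /path/to/original_model.pth.tar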
- Binary_Search : True
- setting this to True will perform the binary search algorithm described in the paper "Now that I can see, I can improve: Enabling data-driven finetuning of CNNs on the edge"
- the other settings that need to be set are 'Prune_After' (nf in the paper) and 'Finetune_Budget' (nr in the paper)
- currently the values of pl, pu and pi are fixed to 5%, 95% and 5% respectively, corresponding to the paper
- the Lr_Schedule term works the same as usual and cycles through the desired learning rates after nf epochs each time a new iteration of the binary search is performed
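A sketch of a binary search run (the section header and numeric values are illustrative; pl, pu and pi are fixed internally as described above):

    [pruning_hyperparameters]
    Binary_Search : True
    Prune_After : 5            # nf in the paper
    Finetune_Budget : 10       # nr in the paper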