A library for computing loss landscapes for neural networks

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Visualizing the Loss Landscape of Neural Nets

This repository is a fork of the original repository by the authors of the paper

Hao Li, Zheng Xu, Gavin Taylor, Christoph Studer and Tom Goldstein. Visualizing the Loss Landscape of Neural Nets. NIPS, 2018.

We add simple and easy to use installation and running instructions.

An interactive 3D visualizer for loss surfaces has been provided by telesens.

Given a network architecture and its pre-trained parameters, this tool calculates and visualizes the loss surface along random direction(s) near the optimal parameters. The calculation can be done in parallel with multiple GPUs per node, and multiple nodes. The random direction(s) and loss surface values are stored in HDF5 (.h5) files after they are produced.

Setup

Installation

Tested on Ubuntu 16.04.6 LTS with Conda 4.8.3.

Option 1

Run conda env create python=3.8 -f env.yml

(created with conda env export -f env.yml --no-builds)

Option 2

Run conda create python=3.8 --name loss_landscape --file env_explicit.txt

(created with conda list --explicit > env_explicit.txt)

Troubleshooting

If none of the above options work: Try to install the packages manually. The most important packages are listed in the section Environment.

Environment

What exactly do I need to do to make it work?

If you have a new dataset: add a new folder datasets/{your_dataset_name}.
Add you data to datasets/{your_dataset_name}/data.
Add the model definitions to a file in datasets/{your_dataset_name}/models.
Add your trained network to a file in datasets/{your_dataset_name}/trained_nets/{your_model_with_hyper_parameters}.
Add a file data_loader.py in datasets/{your_dataset_name} and implement the method get_data_loaders(). You can find documentation in data_loader.py.
Add a file model_loader.py in datasets/{your_dataset_name} and implement the method load(). Also add to the file a dictionary called models containing a mapping between the name of your model and the model function. You can find documentation in model_loader.py.

Examples for running it

Locally without GPU

Implicit (short version):

python plot_surface.py --name test_plot --model resnet56 --dataset cifar10 --x=-1:1:51 --y=-1:1:51 --plot \
--model_file datasets/cifar10/trained_nets/resnet56_sgd_lr=0.1_bs=128_wd=0.0005/model_300.t7

Explicit (long version):

python plot_surface.py --name test_plot --model resnet56 --dataset cifar10 --x=-1:1:51 --y=-1:1:51 --plot \
--model_file datasets/cifar10/trained_nets/resnet56_sgd_lr=0.1_bs=128_wd=0.0005/model_300.t7 \
--dir_type weights --xnorm filter --xignore biasbn --ynorm filter --yignore biasbn

On a server with 4 GPUs and 16 CPUs

Implicit (short version):

nohup python plot_surface.py --name test_plot --model init_baseline_vgglike --dataset cinic10 --x=-1:1:51 --y=-1:1:51 --plot \
--model_file datasets/cinic10/trained_nets/init_baseline_vgglike_sgd_lr=0.1_bs=128_wd=0.0005_mom=0.9_save_epoch=1_ngpu=4/model_10.t7 \
--cuda --ngpu 4 --threads 8 --batch_size 8192 > nohup.out &

Explicit (long version):

nohup python plot_surface.py --name test_plot --model init_baseline_vgglike --dataset cinic10 --x=-1:1:51 --y=-1:1:51 --plot \
--model_file datasets/cinic10/trained_nets/init_baseline_vgglike_sgd_lr=0.1_bs=128_wd=0.0005_mom=0.9_save_epoch=1_ngpu=4/model_10.t7 \
--cuda --ngpu 4 --threads 8 --batch_size 8192 \
--dir_type weights --xnorm filter --xignore biasbn --ynorm filter --yignore biasbn > nohup.out &

Please find the description of all the possible parameters in plot_surface.py. More examples can be found in plot_examples.sh.

Make sure you do not use mpi when you run it on a single machine.

Pretrained Models

The code accepts pre-trained PyTorch models for the CIFAR-10 and CINIC-10 datasets out of the box, but other datasets can also be added. To load the pre-trained model correctly, the model file should contain state_dict, which is saved from the state_dict() method. The default path for pre-trained networks is cifar10/trained_nets. Some of the pre-trained models and plotted figures can be downloaded here:

VGG-9 (349 MB)
ResNet-56 (10 MB)
ResNet-56-noshort (20 MB)
DenseNet-121 (75 MB)

Data preprocessing

The data pre-processing method used for visualization should be consistent with the one used for model training. No data augmentation (random cropping or horizontal flipping) is used in calculating the loss values.

Troubleshooting

libgfortran 4.0.0 does not seem to be compatible with linux. Make sure you don't update the dependencies to include this.

Citation

If you find this code useful in your research, please cite:

@inproceedings{visualloss,
  title={Visualizing the Loss Landscape of Neural Nets},
  author={Li, Hao and Xu, Zheng and Taylor, Gavin and Studer, Christoph and Goldstein, Tom},
  booktitle={Neural Information Processing Systems},
  year={2018}
}

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

0.0.6.dev6 pre-release

Sep 25, 2020

0.0.6.dev5 pre-release

Sep 23, 2020

0.0.6.dev4 pre-release

Sep 23, 2020

0.0.6.dev3 pre-release

Sep 23, 2020

0.0.6.dev2 pre-release

Jun 26, 2020

0.0.6.dev1 pre-release

Jun 26, 2020

This version

0.0.5

Jun 15, 2020

0.0.4

Jun 14, 2020

0.0.3

Jun 14, 2020

0.0.2

Jun 3, 2020

0.0.1

Jun 2, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

loss_landscape-0.0.5.tar.gz (17.7 kB view details)

Uploaded Jun 15, 2020 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

loss_landscape-0.0.5-py3-none-any.whl (19.5 kB view details)

Uploaded Jun 15, 2020 Python 3

File details

Details for the file loss_landscape-0.0.5.tar.gz.

File metadata

Download URL: loss_landscape-0.0.5.tar.gz
Upload date: Jun 15, 2020
Size: 17.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.2

File hashes

Hashes for loss_landscape-0.0.5.tar.gz
Algorithm	Hash digest
SHA256	`e4176f3efafaf0c29791b04a4fc82b7fe4b3babc8264abfb51c0b8663cce3e65`
MD5	`256ce39b2fcdfcfb4969dd0b946532a9`
BLAKE2b-256	`ef96406c05a52aa388756985e35f368d40157d947c2e886473f6254678b9a37c`

See more details on using hashes here.

File details

Details for the file loss_landscape-0.0.5-py3-none-any.whl.

File metadata

Download URL: loss_landscape-0.0.5-py3-none-any.whl
Upload date: Jun 15, 2020
Size: 19.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.2

File hashes

Hashes for loss_landscape-0.0.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6601e120166090f1fdb304dee262a1170a2304d3ec15d155f1991eaf46deb7fe`
MD5	`346f89aab5a1960b47968d5f341b10e0`
BLAKE2b-256	`b9b55d3678199440d931aeed98ba150edafca7075f2252e82a9c6f854e65d3f6`

See more details on using hashes here.

loss-landscape 0.0.5

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Visualizing the Loss Landscape of Neural Nets

Setup

Installation

Option 1

Option 2

Troubleshooting

Environment

What exactly do I need to do to make it work?

Examples for running it

Locally without GPU

On a server with 4 GPUs and 16 CPUs

Pretrained Models

Data preprocessing

Troubleshooting

Citation

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes