weightwatcher

Analyze weight matrices of Deep Neural Networks

Weight Watcher

Current Version: 0.4

Weight Watcher analyzes the Fat Tails in the weight matrices of Deep Neural Networks (DNNs).

This tool can predict trends in the generalization accuracy of a series of DNNs, such as VGG11, VGG13, ..., or even the entire series of ResNet models, without needing a test set.

This relies upon recent research into Heavy (Fat) Tailed Self Regularization in DNNs.

The tool lets one compute an average capacity, or quality, metric for a series of DNNs trained on the same data but with different hyperparameters, or even with different but related architectures. For example, it can predict that VGG19_BN generalizes better than VGG19, and better than VGG16_BN, VGG16, etc.

Types of Capacity Metrics:

We use 2 basic types of metrics:

alpha (the average power law exponent)
weighted alpha / log_alpha_norm (scale adjusted alpha metrics)

The average alpha can be used to compare one or more DNN models with different hyperparameter settings, but of the same depth. The average weighted alpha is suitable for DNNs of differing depths.
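As a rough guide to what these averages compute, the definitions below follow the paper listed under "Latest paper and results". Here L is the number of layers analyzed, alpha_l is the fitted power-law exponent of layer l, and lambda_{l,i} are the eigenvalues of that layer's correlation matrix, with lambda_l^max the largest. The base-10 logarithm and the exact normalization are assumptions about what the tool reports, not something this README states.

```latex
\bar{\alpha} = \frac{1}{L}\sum_{l=1}^{L} \alpha_l,
\qquad
\hat{\alpha} = \frac{1}{L}\sum_{l=1}^{L} \alpha_l \,\log_{10} \lambda_{l}^{\max},
\qquad
\log_{10} \lVert W_l \rVert_{\alpha_l}^{\alpha_l} = \log_{10} \sum_{i} \lambda_{l,i}^{\alpha_l}
```

The weighting by log lambda_l^max is what adjusts each layer's alpha for scale, which is why the weighted metric is the one suggested for models of differing depths.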

Here is an example of the Weighted Alpha capacity metric for all the current pretrained VGG models.

[Figure: Weighted Alpha capacity metric for the pretrained VGG models]

Notice: we did not peek at the ImageNet test data to build this plot.

Frameworks supported

Tensorflow 2.x / Keras
PyTorch
HuggingFace

Layers supported

Dense / Linear / Fully Connected (and Conv1D)
Conv2D

Installation

pip install weightwatcher

Usage

import weightwatcher as ww
import torchvision.models as models

model = models.vgg19_bn(pretrained=True)
watcher = ww.WeightWatcher(model=model)
details = watcher.analyze()
summary = watcher.get_summary(details)

It is easy to run and generates a pandas dataframe with details (and plots) for each layer

[Figure: Sample Details Dataframe]

and a summary dict of generalization metrics

    {'log_norm': 2.11,
     'alpha': 3.06,
     'alpha_weighted': 2.78,
     'log_alpha_norm': 3.21,
     'log_spectral_norm': 0.89,
     'stable_rank': 20.90,
     'mp_softrank': 0.52}
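Once you have one summary dict per model, comparing a series of models is a matter of sorting on a metric. The sketch below uses made-up numbers and hypothetical model names, and it assumes (following the research the tool is based on) that a smaller weighted alpha goes with better generalization; `rank_models` is an illustrative helper, not part of weightwatcher.

```python
# Hypothetical summaries, one per model, shaped like the dict from get_summary().
# The numbers are made up for illustration; they are not measured results.
summaries = {
    "model_a": {"alpha": 3.40, "alpha_weighted": 3.10},
    "model_b": {"alpha": 3.06, "alpha_weighted": 2.78},
    "model_c": {"alpha": 3.75, "alpha_weighted": 3.55},
}


def rank_models(summaries, metric="alpha_weighted"):
    """Return model names sorted by the chosen metric, smallest value first."""
    return sorted(summaries, key=lambda name: summaries[name][metric])


ranking = rank_models(summaries)
print(ranking)
```

Use `alpha` as the metric when the models share the same depth, and `alpha_weighted` when they do not.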

More examples are included in the Demo Notebook

and will be made available shortly in a Jupyter book.

Advanced Usage

The watcher object has several functions and analyze features described below

analyze(model=None, layers=[], min_evals=0, max_evals=None,
        plot=True, randomize=True, mp_fit=True, ww2x=False)
...
describe(model=None, layers=[], min_evals=0, max_evals=None,
         plot=True, randomize=True, mp_fit=True, ww2x=False)
...
get_details()
get_summary(details) or get_summary()
get_ESD()
...
distances(model_1, model_2)

filter by layer types

ww.LAYER_TYPE.CONV2D | ww.LAYER_TYPE.DENSE

details=watcher.analyze(layers=[ww.LAYER_TYPE.CONV2D])
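The `|` in the expression above suggests that the layer types behave like bit flags. A minimal sketch of how such a filter can work, using Python's standard `enum.IntFlag`; the `LayerType` class, its member values, and `matches` are hypothetical stand-ins, not the actual `ww.LAYER_TYPE` definition.

```python
from enum import IntFlag, auto


class LayerType(IntFlag):
    """Hypothetical stand-in for ww.LAYER_TYPE; member values are illustrative."""
    DENSE = auto()
    CONV1D = auto()
    CONV2D = auto()


def matches(layer_type, wanted):
    """True if layer_type is one of the flags combined into wanted."""
    return bool(layer_type & wanted)


wanted = LayerType.CONV2D | LayerType.DENSE
layers = [
    ("fc1", LayerType.DENSE),
    ("conv1", LayerType.CONV2D),
    ("conv0", LayerType.CONV1D),
]
selected = [name for name, kind in layers if matches(kind, wanted)]
print(selected)
```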

filter by ids or name

details=watcher.analyze(layers=[20])

minimum, maximum number of eigenvalues of the layer weight matrix

Sets the minimum and maximum size of the weight matrices analyzed, measured by their number of eigenvalues. Setting max_evals is useful for quick debugging.

details = watcher.analyze(min_evals=50, max_evals=500)
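To see which layers such a filter would keep, recall that a dense N x M weight matrix has min(N, M) eigenvalues. The sketch below applies that rule to some hypothetical layer shapes; `num_evals` and `keep_layer` are illustrative helpers, and treating both bounds as inclusive is an assumption about the tool's behavior.

```python
def num_evals(shape):
    """Number of eigenvalues of W^T W for a dense weight matrix of the given shape."""
    rows, cols = shape
    return min(rows, cols)


def keep_layer(shape, min_evals=0, max_evals=None):
    """Mimic the min_evals / max_evals filter (inclusive bounds are an assumption)."""
    n = num_evals(shape)
    if n < min_evals:
        return False
    if max_evals is not None and n > max_evals:
        return False
    return True


# Hypothetical layer names and shapes.
shapes = {"fc_small": (64, 10), "fc_mid": (512, 256), "fc_big": (4096, 4096)}
kept = [name for name, shape in shapes.items() if keep_layer(shape, 50, 500)]
print(kept)
```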

plots (for each layer)

Create ESD plots for each layer weight matrix to observe how well the power law fits work

details = watcher.analyze(plot=True)
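For intuition about what a power law fit estimates, here is a minimal sketch of the standard continuous maximum-likelihood estimator for the exponent, given a fixed lower cutoff `xmin`. It is not the tool's fitting code: `fit_alpha` is a hypothetical helper, the cutoff is chosen by hand, and the data are synthetic quantiles of a pure power law with exponent 3.

```python
import math


def fit_alpha(eigenvalues, xmin):
    """Continuous power-law MLE: alpha = 1 + n / sum(ln(x / xmin)) over x >= xmin."""
    tail = [x for x in eigenvalues if x >= xmin]
    log_sum = sum(math.log(x / xmin) for x in tail)
    if not tail or log_sum == 0.0:
        raise ValueError("need at least one eigenvalue strictly above xmin")
    return 1.0 + len(tail) / log_sum


# Synthetic tail: evenly spaced quantiles of p(x) ~ x^(-3) with xmin = 1.
true_alpha, n = 3.0, 1000
evals = [(1.0 - (i + 0.5) / n) ** (-1.0 / (true_alpha - 1.0)) for i in range(n)]
alpha = fit_alpha(evals, xmin=1.0)
print(round(alpha, 2))
```

On a real ESD the choice of `xmin` matters a great deal, which is one reason to inspect the plots rather than trust the exponent alone.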

compare layer ESD to randomized W matrix

The randomize option compares the ESD of the layer weight matrix (W) to the ESD of the randomized W matrix. This is a good way to visualize the correlations in the true ESD.

details = watcher.analyze(randomize=True, plot=True)
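One simple way to randomize a matrix is to shuffle its entries across all positions: the individual values (and so the Frobenius norm) are unchanged, but any structure between rows and columns is destroyed. The sketch below illustrates that idea with plain lists; whether the tool randomizes W in exactly this way is an assumption, and `randomize_matrix` is a hypothetical helper.

```python
import random


def randomize_matrix(W, seed=0):
    """Return a copy of W with its entries shuffled across all positions."""
    rows, cols = len(W), len(W[0])
    flat = [value for row in W for value in row]
    random.Random(seed).shuffle(flat)
    return [flat[r * cols:(r + 1) * cols] for r in range(rows)]


# A rank-1 matrix: the second row is exactly twice the first.
W = [[1.0, 2.0, 3.0], [2.0, 4.0, 6.0]]
W_rand = randomize_matrix(W)

# Same entries, same shape; only their arrangement differs.
same_entries = sorted(v for row in W for v in row) == sorted(v for row in W_rand for v in row)
print(same_entries)
```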

fit ESDs to a Marchenko-Pastur (MP) distribution

Attempts to fit the ESD to an MP distribution.

details = watcher.analyze(mp_fit=True, plot=True)

and reports the

num_spikes, mp_sigma, and mp_softrank

Also works for the randomized ESD and reports

rand_num_spikes, rand_mp_sigma, and rand_mp_softrank
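As background for these quantities, the MP distribution for an N x M random matrix with entry variance sigma^2 has a bulk with sharp edges, and eigenvalues above the upper edge are commonly called spikes. The sketch below computes the textbook bulk edges for the eigenvalues of W^T W / N and counts spikes against them. How the tool actually fits sigma and counts spikes is not described here, so treat `mp_bulk_edges` and `count_spikes` as hypothetical helpers.

```python
import math


def mp_bulk_edges(n_rows, n_cols, sigma=1.0):
    """Edges of the MP bulk for the eigenvalues of W^T W / N, with Q = N / M >= 1."""
    big, small = max(n_rows, n_cols), min(n_rows, n_cols)
    q = big / small
    lam_minus = sigma ** 2 * (1.0 - 1.0 / math.sqrt(q)) ** 2
    lam_plus = sigma ** 2 * (1.0 + 1.0 / math.sqrt(q)) ** 2
    return lam_minus, lam_plus


def count_spikes(eigenvalues, n_rows, n_cols, sigma=1.0):
    """Count eigenvalues lying above the upper MP bulk edge."""
    _, lam_plus = mp_bulk_edges(n_rows, n_cols, sigma)
    return sum(1 for lam in eigenvalues if lam > lam_plus)


lam_minus, lam_plus = mp_bulk_edges(400, 100)  # Q = 4
spikes = count_spikes([0.3, 1.1, 2.0, 2.5, 9.0], 400, 100)
print(lam_minus, lam_plus, spikes)
```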

get the ESD for a specific layer, for visualization or further analysis

watcher.analyze()
esd = watcher.get_ESD()

describe a model

Describe a model and report the details dataframe, without analyzing it

details = watcher.describe(model=model)

get summary

Get the average metrics, as a summary (dict), from the given (or current) details dataframe

details = watcher.analyze(model=model)
summary = watcher.get_summary(details)

or just

watcher.analyze()
summary = watcher.get_summary()
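Conceptually, the summary is a set of per-column averages over the layers in the details dataframe. A minimal sketch of that reduction, using plain dicts in place of the dataframe; the rows, the column choice, and the rule of skipping missing values are assumptions for illustration, and `summarize` is not the library's function.

```python
from statistics import mean


def summarize(details, columns):
    """Average each metric column over the layers that report a value for it."""
    summary = {}
    for col in columns:
        values = [row[col] for row in details if row.get(col) is not None]
        if values:
            summary[col] = mean(values)
    return summary


# Hypothetical per-layer rows standing in for the details dataframe.
details_rows = [
    {"layer_id": 2, "alpha": 2.5, "log_norm": 1.0},
    {"layer_id": 5, "alpha": 3.5, "log_norm": 2.0},
    {"layer_id": 8, "alpha": 3.0, "log_norm": None},
]
summary_sketch = summarize(details_rows, ["alpha", "log_norm"])
print(summary_sketch)
```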

compare 2 models

The new distances method reports the distances between 2 models, such as the norm between the initial weight matrices and the final, trained weight matrices.

details = watcher.distances(initial_model, trained_model)
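For a single pair of layers, one natural distance of this kind is the Frobenius norm of the difference between the two weight matrices. The sketch below computes it for small hand-written matrices; which norm the distances method actually uses is an assumption, and `frobenius_distance` is a hypothetical helper.

```python
import math


def frobenius_distance(A, B):
    """Frobenius norm of A - B for two equally shaped matrices (lists of rows)."""
    if len(A) != len(B) or any(len(ra) != len(rb) for ra, rb in zip(A, B)):
        raise ValueError("matrices must have the same shape")
    return math.sqrt(
        sum((a - b) ** 2 for ra, rb in zip(A, B) for a, b in zip(ra, rb))
    )


W_initial = [[0.0, 0.0], [0.0, 0.0]]
W_trained = [[3.0, 0.0], [0.0, 4.0]]
d = frobenius_distance(W_initial, W_trained)
print(d)
```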

compatibility with version 0.2x

The new 0.4 version of weightwatcher treats each layer as a single, unified set of eigenvalues. In contrast, the 0.2x versions split the Conv2D layers into n slices, 1 for each receptive field. The ww2x option provides results which are backward compatible with the 0.2x version of weightwatcher, with details provided for each slice of each layer.

details = watcher.analyze(ww2x=True)
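To picture the slicing, think of a Conv2D kernel as a grid of receptive-field positions, each holding one N x M weight matrix. The sketch below pulls those matrices out of a nested-list kernel and counts the eigenvalues each would contribute. The index layout `kernel[i][j]` is an assumption (frameworks order the axes differently), and both helpers are hypothetical.

```python
def conv2d_slices(kernel):
    """Flatten a kernel indexed as kernel[i][j] -> (N x M matrix) into a list of matrices."""
    return [kernel[i][j] for i in range(len(kernel)) for j in range(len(kernel[i]))]


def evals_per_slice(matrix):
    """Each N x M slice contributes min(N, M) eigenvalues."""
    return min(len(matrix), len(matrix[0]))


# Hypothetical 3x3 receptive field, each position holding a 4 x 2 matrix of zeros.
kernel = [[[[0.0] * 2 for _ in range(4)] for _ in range(3)] for _ in range(3)]
slices = conv2d_slices(kernel)
total_evals = sum(evals_per_slice(m) for m in slices)
print(len(slices), total_evals)
```

With ww2x=True each of these slices gets its own row of details; without it, the layer is reported once.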

Demo Notebook

Calculation Consulting homepage

Calculated Content Blog

This tool is based on state-of-the-art research done in collaboration with UC Berkeley:

and has been presented at Stanford, UC Berkeley, etc:

and major AI conferences like ICML, KDD, etc.

KDD2019 Workshop

and has been the subject of many popular podcasts

KDD 2019 Workshop: Slides

Aggregate Intellect Podcast

Latest paper and results

Predicting trends in the quality of state-of-the-art neural networks without access to training or testing data

Repo for latest paper

Talk on latest results, Stanford ICME 2020

How to Release

Publishing to the PyPI repository:

# 1. Check in the latest code with the correct revision number (__version__ in __init__.py)
vi weightwatcher/__init__.py # Increase release number, remove -dev from revision number
git commit
# 2. Check out latest version from the repo in a fresh directory
cd ~/temp/
git clone https://github.com/CalculatedContent/WeightWatcher
cd WeightWatcher/
# 3. Use the latest version of the tools
python -m pip install --upgrade setuptools wheel twine
# 4. Create the package
python setup.py sdist bdist_wheel
# 5. Test the package
twine check dist/*
# 6. Upload the package to PyPI
twine upload dist/*
# 7. Tag/Release in github by creating a new release (https://github.com/CalculatedContent/WeightWatcher/releases/new)

License

Apache License 2.0

Slack Channel

We have a Slack channel for the tool if you need help. For an invite, please send an email to charles@calculationconsulting.com

Contributors

Charles H Martin, PhD Calculation Consulting

Serena Peng

Release history

Version	Released	Notes
0.7.7	Feb 21, 2026
0.7.6	Nov 13, 2025
0.7.5.5	Sep 11, 2024
0.7.5.2	Mar 6, 2024
0.7.5	Feb 27, 2024
0.7.4.9	Feb 26, 2024
0.7.4.8	Feb 26, 2024
0.7.4.7	Feb 13, 2024
0.7.4.3	Jan 28, 2024
0.7.4.2	Jan 25, 2024
0.7.4.1	Jan 20, 2024
0.7.3.2	Nov 20, 2023
0.7.3.1	Aug 25, 2023
0.7.1.5	Apr 16, 2023
0.7.1.4	Apr 16, 2023
0.7.0.9	Apr 5, 2023
0.7.0.8	Apr 3, 2023
0.7.0.6	Mar 30, 2023
0.7.0.5	Mar 29, 2023
0.7.0.4	Mar 29, 2023
0.7.0.2	Mar 25, 2023
0.7	Mar 21, 2023
0.6.4	Jan 22, 2023
0.6.3.3	Jan 22, 2023	yanked (reason: wrong name)
0.6.3	Jan 22, 2023
0.6.1	Nov 14, 2022
0.6.0	Nov 13, 2022	yanked (reason: missing telly)
0.5.7	Nov 3, 2022
0.5.6	Feb 8, 2022
0.5.5	Oct 17, 2021
0.5.1	Aug 18, 2021
0.5	Aug 17, 2021
0.4.7	May 26, 2021
0.4.6	Apr 20, 2021
0.4.5	Apr 19, 2021
0.4.4	Apr 2, 2021
0.4.2	Apr 1, 2021
0.4.1	Mar 30, 2021
0.4.0	Oct 24, 2020	this version
0.2.7	Jan 16, 2020
0.2.6	Jan 14, 2020
0.2.5	Jan 13, 2020
0.2.4	Jan 11, 2020
0.2.3	Jan 9, 2020
0.2.2	Dec 28, 2019
0.2.1	Nov 7, 2019
0.2	Nov 4, 2019
0.1.2	Jun 5, 2019
0.1.1	Nov 28, 2018
0.1	Nov 28, 2018

Download files

Source Distribution

weightwatcher-0.4.0.tar.gz (35.0 kB)

Uploaded Oct 24, 2020 Source

Built Distribution

weightwatcher-0.4.0-py3-none-any.whl (30.4 kB)

Uploaded Oct 24, 2020 Python 3

File details

Details for the file weightwatcher-0.4.0.tar.gz.

File metadata

Download URL: weightwatcher-0.4.0.tar.gz
Upload date: Oct 24, 2020
Size: 35.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/50.3.2 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.5

File hashes

Hashes for weightwatcher-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`7923ad4c403aa03eeb8c40c2dbc0f6236bb52e071d197ac05af1c1421add3009`
MD5	`3360517a00266625e9004a1353bc8a38`
BLAKE2b-256	`872d9da72552ec57d74c66da9c9a98368565f4d033e32433a2f1aa7b02d59e3d`

File details

Details for the file weightwatcher-0.4.0-py3-none-any.whl.

File metadata

Download URL: weightwatcher-0.4.0-py3-none-any.whl
Upload date: Oct 24, 2020
Size: 30.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/50.3.2 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.5

File hashes

Hashes for weightwatcher-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d8cab9731a6f08e744706ccdb282e1271edec6a9dc65eab08654a1b2b0c56b17`
MD5	`d2b47dac9f3be3e45f73d18247438fc8`
BLAKE2b-256	`173e85fdb22e3c140ca2ff4a8e7637ce022691cac301a9304f5e18d504be68e1`
