
Gradient Agreement Filtering (GAF)

This package implements the Gradient Agreement Filtering (GAF) optimization algorithm.

GAF is a novel optimization algorithm that improves gradient-based training by filtering out gradients from batches that do not agree with one another, as measured by cosine distance. This nearly eliminates the need for a validation set and avoids the risk of overfitting, even with noisy labels. GAF bolts on top of existing optimization procedures such as SGD, SGD with Nesterov momentum, Adam, AdamW, and RMSProp, and outperforms the unfiltered baseline in all cases tested. Full paper here:

TODO: Insert arxiv paper link.
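
To make the core idea concrete, here is a minimal sketch of agreement filtering between two microbatch gradients, assuming cosine distance as the agreement measure. The helper names and the flattened-gradient representation are illustrative only, not the package's internals:

    import torch
    import torch.nn.functional as F

    def cosine_distance(g1, g2):
        # Cosine distance between two flattened gradient vectors:
        # 0 means the gradients point the same way, 2 means opposite.
        return 1.0 - F.cosine_similarity(g1, g2, dim=0)

    def filter_and_combine(grad_a, grad_b, cos_distance_thresh=0.97):
        # Keep the step only if the two microbatch gradients agree,
        # i.e. their cosine distance is below the threshold; otherwise
        # filter the step out entirely.
        if cosine_distance(grad_a, grad_b) < cos_distance_thresh:
            return (grad_a + grad_b) / 2.0
        return None  # disagreement: no parameter update this step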

Repo Features

  • Supports multiple optimizers: SGD, SGD with Nesterov momentum, Adam, AdamW, RMSProp.
  • Implements Gradient Agreement Filtering based on cosine distance.
  • Allows for label noise injection by flipping a percentage of labels.
  • Customizable hyperparameters via command-line arguments.
  • Logging and tracking with Weights & Biases (wandb).

Requirements

  • Python 3.6 or higher
  • PyTorch 1.7 or higher
  • torchvision 0.8 or higher
  • numpy
  • wandb

Installation

You can install from source:

git clone https://github.com/<insert your username>/gradient_agreement_filtering.git
cd gradient_agreement_filtering
pip install .

or via pip:

pip install gradient-agreement-filtering

Usage

We provide two ways to easily incorporate GAF into your existing training.

  1. step_GAF(): If you want to use GAF inside your existing train loop, you can just replace your typical:

    ...
    optimizer.zero_grad()
    outputs = model(batch)
    loss = criterion(outputs, labels)
    loss.backward()
    optimizer.step()
    ...
    

    with one call to step_GAF() as per below (a fuller end-to-end sketch follows after this list):

    from gradient_agreement_filtering import step_GAF
    ...
    results = step_GAF(model,
                       optimizer,
                       criterion,
                       list_of_microbatches,
                       wandb=True,
                       verbose=True,
                       cos_distance_thresh=0.97,
                       device=gpu_device)
    ...
    
  2. train_GAF():

    If you want GAF to run the training loop for you, you can replace your typical Hugging Face / Keras style call:

    trainer.train()
    

    with one call to train_GAF() as per below:

    from gradient_agreement_filtering import train_GAF
    ...
    train_GAF(model,
              args,
              train_dataset,
              val_dataset,
              optimizer,
              criterion,
              wandb=True,
              verbose=True,
              cos_distance_thresh=0.97,
              device=gpu_device)
    ...
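
For orientation, here is a minimal end-to-end sketch of step_GAF() inside a training loop, under the assumption that each microbatch is an (inputs, labels) pair drawn from a DataLoader and that two microbatches are compared per step. The synthetic dataset and linear model are placeholders standing in for CIFAR-100 and ResNet18; only the step_GAF() call itself comes from this package:

    import torch
    from torch.utils.data import DataLoader, TensorDataset
    from gradient_agreement_filtering import step_GAF

    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

    # Placeholder model and data standing in for ResNet18 on CIFAR-100.
    model = torch.nn.Linear(3 * 32 * 32, 100).to(device)
    criterion = torch.nn.CrossEntropyLoss()
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01,
                                momentum=0.9, nesterov=True)

    train_dataset = TensorDataset(torch.randn(1024, 3 * 32 * 32),
                                  torch.randint(0, 100, (1024,)))
    loader = DataLoader(train_dataset, batch_size=128, shuffle=True)

    for epoch in range(5):
        it = iter(loader)
        while True:
            try:
                # GAF compares gradients across microbatches, so each
                # optimization step consumes several batches at once.
                microbatches = [next(it) for _ in range(2)]
            except StopIteration:
                break
            results = step_GAF(model,
                               optimizer,
                               criterion,
                               microbatches,
                               wandb=False,
                               verbose=True,
                               cos_distance_thresh=0.97,
                               device=device)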
    

Examples

NOTE: running with wandb

For all of the scripts below, if you want to run with wandb, you can either set your API key directly in the script:

os.environ["WANDB_API_KEY"] = "<your-wandb-api-key>"

Or you can prepend any of the calls below with:

WANDB_API_KEY=<your-wandb-api-key> python *.py 

Or you can log in on the system first and then run the script via:

wandb login <your-wandb-api-key>

Or you can run without it. Choice is yours.

Now please review the examples below.

1_cifar_100_train_loop_exposed.py

This file uses step_GAF() to train a ResNet18 model on the CIFAR-100 dataset in PyTorch, with the option to inject label noise so you can observe how GAF performs under noisy conditions. The script supports various optimizers and configurations, allowing you to experiment with different settings and understand the impact of GAF on model training.

Example call:

python examples/1_cifar_100_train_loop_exposed.py --GAF True --optimizer "SGD+Nesterov+val_plateau" --learning_rate 0.01 --momentum 0.9 --nesterov True --wandb True --verbose True --num_samples_per_class_per_batch 1 --num_batches_to_force_agreement 2 --label_error_percentage 0.15 --cos_distance_thresh 0.97
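
The --label_error_percentage 0.15 flag above injects 15% label noise. The package handles this internally; purely for illustration, label flipping of this kind can be sketched as follows (the flip_labels helper is hypothetical, not the package's API):

    import torch

    def flip_labels(labels, error_percentage, num_classes=100):
        # Illustrative helper: randomly reassign a given fraction of
        # labels to a different class to simulate annotation noise.
        labels = labels.clone()
        n = labels.numel()
        num_to_flip = int(error_percentage * n)
        idx = torch.randperm(n)[:num_to_flip]
        # Shift by a random non-zero offset so each flipped label is
        # guaranteed to differ from the original.
        offsets = torch.randint(1, num_classes, (num_to_flip,))
        labels[idx] = (labels[idx] + offsets) % num_classes
        return labels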

2_cifar_100_trainer.py

This file uses train_GAF() to train a ResNet18 model on the CIFAR-100 dataset using PyTorch just to show how it works.

Example call:

python examples/2_cifar_100_trainer.py 

3_cifar_100N_train_loop_exposed.py

This file uses step_GAF() to train a ResNet34 model on the CIFAR-100N-Fine dataset in PyTorch, to observe how GAF performs under real-world human labeling noise. The script supports various optimizers and configurations, allowing you to experiment with different settings and understand the impact of GAF on model training.

Example call:

python examples/3_cifar_100N_train_loop_exposed.py --GAF True --optimizer "SGD+Nesterov+val_plateau" --cifarn True --learning_rate 0.01 --momentum 0.9 --nesterov True --wandb True --verbose True --num_samples_per_class_per_batch 2 --num_batches_to_force_agreement 2 --cos_distance_thresh 0.97

Citing GAF

To cite this work, please use the following BibTeX entry:

TODO: Insert BibTeX entry.

License

This package is licensed under the MIT license. See LICENSE for details.

