Skip to main content

A gradient processing and optimization library in JAX.

Project description

Optax

CI status Documentation Status pypi

Introduction

Optax is a gradient processing and optimization library for JAX.

Optax is designed to facilitate research by providing building blocks that can be easily recombined in custom ways.

Our goals are to

  • Provide simple, well-tested, efficient implementations of core components.
  • Improve research productivity by enabling to easily combine low-level ingredients into custom optimizers (or other gradient processing components).
  • Accelerate adoption of new ideas by making it easy for anyone to contribute.

We favor focusing on small composable building blocks that can be effectively combined into custom solutions. Others may build upon these basic components in more complicated abstractions. Whenever reasonable, implementations prioritize readability and structuring code to match standard equations, over code reuse.

An initial prototype of this library was made available in JAX's experimental folder as jax.experimental.optix. Given the wide adoption across DeepMind of optix, and after a few iterations on the API, optix was eventually moved out of experimental as a standalone open-source library, and renamed optax.

Documentation on Optax can be found at optax.readthedocs.io.

Installation

You can install the latest released version of Optax from PyPI via:

pip install optax

or you can install the latest development version from GitHub:

pip install git+https://github.com/google-deepmind/optax.git

Quickstart

Optax contains implementations of many popular optimizers and loss functions. For example, the following code snippet uses the Adam optimizer from optax.adam and the mean squared error from optax.l2_loss. We initialize the optimizer state using the init function and params of the model.

optimizer = optax.adam(learning_rate)
# Obtain the `opt_state` that contains statistics for the optimizer.
params = {'w': jnp.ones((num_weights,))}
opt_state = optimizer.init(params)

To write the update loop we need a loss function that can be differentiated by Jax (with jax.grad in this example) to obtain the gradients.

compute_loss = lambda params, x, y: optax.l2_loss(params['w'].dot(x), y)
grads = jax.grad(compute_loss)(params, xs, ys)

The gradients are then converted via optimizer.update to obtain the updates that should be applied to the current parameters to obtain the new ones. optax.apply_updates is a convenience utility to do this.

updates, opt_state = optimizer.update(grads, opt_state)
params = optax.apply_updates(params, updates)

You can continue the quick start in the Optax 🚀 Getting started notebook.

Development

We welcome new contributors.

Source code

You can check the latest sources with the following command.

git clone https://github.com/google-deepmind/optax.git

Testing

To run the tests, please execute the following script.

sh test.sh

Documentation

To build the documentation, first ensure that all the dependencies are installed.

pip install -e ".[docs]"

Then, execute the following.

cd docs
make html

Benchmarks

If you feel lost in the crowd of available optimizers for deep learning, there exist some extensive benchmarks:

Benchmarking Neural Network Training Algorithms, Dahl G. et al, 2023,

Descending through a Crowded Valley — Benchmarking Deep Learning Optimizers, Schmidt R. et al, 2021.

If you are interested in developing your own benchmark for some tasks, consider the following framework

Benchopt: Reproducible, efficient and collaborative optimization benchmarks, Moreau T. et al, 2022.

Finally, if you are searching for some recommendations on tuning optimizers, consider taking a look at

Deep Learning Tuning Playbook, Godbole V. et al, 2023.

Citing Optax

This repository is part of the DeepMind JAX Ecosystem, to cite Optax please use the citation:

@software{deepmind2020jax,
  title = {The {D}eep{M}ind {JAX} {E}cosystem},
  author = {DeepMind and Babuschkin, Igor and Baumli, Kate and Bell, Alison and Bhupatiraju, Surya and Bruce, Jake and Buchlovsky, Peter and Budden, David and Cai, Trevor and Clark, Aidan and Danihelka, Ivo and Dedieu, Antoine and Fantacci, Claudio and Godwin, Jonathan and Jones, Chris and Hemsley, Ross and Hennigan, Tom and Hessel, Matteo and Hou, Shaobo and Kapturowski, Steven and Keck, Thomas and Kemaev, Iurii and King, Michael and Kunesch, Markus and Martens, Lena and Merzic, Hamza and Mikulik, Vladimir and Norman, Tamara and Papamakarios, George and Quan, John and Ring, Roman and Ruiz, Francisco and Sanchez, Alvaro and Sartran, Laurent and Schneider, Rosalia and Sezener, Eren and Spencer, Stephen and Srinivasan, Srivatsan and Stanojevi\'{c}, Milo\v{s} and Stokowiec, Wojciech and Wang, Luyu and Zhou, Guangyao and Viola, Fabio},
  url = {http://github.com/google-deepmind},
  year = {2020},
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

optax-0.2.4.tar.gz (229.7 kB view details)

Uploaded Source

Built Distribution

optax-0.2.4-py3-none-any.whl (319.2 kB view details)

Uploaded Python 3

File details

Details for the file optax-0.2.4.tar.gz.

File metadata

  • Download URL: optax-0.2.4.tar.gz
  • Upload date:
  • Size: 229.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for optax-0.2.4.tar.gz
Algorithm Hash digest
SHA256 4e05d3d5307e6dde4c319187ae36e6cd3a0c035d4ed25e9e992449a304f47336
MD5 9cdd2f9ae8aef2e709e336e6d09411f8
BLAKE2b-256 afb5f88a0d851547b2e6b2c7e7e6509ad66236b3e7019f1f095bb03dbaa61fa1

See more details on using hashes here.

File details

Details for the file optax-0.2.4-py3-none-any.whl.

File metadata

  • Download URL: optax-0.2.4-py3-none-any.whl
  • Upload date:
  • Size: 319.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for optax-0.2.4-py3-none-any.whl
Algorithm Hash digest
SHA256 db35c04e50b52596662efb002334de08c2a0a74971e4da33f467e84fac08886a
MD5 75da99bca79c999c245299b8f4c1e3d2
BLAKE2b-256 5c2428d0bb21600a78e46754947333ec9a297044af884d360092eb8561575fe9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page