diffusers

Diffusers

These details have not been verified by PyPI

Project links

Homepage

Project description

🤗 Diffusers provides pretrained diffusion models across multiple modalities, such as vision and audio, and serves as a modular toolbox for inference and training of diffusion models.

More precisely, 🤗 Diffusers offers:

State-of-the-art diffusion pipelines that can be run in inference with just a couple of lines of code (see src/diffusers/pipelines).
Various noise schedulers that can be used interchangeably for the preferred speed vs. quality trade-off in inference (see src/diffusers/schedulers).
Multiple types of models, such as UNet, that can be used as building blocks in an end-to-end diffusion system (see src/diffusers/models).
Training examples to show how to train the most popular diffusion models (see examples).

Definitions

Models: Neural network that models $p_\theta(\mathbf{x}_{t-1}|\mathbf{x}_t)$ (see image below) and is trained end-to-end to denoise a noisy input to an image. Examples: UNet, Conditioned UNet, 3D UNet, Transformer UNet

Figure from DDPM paper (https://arxiv.org/abs/2006.11239).
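
For reference, the DDPM paper cited above writes the forward (noising) step and the learned reverse (denoising) step as Gaussians. The symbols $\beta_t$, $\boldsymbol{\mu}_\theta$ and $\boldsymbol{\Sigma}_\theta$ follow that paper's notation and are not defined elsewhere in this README:

$$q(\mathbf{x}_t|\mathbf{x}_{t-1}) = \mathcal{N}\left(\mathbf{x}_t;\ \sqrt{1-\beta_t}\,\mathbf{x}_{t-1},\ \beta_t \mathbf{I}\right)$$

$$p_\theta(\mathbf{x}_{t-1}|\mathbf{x}_t) = \mathcal{N}\left(\mathbf{x}_{t-1};\ \boldsymbol{\mu}_\theta(\mathbf{x}_t, t),\ \boldsymbol{\Sigma}_\theta(\mathbf{x}_t, t)\right)$$

The model is the network that produces $\boldsymbol{\mu}_\theta$ (in practice often via a noise prediction), while the $\beta_t$ values belong to the scheduler.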

Schedulers: Algorithm class for both inference and training. The class provides functionality to compute the previous image according to the alpha, beta schedule, as well as to predict noise for training. Examples: DDPM, DDIM, PNDM, DEIS

Sampling and training algorithms. Figure from DDPM paper (https://arxiv.org/abs/2006.11239).
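
To make the "alpha, beta schedule" concrete, here is a minimal pure-Python sketch of the two jobs a DDPM-style scheduler performs: noising a clean sample for training and computing the mean of the previous sample during inference. It is not the library's implementation. The function names are made up for illustration, and the linear schedule (1000 steps from 1e-4 to 0.02) is the one reported in the DDPM paper.

```python
import math
import random


def linear_betas(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Linearly spaced beta_t values (the schedule reported in the DDPM paper)."""
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cumulative_alphas(betas):
    """alpha_bar_t = product over s <= t of (1 - beta_s)."""
    out, running = [], 1.0
    for beta in betas:
        running *= 1.0 - beta
        out.append(running)
    return out


def add_noise(x0, noise, t, alpha_bars):
    """Training side: build x_t from x_0 and a noise draw, i.e. a sample of q(x_t | x_0)."""
    a = alpha_bars[t]
    return [math.sqrt(a) * x + math.sqrt(1.0 - a) * n for x, n in zip(x0, noise)]


def previous_mean(x_t, predicted_noise, t, betas, alpha_bars):
    """Inference side: mean of p(x_{t-1} | x_t) given a noise prediction."""
    alpha_t = 1.0 - betas[t]
    coef = betas[t] / math.sqrt(1.0 - alpha_bars[t])
    return [(x - coef * e) / math.sqrt(alpha_t) for x, e in zip(x_t, predicted_noise)]


betas = linear_betas()
alpha_bars = cumulative_alphas(betas)

rng = random.Random(0)
x0 = [0.5, -0.25, 1.0]
noise = [rng.gauss(0.0, 1.0) for _ in x0]

# Noise the sample at the first timestep, then undo it with a "perfect" prediction.
x_t = add_noise(x0, noise, t=0, alpha_bars=alpha_bars)
recovered = previous_mean(x_t, noise, 0, betas, alpha_bars)
```

With the true noise standing in for the model's prediction, one reverse step from `t=0` returns the original values up to floating-point error, which is a quick sanity check that the two formulas are consistent.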

Diffusion Pipeline: End-to-end pipeline that includes multiple diffusion models, possibly text encoders, ... Examples: Glide, Latent-Diffusion, Imagen, DALL-E 2

Figure from Imagen (https://imagen.research.google/).

Philosophy

Readability and clarity are preferred over highly optimized code. Strong emphasis is placed on providing readable, intuitive and elementary code design. For example, the provided schedulers are separated from the provided models and come with well-commented code that can be read alongside the original paper.
Diffusers is modality independent and focuses on providing pretrained models and tools to build systems that generate continuous outputs, e.g. vision and audio.
Diffusion models and schedulers are provided as concise, elementary building blocks. In contrast, diffusion pipelines are a collection of end-to-end diffusion systems that can be used out-of-the-box, should stay as close as possible to their original implementation, and can include components of other libraries, such as text encoders. Examples of diffusion pipelines are Glide and Latent Diffusion.
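
The separation described above can be illustrated with a toy sketch. Everything here is hypothetical (`ToyScheduler`, `ToyPipeline` and `toy_model` are not library classes): the scheduler owns the timesteps and the update rule, the model only predicts a residual, and the pipeline wires the two together so that schedulers can be swapped without touching the model.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class ToyScheduler:
    """Hypothetical scheduler: owns timesteps and the update rule, knows nothing about the model."""

    num_steps: int
    step_size: float

    @property
    def timesteps(self) -> List[int]:
        # Count down from the last step to 0.
        return list(range(self.num_steps - 1, -1, -1))

    def step(self, residual: float, sample: float) -> float:
        # Move the sample against the predicted residual.
        return sample - self.step_size * residual


@dataclass
class ToyPipeline:
    """Hypothetical pipeline: combines a model and a scheduler into one call."""

    model: Callable[[float, int], float]
    scheduler: ToyScheduler

    def __call__(self, sample: float) -> float:
        for t in self.scheduler.timesteps:
            residual = self.model(sample, t)
            sample = self.scheduler.step(residual, sample)
        return sample


def toy_model(sample: float, t: int) -> float:
    """Stand-in for a trained network: the residual is the distance from a target of 1.0."""
    return sample - 1.0


# Same model, two schedulers: fewer steps is faster, more steps lands closer to the target.
fast = ToyPipeline(toy_model, ToyScheduler(num_steps=5, step_size=0.5))
slow = ToyPipeline(toy_model, ToyScheduler(num_steps=50, step_size=0.5))

fast_result = fast(4.0)
slow_result = slow(4.0)
```

Swapping the scheduler changes the speed vs. quality trade-off while the model stays untouched, which is the design choice the building-block structure is meant to support.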

Quickstart

In order to get started, we recommend taking a look at two notebooks:

The Diffusers notebook, which showcases an end-to-end example of usage for diffusion models, schedulers and pipelines. Take a look at this notebook to learn how to use the pipeline abstraction, which takes care of everything (model, scheduler, noise handling) for you, but also to get an understanding of each independent building block in the library.
The Training diffusers notebook, which summarizes diffuser model training methods. This notebook takes a step-by-step approach to training your diffuser model on an image dataset, with explanatory graphics.

Installation

pip install diffusers==0.1.2  # pins the release described on this page
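
Because the example code on this page depends on the API of the release you install, it can help to confirm which version pip actually resolved. A minimal sketch using only the Python standard library (Python 3.8+); the helper name `installed_version` is made up for illustration:

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Prints the version string if diffusers is installed, otherwise None.
print(installed_version("diffusers"))
```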

1. `diffusers` as a toolbox for schedulers and models

diffusers is more modularized than transformers. The idea is that researchers and engineers can easily use only parts of the library for their own use cases. It could become a central place for all kinds of models, schedulers, training utils and processors that one can mix and match for one's own use case. Both models and schedulers should be loadable from and saveable to the Hub.

For more examples see schedulers and models

Example for Unconditional Image generation DDPM:

import torch
from diffusers import UNet2DModel, DDIMScheduler
import PIL.Image
import numpy as np
import tqdm

torch_device = "cuda" if torch.cuda.is_available() else "cpu"

# 1. Load models
scheduler = DDIMScheduler.from_config("fusing/ddpm-celeba-hq", tensor_format="pt")
unet = UNet2DModel.from_pretrained("fusing/ddpm-celeba-hq", ddpm=True).to(torch_device)

# 2. Sample gaussian noise
generator = torch.manual_seed(23)
unet.image_size = unet.resolution
image = torch.randn(
    (1, unet.in_channels, unet.image_size, unet.image_size),
    generator=generator,
)
image = image.to(torch_device)

# 3. Denoise
num_inference_steps = 50
eta = 0.0  # <- deterministic sampling
scheduler.set_timesteps(num_inference_steps)

for t in tqdm.tqdm(scheduler.timesteps):
    # 1. predict noise residual
    with torch.no_grad():
        residual = unet(image, t)["sample"]

    # 2. compute previous image from the predicted residual
    prev_image = scheduler.step(residual, t, image, eta)["prev_sample"]

    # 3. set current image to prev_image: x_t -> x_t-1
    image = prev_image

# 4. process image to PIL
image_processed = image.cpu().permute(0, 2, 3, 1)  # NCHW -> NHWC
# map [-1, 1] to [0, 255]; clamp so the uint8 cast below cannot wrap around
image_processed = ((image_processed + 1.0) * 127.5).clamp(0, 255)
image_processed = image_processed.numpy().astype(np.uint8)
image_pil = PIL.Image.fromarray(image_processed[0])

# 5. save image
image_pil.save("generated_image.png")
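
In the example above, `scheduler.set_timesteps(num_inference_steps)` reduces sampling to 50 denoising steps. Conceptually, this means visiting only a subset of the timesteps the model was trained on. The sketch below shows one simple way to pick such a subset, assuming 1000 training steps and evenly spaced selection. The function name is hypothetical, and the library's actual selection rule may differ.

```python
def subsample_timesteps(num_train_steps: int, num_inference_steps: int) -> list:
    """Pick evenly spaced training timesteps, ordered from noisiest to cleanest."""
    if not 0 < num_inference_steps <= num_train_steps:
        raise ValueError("num_inference_steps must be between 1 and num_train_steps")
    stride = num_train_steps // num_inference_steps
    return [i * stride for i in range(num_inference_steps)][::-1]


# Assumed setup: 1000 training steps, 50 inference steps (stride of 20).
timesteps = subsample_timesteps(num_train_steps=1000, num_inference_steps=50)
```

Fewer inference steps mean fewer model evaluations, which is the speed side of the speed vs. quality trade-off mentioned earlier.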

Example for Unconditional Image generation LDM:

In the works

For the first release, 🤗 Diffusers focuses on text-to-image diffusion techniques. However, diffusers can be used for much more than that! Over the upcoming releases, we'll be focusing on:

Diffusers for audio
Diffusers for reinforcement learning (initial work happening in https://github.com/huggingface/diffusers/pull/105).
Diffusers for video generation
Diffusers for molecule generation (initial work happening in https://github.com/huggingface/diffusers/pull/54)

A few pipeline components are already being worked on, namely:

BDDMPipeline for spectrogram-to-sound vocoding
GLIDEPipeline to support OpenAI's GLIDE model
Grad-TTS for text to audio generation / conditional audio generation

We want diffusers to be a toolbox useful for diffusion models in general; if you find yourself limited in any way by the current API, or would like to see additional models, schedulers, or techniques, please open a GitHub issue mentioning what you would like to see.

Credits

This library concretizes previous work by many different authors and would not have been possible without their great research and implementations. We'd like to thank, in particular, the following implementations which have helped us in our development and without which the API could not have been as polished today:

@CompVis' latent diffusion models library, available here
@hojonathanho's original DDPM implementation, available here, as well as the extremely useful translation into PyTorch by @pesser, available here
@ermongroup's DDIM implementation, available here.
@yang-song's Score-VE and Score-VP implementations, available here

We also want to thank @heejkoo for the very helpful overview of papers, code and resources on diffusion models, available here.

Project details

Release history

0.36.0

Dec 8, 2025

0.35.2

Oct 15, 2025

0.35.1

Aug 20, 2025

0.35.0

Aug 19, 2025

0.34.0

Jun 24, 2025

0.33.1

Apr 10, 2025

0.33.0

Apr 9, 2025

0.32.2

Jan 15, 2025

0.32.1

Dec 25, 2024

0.32.0

Dec 23, 2024

0.31.0

Oct 22, 2024

0.30.3

Sep 17, 2024

0.30.2

Aug 31, 2024

0.30.1

Aug 24, 2024

0.30.0

Aug 7, 2024

0.29.2

Jun 27, 2024

0.29.1

Jun 20, 2024

0.29.0

Jun 12, 2024

0.28.2

Jun 4, 2024

0.28.1

Jun 4, 2024

0.28.0

May 27, 2024

0.27.2

Mar 20, 2024

0.27.1

Mar 19, 2024

0.27.0

Mar 14, 2024

0.26.3

Feb 13, 2024

0.26.2

Feb 6, 2024

0.26.1

Feb 2, 2024

0.26.0

Jan 31, 2024

0.25.1

Jan 17, 2024

0.25.0

Dec 27, 2023

0.24.0

Nov 29, 2023

0.23.1

Nov 16, 2023

0.23.0

Nov 9, 2023

0.22.3

Nov 8, 2023

0.22.2

Nov 7, 2023

0.22.1

Nov 6, 2023

0.22.0

Nov 6, 2023

0.21.4

Sep 29, 2023

0.21.3

Sep 27, 2023

0.21.2

Sep 18, 2023

0.21.1

Sep 14, 2023

0.21.0

Sep 13, 2023

0.20.2

Aug 31, 2023

0.20.1

Aug 28, 2023

0.20.0

Aug 17, 2023

0.19.3

Jul 30, 2023

0.19.2

Jul 28, 2023

0.19.1

Jul 27, 2023

0.19.0

Jul 26, 2023

0.18.2

Jul 11, 2023

0.18.1

Jul 7, 2023

0.18.0

Jul 6, 2023

0.17.1

Jun 12, 2023

0.17.0

Jun 8, 2023

0.16.1

Apr 28, 2023

0.16.0

Apr 26, 2023

0.15.1

Apr 17, 2023

0.15.0

Apr 12, 2023

0.14.0

Mar 3, 2023

0.13.1

Feb 20, 2023

0.13.0

Feb 17, 2023

0.12.1

Jan 27, 2023

0.12.0

Jan 25, 2023

0.11.1

Dec 20, 2022

0.11.0

Dec 19, 2022

0.10.2

Dec 9, 2022

0.10.1

Dec 9, 2022

0.10.0

Dec 8, 2022

0.9.0

Nov 25, 2022

0.8.1

Nov 24, 2022

0.8.0

Nov 23, 2022

0.7.2

Nov 5, 2022

0.7.1

Nov 4, 2022

0.7.0

Nov 3, 2022

0.6.0

Oct 19, 2022

0.5.1

Oct 13, 2022

0.5.0

Oct 13, 2022

0.4.2

Oct 11, 2022

0.4.1

Oct 7, 2022

0.4.0

Oct 6, 2022

0.3.0

Sep 8, 2022

0.2.4

Aug 22, 2022

0.2.3

Aug 22, 2022

0.2.2

Aug 16, 2022

0.2.1

Aug 16, 2022

0.2.0

Aug 16, 2022

0.1.3

Jul 28, 2022

0.1.2 (this version)

Jul 21, 2022

0.1.1

Jul 21, 2022

0.1.0

Jul 21, 2022

0.0.4

Jun 15, 2022

0.0.3

Jun 15, 2022

0.0.2

Jun 7, 2022

0.0.1

May 30, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

diffusers-0.1.2.tar.gz (76.0 kB view details)

Uploaded Jul 21, 2022 Source

Built Distribution

diffusers-0.1.2-py3-none-any.whl (94.8 kB view details)

Uploaded Jul 21, 2022 Python 3

File details

Details for the file diffusers-0.1.2.tar.gz.

File metadata

Download URL: diffusers-0.1.2.tar.gz
Upload date: Jul 21, 2022
Size: 76.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.0 CPython/3.9.7

File hashes

Hashes for diffusers-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`054f5550bd20645a1bf7c238661e2b595d98be68beba258d1113681fed7dd7bb`
MD5	`88f6f0da1015578a4925ae380a6c6dd6`
BLAKE2b-256	`b58351ec5b5fdf0b4a21cb60e7e0f93b847e9b613603e5ae48d3c5fc1e5e36b2`

See more details on using hashes here.
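
A downloaded file can be checked against the SHA256 digests listed on this page with the standard library alone. The sketch below is one way to do it; the helper names `sha256_of` and `matches_published_hash` are made up for illustration.

```python
import hashlib


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, after downloading the source distribution into the current directory, `matches_published_hash("diffusers-0.1.2.tar.gz", "054f5550bd20645a1bf7c238661e2b595d98be68beba258d1113681fed7dd7bb")` should return `True` if the file is intact.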

File details

Details for the file diffusers-0.1.2-py3-none-any.whl.

File metadata

Download URL: diffusers-0.1.2-py3-none-any.whl
Upload date: Jul 21, 2022
Size: 94.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.0 CPython/3.9.7

File hashes

Hashes for diffusers-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7fdfbcbfa3e9cc9e8e54ca7cab630a16aa8e85c112fa492b4a80de2866a19e5c`
MD5	`14fc92d5a99fb64cd76e7fb912b717dc`
BLAKE2b-256	`c5309c04a27e3d61a3ca855743e17568e624daee20a988536b970971c22be45b`

diffusers 0.1.2

