A universal Stable-Diffusion toolbox

These details have not been verified by PyPI

Project links

Homepage

Project description

HCP-Diffusion

Introduction

HCP-Diffusion is a toolbox for Stable Diffusion models based on 🤗 Diffusers. It facilitates flexiable configurations and component support for training, in comparison with webui and sd-scripts.

This toolbox supports Colossal-AI, which can significantly reduce GPU memory usage.

HCP-Diffusion can unify existing training methods for text-to-image generation (e.g., Prompt-tuning, Textual Inversion, DreamArtist, Fine-tuning, DreamBooth, LoRA, ControlNet, etc) and model structures through a single .yaml configuration file.

The toolbox has also implemented an upgraded version of DreamArtist with LoRA, named DreamArtist++, for one-shot text-to-image generation. Compared to DreamArtist, DreamArtist++ is more stable with higher image quality and generation controllability, and faster training speed.

Features

Layer-wise LoRA (with Conv2d)
Layer-wise fine-tuning
Layer-wise model ensemble
Prompt-tuning with multiple words
DreamArtist and DreamArtist++
Aspect Ratio Bucket (ARB) with automatic clustering
Multiple datasets with multiple data sources
Image attention mask
Word attention multiplier
Custom words that occupy multiple words
Maximum sentence length expansion
🤗 Accelerate
Colossal-AI
xFormers for UNet and text-encoder
CLIP skip
Tag shuffle and dropout
Safetensors support
Controlnet (support training)
Min-SNR loss
Custom optimizer (Lion, DAdaptation, pytorch-optimizer, ...)
Custom lr scheduler
SDXL support

Install

Install with pip:

pip install hcpdiff
# Start a new project and make initialization
hcpinit

Install from source:

git clone https://github.com/7eu7d7/HCP-Diffusion.git
cd HCP-Diffusion
pip install -e .
# Modified based on this project or start a new project and make initialization
## hcpinit

To use xFormers to reduce VRAM usage and accelerate training:

# use conda
conda install xformers -c xformers

# use pip
pip install xformers>=0.0.17

User guidance

Training

Training scripts based on 🤗 Accelerate or Colossal-AI are provided.

For 🤗 Accelerate, you may need to configure the environment before launching the scripts.
For Colossal-AI, you can use torchrun to launch the scripts.

# with Accelerate
accelerate launch -m hcpdiff.train_ac --cfg cfgs/train/cfg_file.yaml
# with Accelerate and only one GPU
accelerate launch -m hcpdiff.train_ac_single --cfg cfgs/train/cfg_file.yaml
# with Colossal-AI
# pip install colossalai
torchrun --nproc_per_node 1 -m hcpdiff.train_colo --cfg cfgs/train/cfg_file.yaml

Inference

python -m hcpdiff.visualizer --cfg cfgs/infer/cfg.yaml pretrained_model=pretrained_model_path \
        prompt='positive_prompt' \
        neg_prompt='negative_prompt' \
        seed=42

Conversion of Stable Diffusion models

The framework is based on 🤗 Diffusers. So it needs to convert the original Stable Diffusion model into a supported format using the scripts provided by 🤗 Diffusers.

Download the config file
Convert models based on config file

python -m hcpdiff.tools.sd2diffusers \
    --checkpoint_path "path_to_stable_diffusion_model" \
    --original_config_file "path_to_config_file" \
    --dump_path "save_directory" \
    [--extract_ema] # Extract EMA model
    [--from_safetensors] # Whether the original model is in safetensors format
    [--to_safetensors] # Whether to save to safetensors format

Convert VAE:

python -m hcpdiff.tools.sd2diffusers \
    --vae_pt_path "path_to_VAE_model" \
    --original_config_file "path_to_config_file" \
    --dump_path "save_directory"
    [--from_safetensors]

Tutorials

Contributing

You are welcome to contribute more models and features to this toolbox!

Team

This toolbox is maintained by HCP-Lab, SYSU.

Citation

@article{DBLP:journals/corr/abs-2211-11337,
  author    = {Ziyi Dong and
               Pengxu Wei and
               Liang Lin},
  title     = {DreamArtist: Towards Controllable One-Shot Text-to-Image Generation
               via Positive-Negative Prompt-Tuning},
  journal   = {CoRR},
  volume    = {abs/2211.11337},
  year      = {2022},
  doi       = {10.48550/arXiv.2211.11337},
  eprinttype = {arXiv},
  eprint    = {2211.11337},
}

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.9.1

Feb 20, 2024

0.9.0

Dec 7, 2023

0.7.0

Oct 28, 2023

0.6.2

Oct 17, 2023

0.6.1

Sep 26, 2023

0.6.0

Aug 29, 2023

0.5.5

Jul 21, 2023

0.5.4

Jul 20, 2023

0.4.2

May 22, 2023

0.4.1

May 12, 2023

0.4.0

May 11, 2023

0.3.7

May 7, 2023

0.3.6

May 7, 2023

0.3.5

May 6, 2023

0.3.4

May 6, 2023

0.3.3

May 6, 2023

0.3.2

May 6, 2023

0.3.1

May 6, 2023

0.3.0

May 6, 2023

0.2.2

Apr 29, 2023

0.2.1

Apr 29, 2023

0.2.0

Apr 25, 2023

0.1.6

Apr 13, 2023

0.1.5

Apr 12, 2023

0.1.4

Apr 12, 2023

0.1.3

Apr 12, 2023

0.1.2

Apr 12, 2023

0.1.1

Apr 12, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hcpdiff-0.9.1.tar.gz (118.8 kB view hashes)

Uploaded Feb 20, 2024 Source

Built Distribution

hcpdiff-0.9.1-py3-none-any.whl (179.1 kB view hashes)

Uploaded Feb 20, 2024 Python 3

Hashes for hcpdiff-0.9.1.tar.gz

Hashes for hcpdiff-0.9.1.tar.gz
Algorithm	Hash digest
SHA256	`8a3b7ce5bf537e0b89d54b24dc3ffebcfb0c20824ca4078b2ca1f33cfd19e0cf`
MD5	`574fd3c0425c555d42df99c18213f182`
BLAKE2b-256	`d949946bf4ffcc36c9414877ba47f7a4413d2e2072d37b35444a23383d5faecb`

Hashes for hcpdiff-0.9.1-py3-none-any.whl

Hashes for hcpdiff-0.9.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6c6b0fcbe2be13e675335320b9c688142585df20b711b64385e6b5fa6eb82053`
MD5	`cae1f73c902705565564c537d17ee4a5`
BLAKE2b-256	`bd152ec05feb337bc2cbf6555b2bb009ba73dcca5a8a5cbe3e85ae9809796141`