
StableDiffusionInpaintingFineTune
=================================

This project provides a toolkit for fine-tuning the Stable Diffusion model for inpainting tasks (filling in masked regions of an image) using the PyTorch and Hugging Face Diffusers libraries.

Requirements
------------

Before starting, you need to install the following libraries:

  • torch
  • diffusers
  • transformers
  • accelerate
  • huggingface_hub
  • pillow (imported as ``PIL``)
  • numpy
  • tqdm
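
These can typically be installed with ``pip``; note that the ``PIL`` import is provided by the ``pillow`` package:

.. code-block:: bash

   pip install torch diffusers transformers accelerate huggingface_hub pillow numpy tqdm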

Description
-----------

StableDiffusionInpaintingFineTune
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

This class is responsible for fine-tuning the Stable Diffusion model for the inpainting task. It supports training both the text encoder and the UNet, and exposes a number of settings that control the training process (see the constructor parameters below).

Constructor
^^^^^^^^^^^

.. code-block:: python

   __init__(self, pretrained_model_name_or_path, resolution, center_crop, ...)

- **pretrained_model_name_or_path**: The path or name of the pre-trained model.
- **resolution**: The resolution of the images.
- **center_crop**: Whether to apply center cropping during data preparation.
- **train_text_encoder**: Whether to train the text encoder.
- **dataset**: The dataset object providing the training data.
- **learning_rate**: The initial learning rate.
- **max_training_steps**: The maximum number of training steps.
- **save_steps**: The number of steps between saving checkpoints.
- **train_batch_size**: The training batch size.
- **gradient_accumulation_steps**: The number of steps over which gradients are accumulated before an optimizer update.
- **mixed_precision**: The mixed-precision mode ("fp16", "bf16", or None).
- **gradient_checkpointing**: Whether to enable gradient checkpointing to reduce memory usage.
- **use_8bit_adam**: Whether to use the 8-bit Adam optimizer.
- **seed**: The random seed for reproducibility.
- **output_dir**: The directory for saving results.
- **push_to_hub**: Whether to upload the results to the Hugging Face Hub.
- **repo_id**: The repository ID on Hugging Face Hub.

Methods
^^^^^^^

- **prepare_mask_and_masked_image(image, mask)**: Prepares the mask and the masked image used during training (a usage sketch is shown after this list).
- **random_mask(im_shape, ratio=1, mask_full_image=False)**: Generates a random mask for an image of the given shape.
- **load_args_for_training()**: Loads the necessary components of the model for training.
- **collate_fn(examples)**: Forms a batch of data for the model.
- **__call__(self, *args, **kwargs)**: The main method for running the training process.
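
The two data-preparation helpers above can be combined to build a single training example. The snippet below is only a rough sketch: it assumes the helpers are callable on a class instance and that ``prepare_mask_and_masked_image`` returns the mask followed by the masked image, neither of which is guaranteed by this documentation.

.. code-block:: python

   from PIL import Image

   # `model` is assumed to be an already-constructed
   # StableDiffusionInpaintingFineTune instance (hypothetical setup).
   image = Image.open("example.jpg").convert("RGB").resize((512, 512))

   # Generate a random mask covering part of the image.
   mask = model.random_mask(image.size, ratio=1, mask_full_image=False)

   # Build the mask / masked-image pair used during training.
   # The return order (mask, masked_image) is an assumption.
   mask_tensor, masked_image = model.prepare_mask_and_masked_image(image, mask)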

Usage
-----

To start training, create an instance of the ``StableDiffusionInpaintingFineTune`` class and call it, passing any necessary arguments; its ``__call__`` method runs the training process.

.. code-block:: python

   model = StableDiffusionInpaintingFineTune(
       pretrained_model_name_or_path="path_to_model",
       resolution=512,
       center_crop=True,
       ...
   )

   model()
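
For reference, a fuller call with the constructor parameters listed above might look like the following. The values are illustrative only, the model identifier is just an example, and ``train_dataset`` stands in for whatever dataset object you provide:

.. code-block:: python

   model = StableDiffusionInpaintingFineTune(
       pretrained_model_name_or_path="runwayml/stable-diffusion-inpainting",  # example model ID
       resolution=512,
       center_crop=True,
       train_text_encoder=False,
       dataset=train_dataset,               # your dataset object
       learning_rate=5e-6,
       max_training_steps=400,
       save_steps=100,
       train_batch_size=1,
       gradient_accumulation_steps=1,
       mixed_precision="fp16",
       gradient_checkpointing=True,
       use_8bit_adam=True,
       seed=42,
       output_dir="./inpainting-finetuned",
       push_to_hub=False,
       repo_id=None,
   )

   model()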

License
-------

The project is distributed under the MIT License.
