AI imagined images. Pythonic generation of stable diffusion images.

Project description

ImaginAIry 🤖🧠

AI imagined images. Pythonic generation of stable diffusion images.

"just works" on Linux and OSX(M1).

Examples

>> pip install imaginairy
>> imagine "a scenic landscape" "a photo of a dog" "photo of a fruit bowl" "portrait photo of a freckled woman"

Console Output

🤖🧠 received 4 prompt(s) and will repeat them 1 times to create 4 images.
Loading model onto mps backend...
Generating 🖼  : "a scenic landscape" 512x512px seed:557988237 prompt-strength:7.5 steps:40 sampler-type:PLMS
    PLMS Sampler: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 40/40 [00:29<00:00,  1.36it/s]
    🖼  saved to: ./outputs/000001_557988237_PLMS40_PS7.5_a_scenic_landscape.jpg
Generating 🖼  : "a photo of a dog" 512x512px seed:277230171 prompt-strength:7.5 steps:40 sampler-type:PLMS
    PLMS Sampler: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 40/40 [00:28<00:00,  1.41it/s]
    🖼  saved to: ./outputs/000002_277230171_PLMS40_PS7.5_a_photo_of_a_dog.jpg
Generating 🖼  : "photo of a fruit bowl" 512x512px seed:639753980 prompt-strength:7.5 steps:40 sampler-type:PLMS
    PLMS Sampler: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 40/40 [00:28<00:00,  1.40it/s]
    🖼  saved to: ./outputs/000003_639753980_PLMS40_PS7.5_photo_of_a_fruit_bowl.jpg
Generating 🖼  : "portrait photo of a freckled woman" 512x512px seed:500686645 prompt-strength:7.5 steps:40 sampler-type:PLMS
    PLMS Sampler: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 40/40 [00:29<00:00,  1.37it/s]
    🖼  saved to: ./outputs/000004_500686645_PLMS40_PS7.5_portrait_photo_of_a_freckled_woman.jpg

Tiled Images

>> imagine  "gold coins" "a lush forest" "piles of old books" leaves --tile

Image-to-Image

>> imagine "portrait of a smiling lady. oil painting" --init-image girl_with_a_pearl_earring.jpg

Features

It makes images from text descriptions! 🎉
Generate images either in code or from command line.
It just works. Proper requirements are installed. model weights are automatically downloaded. No huggingface account needed. (if you have the right hardware... and aren't on windows)
Noisy logs are gone (which was surprisingly hard to accomplish)
WeightedPrompts let you smash together separate prompts (cat-dog)
Tile Mode creates tileable images
Prompt metadata saved into image file metadata

How To

from imaginairy import imagine_images, imagine_image_files, ImaginePrompt, WeightedPrompt

prompts = [
    ImaginePrompt("a scenic landscape", seed=1),
    ImaginePrompt("a bowl of fruit"),
    ImaginePrompt([
       WeightedPrompt("cat", weight=1),
       WeightedPrompt("dog", weight=1),
    ])
]
for result in imagine_images(prompts):
    # do something
    result.save("my_image.jpg")
    
# or

imagine_image_files(prompts, outdir="./my-art")

Requirements

~10 gb space for models to download
A decent computer with either a CUDA supported graphics card or M1 processor.

Improvements from CompVis

img2img actually does # of steps you specify
performance optimizations

Models Used

Not Supported

a web interface. this is a python library

Todo

performance optimizations
deploy to pypi
add tests
set up ci (test/lint/format)
add docs
notify https://github.com/CompVis/stable-diffusion/issues/25
remove yaml config
delete more unused code
Interface improvements
- ✅ init-image at command line
- prompt expansion
Image Generation Features
- upscaling
  - https://github.com/lowfuel/progrock-stable
- face improvements
  - gfpgan - https://github.com/TencentARC/GFPGAN
  - codeformer - https://github.com/sczhou/CodeFormer
- image describe feature - https://replicate.com/methexis-inc/img2prompt
- outpainting
- inpainting
  - https://github.com/andreas128/RePaint
- add more sampling methods?
- img2img but keeps img stable
  - https://www.reddit.com/r/StableDiffusion/comments/xboy90/a_better_way_of_doing_img2img_by_finding_the/
  - https://gist.github.com/trygvebw/c71334dd127d537a15e9d59790f7f5e1
- img2img for plms?
- images as actual prompts instead of just init images
- cross-attention control:
  - https://github.com/bloc97/CrossAttentionControl/blob/main/CrossAttention_Release_NoImages.ipynb
- guided generation https://colab.research.google.com/drive/1dlgggNa5Mz8sEAGU0wFCHhGLFooW_pf1#scrollTo=UDeXQKbPTdZI
- tiling
- output show-work videos
- image variations https://github.com/lstein/stable-diffusion/blob/main/VARIATIONS.md
- textual inversion
  - https://www.reddit.com/r/StableDiffusion/comments/xbwb5y/how_to_run_textual_inversion_locally_train_your/
  - https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/sd_textual_inversion_training.ipynb#scrollTo=50JuJUM8EG1h
- zooming videos? a la disco diffusion
- fix saturation at high CFG https://www.reddit.com/r/StableDiffusion/comments/xalo78/fixing_excessive_contrastsaturation_resulting/

Project details

Release history Release notifications | RSS feed

15.0.0

Sep 22, 2024

14.3.0

Apr 18, 2024

14.2.0

Mar 17, 2024

14.1.1

Jan 18, 2024

14.1.0

Jan 18, 2024

14.0.4

Jan 5, 2024

14.0.3

Jan 4, 2024

14.0.2

Jan 4, 2024

14.0.1

Jan 3, 2024

14.0.0

Jan 3, 2024

14.0.0b9 pre-release

Jan 2, 2024

14.0.0b8 pre-release

Jan 2, 2024

14.0.0b7 pre-release

Dec 29, 2023

14.0.0b6 pre-release

Dec 28, 2023

14.0.0b5 pre-release

Dec 3, 2023

14.0.0b4 pre-release

Nov 26, 2023

14.0.0b3 pre-release

Nov 23, 2023

14.0.0b2 pre-release

Nov 23, 2023

14.0.0b1 pre-release

Nov 23, 2023

13.2.1

Sep 29, 2023

13.2.0

Sep 17, 2023

13.1.0

Sep 9, 2023

13.0.1

May 27, 2023

13.0.0

May 22, 2023

13.0b0 pre-release

May 22, 2023

12.0.3

May 13, 2023

12.0.2

May 6, 2023

12.0.1

May 5, 2023

12.0.0

May 5, 2023

11.1.1

Apr 18, 2023

11.1.0

Mar 18, 2023

11.0.0

Mar 1, 2023

10.2.0

Feb 25, 2023

10.1.1

Feb 23, 2023

10.1.0

Feb 23, 2023

10.0.1

Feb 17, 2023

10.0.0

Feb 17, 2023

9.0.2

Feb 6, 2023

9.0.1

Feb 6, 2023

9.0.0

Feb 5, 2023

8.3.1

Jan 29, 2023

8.3.0

Jan 29, 2023

8.2.0

Jan 27, 2023

8.1.0

Jan 26, 2023

8.0.5

Jan 24, 2023

8.0.4

Jan 23, 2023

8.0.3

Jan 23, 2023

8.0.2

Jan 22, 2023

8.0.1

Jan 22, 2023

8.0.0

Jan 22, 2023

7.6.0

Jan 19, 2023

7.4.3

Jan 16, 2023

7.4.2

Jan 16, 2023

7.4.1

Jan 16, 2023

7.4.0

Jan 16, 2023

7.3.0

Dec 22, 2022

7.2.0

Dec 20, 2022

7.1.1

Dec 19, 2022

7.1.0

Dec 7, 2022

7.0.0

Dec 2, 2022

6.1.2

Nov 28, 2022

6.1.1

Nov 27, 2022

6.1.0

Nov 27, 2022

6.0.0a0 pre-release

Nov 24, 2022

5.1.0

Nov 16, 2022

5.0.1

Nov 13, 2022

5.0.0

Nov 13, 2022

4.1.0

Oct 24, 2022

4.0.0

Oct 22, 2022

3.0.1

Oct 13, 2022

3.0.0

Oct 10, 2022

2.4.1

Oct 9, 2022

2.3.1

Oct 7, 2022

2.3.0

Oct 6, 2022

2.2.1

Oct 4, 2022

2.2.0

Oct 3, 2022

2.1.0

Sep 28, 2022

2.0.3

Sep 28, 2022

2.0.2

Sep 28, 2022

2.0.1

Sep 26, 2022

2.0.0

Sep 26, 2022

1.6.2

Sep 22, 2022

1.6.1

Sep 22, 2022

1.6.0

Sep 22, 2022

1.5.4

Sep 21, 2022

1.5.3

Sep 21, 2022

1.5.1

Sep 20, 2022

1.4.0

Sep 18, 2022

1.3.0

Sep 18, 2022

1.2.0

Sep 16, 2022

1.1.4

Sep 15, 2022

1.1.3

Sep 15, 2022

1.1.2

Sep 15, 2022

1.1.1

Sep 15, 2022

1.1.0

Sep 14, 2022

1.0.2

Sep 13, 2022

1.0.1

Sep 13, 2022

0.7.3

Sep 12, 2022

This version

0.7.2

Sep 12, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

imaginAIry-0.7.2-py3-none-any.whl (67.2 kB view hashes)

Uploaded Sep 12, 2022 Python 3

Hashes for imaginAIry-0.7.2-py3-none-any.whl

Hashes for imaginAIry-0.7.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`53f0946f15ff4ec0a2d4c530d6911f67b225621565bce4b040dc7c170a8122db`
MD5	`ff773beeefa87bbd5f66cf0f1a490270`
BLAKE2b-256	`d19dfb123212b31404bbff4ae40ab619f9cd0eef391a97e2af3aa12e76e8ddf1`