A dummy project to generate images from song lyrics using Latent Stable Diffusion

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Diffuse My Lyrics!

🎶 ➡ 🧠 ➡ 🖼️

An easy way to generate images from lyrics

Description

This is a simple application that uses the spectacular Stable Diffusion model to generate images from song lyrics. So just install the library on a Colab notebook, choose your favorite song and sit back and wait for the visual interpretations of each verse!

In the Cool Outputs section (only in repository README.md), I have shown some interpretations of verses that I found very cool 😏😏😏

How To Use

First, simply install the python package in a colab notebook (right now, it's only for Colab, but extending it to general use is trivial ... as long as you have a good GPU 😅 ).

# Install the latest version of the package
$ pip install -U diffuse-my-lyrics

Now, suppose we want to feed the model with the following verses, belonging to The End, a magnificent piece by The Doors.

Ride the King's highway, baby
Weird scenes inside the gold mine
Ride the highway west, baby
Ride the snake, ride the snake
To the lake, the ancient lake, baby
The snake is long, seven miles
Ride the snake, he's old, and his skin is cold

After uploading this lyrics to the colab notebook (I am using a .txt extension), we just need to run the following commands.

# Import the Lyrics2Images class
from diffuse_my_lyrics import Lyrics2Images

l2i = Lyrics2Images(num_inference_steps=100) # In this case, we are indicating the model to run for 100 steps
l2i.run(input_path="/content/my_favourite_song.txt", output_path="my_favourite_song_folder")

After running Lyrics2Images, a folder will be created in your colab current directory (my_favourite_song_folder), where a series of images will be generated (one image for each verse of the lyrics).

One it's finished, simply zip the folder and download it!!

import shutil
shutil.make_archive("zipped_folder", 'zip', "my_favourite_song_folder")

Arguments

model_id - The model id. By default CompVis/stable-diffusion-v1-4
revision - The model revision. By default fp16
torch_dtype - The Pytorch dtype. By default torch.float16
prompt - This parameter is useful if you want to add additional information to the verse. For example, digital art, HQ, etc. By default digital art
num_inference_steps - The number of steps. By default 50
use_auth_token - This parameter determines whether to use an authentication token for Hugging Face. By default True

Next Steps

Add support for generating several images instead of just one.
Make the library usable in another environments (not just Colab)
Create argument for using a manual seed
Add custom size of output images

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.0.2

Mar 15, 2023

0.0.1

Sep 2, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

diffuse_my_lyrics-0.0.2.tar.gz (4.4 kB view hashes)

Uploaded Mar 15, 2023 Source

Built Distribution

diffuse_my_lyrics-0.0.2-py3-none-any.whl (4.3 kB view hashes)

Uploaded Mar 15, 2023 Python 3

Hashes for diffuse_my_lyrics-0.0.2.tar.gz

Hashes for diffuse_my_lyrics-0.0.2.tar.gz
Algorithm	Hash digest
SHA256	`bee3719345d19b2117ca2ead19ed1a2e4fbb05203f13d8a4ed166fcfa264d22c`
MD5	`def54e1401afc76ecd5755cb97d35584`
BLAKE2b-256	`372770a81d067685d285a0def62d7e2c53b24a06ddc978fea6119e9465272c23`

Hashes for diffuse_my_lyrics-0.0.2-py3-none-any.whl

Hashes for diffuse_my_lyrics-0.0.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7d2ad456078102168bde87be601f491004ea4d1796a80a3986fb574d356a4806`
MD5	`7a556ad2d1b099dc5cf6912af9bfc1b5`
BLAKE2b-256	`19eca68d0e486ef85d0d791fa4ee950c9c80d28532cca777b749db705ed106c1`