Skip to main content

Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech.

Project description

Fabler

CI PyPI Status Badge PyPI - Python Version Code style: black License: MIT


A multimodal AI story teller, built with Stable Diffusion, GPT, and neural text-to-speech (TTS).

Given a prompt as an opening line of a story, GPT writes the rest of the plot; Stable Diffusion draws an image for each sentence; a TTS model narrates each line, resulting in a fully animated video of a short story, replete with audio and visuals.

demo animation

Quickstart

Install from PyPi

$ pip install fabler

Install from Source

  1. Clone the repository
$ git clone https://github.com/christopherwoodall/fabler.git
  1. Install package requirements.
$ pip install --upgrade pip wheel
$ pip install -e ".[developer]"
  1. Run the demo. The final video will be saved as /out/out.mp4, alongside other intermediate images, audio files, and subtitles.
$ fabler --scene=scene.yaml

4 Alternatively with make:

make install && make run

Usage

  1. Load the model with defaults.
from fabler import Fabler

story_teller = Fabler.from_defaults()
story_teller.generate(...)
  1. Alternatively, configure the model with custom settings.
from fabler import Fabler, FablerConfig

config = FablerConfig(
    writer="gpt2-large",
    painter="CompVis/stable-diffusion-v1-4",
    max_new_tokens=100,
    diffusion_prompt_prefix="Van Gogh style",
)

story_teller = Fabler(config)
story_teller.generate(...)

License

Released under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

fabler-0.0.3-py3-none-any.whl (90.9 kB view details)

Uploaded Python 3

File details

Details for the file fabler-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: fabler-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 90.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.1

File hashes

Hashes for fabler-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 62310cc8a5de6fa8d630dbaac3b29111b200972845dc279c823df5d7c9ee5001
MD5 33bef4c94873f568e623cd14d8ca4956
BLAKE2b-256 aa00da0a7a717f24dcc160f903be8a632d36abf13d1c1f082b03bdd4b8bb1587

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page