Skip to main content

No project description provided

Project description

Project logo

Document-to-podcast: a Blueprint by Mozilla.ai for generating podcasts from documents using local AI

Docs Tests Ruff

This blueprint demonstrate how you can use open-source models & tools to convert input documents into a podcast featuring two speakers. It is designed to work on most local setups, meaning no external API calls or GPU access is required. This makes it more accessible and privacy-friendly by keeping everything local.

document-to-podcast Diagram

👉 📖 For more detailed guidance on using this project, please visit our Docs.

👉 🔨 Built with

👉 🧠 Check the Supported Models.

Quick-start

Get started right away using one of the options below:

Google Colab HuggingFace Spaces GitHub Codespaces
Try on Colab Try on Spaces Try on Codespaces

You can also install and use the blueprint locally:

Command Line Interface

pip install document-to-podcast
document-to-podcast \
--input_file "example_data/Mozilla-Trustworthy_AI.pdf" \
--output_folder "example_data"
--text_to_text_model "Qwen/Qwen2.5-1.5B-Instruct-GGUF/qwen2.5-1.5b-instruct-q8_0.gguf"

Graphical Interface App

git clone https://github.com/mozilla-ai/document-to-podcast.git
cd document-to-podcast
pip install -e .
python -m streamlit run demo/app.py

License

This project is licensed under the Apache 2.0 License. See the LICENSE file for details.

Contributing

Contributions are welcome! To get started, you can check out the CONTRIBUTING.md file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

document_to_podcast-1.4.1.tar.gz (2.8 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

document_to_podcast-1.4.1-py3-none-any.whl (30.4 kB view details)

Uploaded Python 3

File details

Details for the file document_to_podcast-1.4.1.tar.gz.

File metadata

  • Download URL: document_to_podcast-1.4.1.tar.gz
  • Upload date:
  • Size: 2.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for document_to_podcast-1.4.1.tar.gz
Algorithm Hash digest
SHA256 34aa40ad6a173bbc4003369d83f9d6f2ea6aa74338e0bc46ae53fb426324f1ae
MD5 787ee1978baee4d4704617e3fe177150
BLAKE2b-256 d3dd388dc51b020733156e88371c9e84b43dc0e037f4f867647cf69daec7dd16

See more details on using hashes here.

Provenance

The following attestation bundles were made for document_to_podcast-1.4.1.tar.gz:

Publisher: release.yaml on mozilla-ai/document-to-podcast

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file document_to_podcast-1.4.1-py3-none-any.whl.

File metadata

File hashes

Hashes for document_to_podcast-1.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 c773acb149d9116c6b40c5e0ab628b288081ef7a952c86c13008073f3ac2ff16
MD5 172e3e5a95b8874af6102bcdbf7b542b
BLAKE2b-256 bd8d648bb5f87e6ef9e14be9f5cd3e39c4fc7c2224a31baab8705c76e5917df5

See more details on using hashes here.

Provenance

The following attestation bundles were made for document_to_podcast-1.4.1-py3-none-any.whl:

Publisher: release.yaml on mozilla-ai/document-to-podcast

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page