Document-to-podcast: a Blueprint by Mozilla.ai for generating podcasts from documents using local AI

This blueprint demonstrates how you can use open-source models and tools to convert input documents into a podcast featuring two speakers. It is designed to work on most local setups: no external API calls or GPU access are required, which makes it more accessible and privacy-friendly because everything stays local.

document-to-podcast Diagram

👉 📖 For more detailed guidance on using this project, please visit our Docs.

👉 🔨 Built with

👉 🧠 Check the Supported Models.

Quick-start

Get started right away using one of the options below:

  • Google Colab: Try on Colab
  • HuggingFace Spaces: Try on Spaces
  • GitHub Codespaces: Try on Codespaces

You can also install and use the blueprint locally:

Command Line Interface

pip install document-to-podcast
document-to-podcast \
--input_file "example_data/Mozilla-Trustworthy_AI.pdf" \
--output_folder "example_data" \
--text_to_text_model "Qwen/Qwen2.5-1.5B-Instruct-GGUF/qwen2.5-1.5b-instruct-q8_0.gguf"
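
If you would rather drive the command above from a Python script, the following is a minimal sketch. It only reuses the three flags shown in the command; the helper names `build_command` and `run` are hypothetical and are not part of the package. It assumes `document-to-podcast` is already installed and on your `PATH`.

```python
import shlex
import subprocess


def build_command(input_file, output_folder, text_to_text_model):
    """Return the CLI invocation as an argument list (hypothetical helper)."""
    return [
        "document-to-podcast",
        "--input_file", input_file,
        "--output_folder", output_folder,
        "--text_to_text_model", text_to_text_model,
    ]


def run(command):
    """Run the command; raises CalledProcessError on a non-zero exit code."""
    subprocess.run(command, check=True)


command = build_command(
    "example_data/Mozilla-Trustworthy_AI.pdf",
    "example_data",
    "Qwen/Qwen2.5-1.5B-Instruct-GGUF/qwen2.5-1.5b-instruct-q8_0.gguf",
)

# Show the single-line shell equivalent; call run(command) to execute it.
print(shlex.join(command))
```

Passing the arguments as a list avoids shell line-continuation mistakes, since every flag and value is a separate element.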

Graphical Interface App

git clone https://github.com/mozilla-ai/document-to-podcast.git
cd document-to-podcast
pip install -e .
python -m streamlit run demo/app.py

License

This project is licensed under the Apache 2.0 License. See the LICENSE file for details.

Contributing

Contributions are welcome! To get started, you can check out the CONTRIBUTING.md file.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

document_to_podcast-1.4.2.tar.gz (2.8 MB)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

document_to_podcast-1.4.2-py3-none-any.whl (30.4 kB)

Uploaded Python 3

File details

Details for the file document_to_podcast-1.4.2.tar.gz.

File metadata

  • Download URL: document_to_podcast-1.4.2.tar.gz
  • Upload date:
  • Size: 2.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for document_to_podcast-1.4.2.tar.gz
Algorithm Hash digest
SHA256 2a8505f99f3baeacdc6afd3a2c29e29cf71a7bfcf993a307ab096b1e5f21808a
MD5 86ead9998a11c3109dfd174301da6ff3
BLAKE2b-256 0629081daa71babbe6eee5833bf64e5c4aabe8d7c103f045d2dd346770d4cd68

See more details on using hashes here.
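
As an illustration of how the SHA256 digest listed above can be used, here is a minimal sketch that checks a downloaded copy of the source distribution with Python's standard `hashlib`. The helper names `sha256_of` and `matches` are hypothetical, and the sketch assumes the archive was saved to the current directory under its original file name.

```python
import hashlib
import hmac
from pathlib import Path

# SHA256 digest listed above for document_to_podcast-1.4.2.tar.gz
EXPECTED_SHA256 = "2a8505f99f3baeacdc6afd3a2c29e29cf71a7bfcf993a307ab096b1e5f21808a"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected):
    """Compare the file's digest with the expected one, ignoring hex case."""
    return hmac.compare_digest(sha256_of(path), expected.lower())


# Assumed download location; adjust to wherever the archive was saved.
archive = Path("document_to_podcast-1.4.2.tar.gz")
if archive.exists():
    print("digest matches" if matches(archive, EXPECTED_SHA256) else "digest MISMATCH")
```

The same check works for the wheel by substituting its file name and its SHA256 digest.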

Provenance

The following attestation bundles were made for document_to_podcast-1.4.2.tar.gz:

Publisher: release.yaml on mozilla-ai/document-to-podcast

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file document_to_podcast-1.4.2-py3-none-any.whl.

File hashes

Hashes for document_to_podcast-1.4.2-py3-none-any.whl
Algorithm Hash digest
SHA256 dee956acd1c2a34eb839a826698c95c2a2e0c58a614ea10966e841bd28c82120
MD5 0377457b4a9ce39f09192ee938d7f53f
BLAKE2b-256 f2fdd7c9bf564f4a26104e84aa3e289d9cd0a02fd343f8ff32a31806d76dc2e1

Provenance

The following attestation bundles were made for document_to_podcast-1.4.2-py3-none-any.whl:

Publisher: release.yaml on mozilla-ai/document-to-podcast

