Skip to main content

No project description provided

Project description

Project logo

Document-to-podcast: a Blueprint by Mozilla.ai for generating podcasts from documents using local AI

Docs Tests Ruff

This blueprint demonstrate how you can use open-source models & tools to convert input documents into a podcast featuring two speakers. It is designed to work on most local setups, meaning no external API calls or GPU access is required. This makes it more accessible and privacy-friendly by keeping everything local.

document-to-podcast Diagram

👉 📖 For more detailed guidance on using this project, please visit our Docs.

👉 🔨 Built with

👉 🧠 Check the Supported Models.

Quick-start

Get started right away using one of the options below:

Google Colab HuggingFace Spaces GitHub Codespaces
Try on Colab Try on Spaces Try on Codespaces

You can also install and use the blueprint locally:

Command Line Interface

pip install document-to-podcast
document-to-podcast \
--input_file "example_data/Mozilla-Trustworthy_AI.pdf" \
--output_folder "example_data"
--text_to_text_model "Qwen/Qwen2.5-1.5B-Instruct-GGUF/qwen2.5-1.5b-instruct-q8_0.gguf"

Graphical Interface App

git clone https://github.com/mozilla-ai/document-to-podcast.git
cd document-to-podcast
pip install -e .
python -m streamlit run demo/app.py

License

This project is licensed under the Apache 2.0 License. See the LICENSE file for details.

Contributing

Contributions are welcome! To get started, you can check out the CONTRIBUTING.md file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

document_to_podcast-1.4.0.tar.gz (2.8 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

document_to_podcast-1.4.0-py3-none-any.whl (30.4 kB view details)

Uploaded Python 3

File details

Details for the file document_to_podcast-1.4.0.tar.gz.

File metadata

  • Download URL: document_to_podcast-1.4.0.tar.gz
  • Upload date:
  • Size: 2.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for document_to_podcast-1.4.0.tar.gz
Algorithm Hash digest
SHA256 62dabccb78d3ed53cf22b326c8560f71d099857c461a4a493b206cfc02de8c7f
MD5 7e182347be46703c209796752e2ae53f
BLAKE2b-256 f7e264f09c948b890eb828f64084bb9b61ac48138445c197a4f157486ff3cf56

See more details on using hashes here.

Provenance

The following attestation bundles were made for document_to_podcast-1.4.0.tar.gz:

Publisher: release.yaml on mozilla-ai/document-to-podcast

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file document_to_podcast-1.4.0-py3-none-any.whl.

File metadata

File hashes

Hashes for document_to_podcast-1.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6d7c2046d8ad0baf4813fb1722de2756405967e83dff757df5fb339acd8faf89
MD5 5117adb32a5d96d994d38138df9c3eee
BLAKE2b-256 1aaa28e95534c90e48ccb786da36046c99b573ee0d89037575d3c2bfcc7210c6

See more details on using hashes here.

Provenance

The following attestation bundles were made for document_to_podcast-1.4.0-py3-none-any.whl:

Publisher: release.yaml on mozilla-ai/document-to-podcast

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page