OpenAI GPT based informational audiobook/podcast mp3 generator

These details have not been verified by PyPI

Project links

Repository

Project description

podgenai

podgenai is a Python 3.12 application to generate an informational single-speaker audiobook/podcast mp3 file on a given topic using an OpenAI LLM. The loosely targeted duration of the generated file is an hour, although the generated duration varies. A funded OpenAI API key is required.

Links

Caption	Link
Repo	https://github.com/impredicative/podgenai
Changelog	https://github.com/impredicative/podgenai/releases
Package	https://pypi.org/project/podgenai
Podcast	https://open.spotify.com/show/0WayD9YdeSTxcfm63lnam5
Podcast RSS	https://anchor.fm/s/f4868644/podcast/rss

Approach

The gpt-5.x-chat and tts-1 models are used. For a given topic, the high-level reference approach is:

Applicable subtopics are listed using the LLM. If however the topic is unknown to the LLM, the process is aborted.
The voice is selected using the LLM from the available choices.
Concurrently for each subtopic, the corresponding text and speech are generated using the LLM and TTS respectively.
The speech files are concatenated using ffmpeg.

Although there may sometimes exist some semantic repetition of content across subtopics, this has intentionally not been optimized away because this repetition of important points can help with learning and memorization.

Samples

These generated mp3 files are available for download, one for each voice. As a reminder, the voice is selected by the LLM.

There also is a related podcast (RSS) to which episodes may be posted over time.

A playback speed of 1.05x is recommended for non-technical topics, 1.0x for technical topics, and 0.95x for foreign language topics.

Voice	Name
analytical-male	Artificial General Intelligence (AGI): Approaches and Algorithms
elegant-female	Monero
emotive-male	Living a good life
expository-male	History of Neural Networks
informative-male	Bitcoin for nerds
serene-female	Human circulatory system (unabridged)

Setup

Common setup

In the working directory, create a file named .env, with the intended environment variable OPENAI_API_KEY=<your OpenAI API key>, or set it in a different way.
Optionally also set the environment variable PODGENAI_OPENAI_MAX_WORKERS=32 for faster generation, with its default value being 16.
Ensure that ffmpeg is available. This is automatic if using the included devcontainer definition.
Continue the setup via GitHub or PyPI as below.

Setup via GitHub using devcontainer

Continue from the common setup steps.
Clone or download this repo.
Build and provision the defined devcontainer.

Setup via GitHub manually

Continue from the common setup steps.
Clone or download this repo.
Ensure that rye is installed and available.
In the repo directory, run rye sync --no-lock.

Setup via PyPI

Continue from the common setup steps.
Create and activate a Python 3.12 devcontainer or virtual environment.
Install via PyPI: pip install -U podgenai.

Usage

Usage can be as a command-line application or as a Python library. By default, the generated mp3 file will be written to the current working directory.

Usage tips

If a requested topic fails to generate subtopics due to a refusal, retry up to a few times, as it may succeed with several attempts. If it doesn't, try rewording it, perhaps to be broader or narrower or more factual. Up to two attempts are made per run, although the first attempt will reuse the disk cache if available.
For a potentially longer list of covered subtopics, consider appending the "(unabridged)" suffix to the requested topic, e.g. "PyTorch (unabridged)".
In case the topic fails to be spoken at the start of a podcast, delete ./work/<topic>/1.*.mp3 and regenerate the output.
To optionally generate a cover art image for your topic, this custom GPT can be used.
To attempt generation in a foreign language, specify the title in the desired language along with a parenthesized prefix of the language name, e.g. "México (español)". If the generation is refused the first time, try again. Also refer to and use the --no-markers option.

Usage as application

Usage help is copied below:

$ python -m podgenai -h
Usage: python -m podgenai [OPTIONS]

  Generate and write an audiobook podcast mp3 file for the given topic to the given output file path.

Options:
  -t, --topic TEXT                Topic. If not given, the user is prompted for it.
  -p, --path PATH                 Output file or directory path. If an intended file path, it must have an ".mp3"
                                  suffix. If a directory, it must exist, and the file name is auto-determined. If not
                                  given, the output file is written to the current working directory with an auto-
                                  determined file name.
  -s, --max-sections INTEGER RANGE
                                  Maximum number of sections, between 3 and 100. If not given, it is unrestricted.
                                  [3<=x<=100]
  -m, --markers / -nm, --no-markers
                                  Include markers at the start or end of sections in the generated audio. If
                                  `--markers`, markers are included, and this is the default. If `--no-markers`,
                                  markers are excluded, as can be appropriate for foreign-language generation.
  -c, --confirm / -nc, --no-confirm
                                  Confirm before full-text and speech generation. If `--confirm`, a confirmation is
                                  interactively sought as each step of the workflow progresses, and this is the
                                  default. If `--no-confirm`, the full-text and speech are generated without
                                  confirmations.
  -h, --help                      Show this message and exit.

Usage examples:

$ python -m podgenai -t "My favorite topic"

$ python -m podgenai -t "My favorite topic" -p ~/Downloads/

$ python -m podgenai -t "My favorite topic" -p ~/Downloads/topic.mp3 -nc

$ python -m podgenai -t "L'histoire de Napoléon Bonaparte (français)" -nm

Usage as library

>>> from podgenai import generate_media
>>> import inspect

>>> print(inspect.signature(generate_media))
(topic: str, *, output_path: Optional[pathlib.Path] = None, max_sections: Optional[int] = None, markers: bool = True, confirm: bool = False) -> pathlib.Path

>>> print(inspect.getdoc(generate_media))

Return the output path after generating and writing an audiobook podcast to file for the given topic.

Params:
* `topic`: Topic.
* `output_path`: Output file or directory path.
    If an intended file path, it must have an ".mp3" suffix. If a directory, it must exist, and the file name is auto-determined.
    If not given, the output file is written to the repo directory with an auto-determined file name.
* `max_sections`: Maximum number of sections to generate. It is between 3 and 100. It is unrestricted if not given.
* `markers`: Include markers at the start or end of sections in the generated audio.
    If true, markers are included. If false, markers are excluded, as can be appropriate for foreign-language generation. Its default is true.
* `confirm`: Confirm before full-text and speech generation.
    If true, a confirmation is interactively sought after generating and printing the list of subtopics, before generating the full-text, and also before generating the speech. Its default is false.

If failed, a subclass of the `podgenai.exceptions.Error` exception is raised.

Cache

Text and speech segments are cached locally on disk in the ./work/<topic> directory. They can manually be deleted. This deletion is currently not automatic. Moreover, it can currently be necessary to delete one or more applicable cached files if the cache is to be bypassed.

Disclaimer

_{This software is provided "as is," without warranty of any kind, express or implied, including but not limited to the warranties of merchantability, fitness for a particular purpose, and noninfringement. In no event shall the authors or copyright holders be liable for any claim, damages, or other liability, whether in an action of contract, tort, or otherwise, arising from, out of, or in connection with the software or the use or other dealings in the software.}

_{Users should be aware that both the text and the audio of the generated files are produced by artificial intelligence (AI) based on the inputs given and the data available to the AI model at the time of generation. As such, inaccuracies, errors, or unintended content may occur. Users are advised to exercise caution and verify the accuracy and appropriateness of the generated content before any use or reliance.}

_{You are responsible for the costs associated with the use of the OpenAI API as required by the software, and you must comply with the OpenAI API terms of service. The software's functionality is dependent on the availability and functionality of external services and software, including but not limited to the OpenAI API and ffmpeg, over which the authors have no control.}

_{The use of the OpenAI API key and any generated content must comply with all applicable laws and regulations, including copyright laws and the terms of service of the OpenAI platform. You are solely responsible for ensuring that your use of the software and any generated content complies with the OpenAI terms of service and any other applicable laws and regulations.}

_{This software is licensed under the GNU Lesser General Public License (LGPL), which allows for both private and commercial use, modification, and distribution, subject to the terms and conditions set forth in the LGPL. You should have received a copy of the GNU Lesser General Public License along with this program. If not, see http://www.gnu.org/licenses/.}

_{The authors do not claim ownership of any content generated using this software. Responsibility for the use of any and all generated content rests with the user. Users should exercise caution and due diligence to ensure that generated content does not infringe on the rights of third parties.}

_{This disclaimer is subject to change without notice. It is your responsibility to review it periodically for updates.}

Project details

These details have not been verified by PyPI

Project links

Repository

Release history Release notifications | RSS feed

This version

0.17.2

Feb 10, 2026

0.17.1

Dec 28, 2025

0.17.0

Dec 28, 2025

0.16.0

Dec 2, 2025

0.15.2

Aug 15, 2025

0.15.1

Aug 9, 2025

0.15.0

Aug 9, 2025

0.14.0

Jun 27, 2025

0.13.0

Mar 23, 2025

0.12.0

Mar 10, 2025

0.11.1

Feb 23, 2025

0.11.0

Dec 18, 2024

0.10.1

Dec 16, 2024

0.9.0

Dec 1, 2024

0.8.0

Nov 24, 2024

0.7.0

Oct 19, 2024

0.6.2

Oct 2, 2024

0.6.1

Oct 2, 2024

0.5.7

Sep 23, 2024

0.5.6

Sep 21, 2024

0.5.4

Sep 20, 2024

0.5.3

Sep 19, 2024

0.5.2

Sep 11, 2024

0.5.1

Jul 25, 2024

0.4.0

Jul 14, 2024

0.3.0

Jul 7, 2024

0.2.2

Jun 18, 2024

0.2.1

May 19, 2024

0.1.5

May 16, 2024

0.1.4

May 15, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

podgenai-0.17.2.tar.gz (28.9 kB view details)

Uploaded Feb 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

podgenai-0.17.2-py3-none-any.whl (35.1 kB view details)

Uploaded Feb 10, 2026 Python 3

File details

Details for the file podgenai-0.17.2.tar.gz.

File metadata

Download URL: podgenai-0.17.2.tar.gz
Upload date: Feb 10, 2026
Size: 28.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.12.9

File hashes

Hashes for podgenai-0.17.2.tar.gz
Algorithm	Hash digest
SHA256	`021d068b7120377245e547a9a40c4a2605761947d4738180512bd67bdd1f08c5`
MD5	`e7107998f30658bf90c11c8e15270677`
BLAKE2b-256	`eae755cd8b9cae45566d0dc98daec2c6d5a70bd6c32603ed680d9f9cf31a4fd1`

See more details on using hashes here.

File details

Details for the file podgenai-0.17.2-py3-none-any.whl.

File metadata

Download URL: podgenai-0.17.2-py3-none-any.whl
Upload date: Feb 10, 2026
Size: 35.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.12.9

File hashes

Hashes for podgenai-0.17.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e94cf8acd4cd83fec155195f5a6e214ea95a9c43b38ae188b40d652f44a5868e`
MD5	`61b408b10494cbbc14861837409fa975`
BLAKE2b-256	`38849f0ef236653f087faadb34ec76ecb4fc2a9ac586da18d386f6f7ff6456e8`

See more details on using hashes here.

podgenai 0.17.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

podgenai

Links

Approach

Samples

Setup

Common setup

Setup via GitHub using devcontainer

Setup via GitHub manually

Setup via PyPI

Usage

Usage tips

Usage as application

Usage as library

Cache

Disclaimer

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes