Automate chunking long texts to produce a single audio file from text-to-speech APIs

These details have not been verified by PyPI

Project links

Homepage

Project description

tts-joinery

tts-joinery is a Python library and CLI tool to work around length limitations in text-to-speech APIs.

Because currently popular TTS APIs limit input to 4096 characters, this library will:

Chunk the input text into sentences using the NLTK Punkt module
Run each chunk through the TTS API
Join together the resulting output to produce a single MP3 file
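The chunking step above can be sketched roughly as follows. This is a minimal illustration, not the library's actual implementation: a naive regex splitter stands in for NLTK Punkt so the example runs without extra downloads, and the names split_sentences and chunk_text are hypothetical.

```python
import re

MAX_CHARS = 4096  # input limit cited above


def split_sentences(text):
    # Naive stand-in for NLTK Punkt (an assumption for this sketch):
    # split after '.', '!' or '?' when followed by whitespace.
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def chunk_text(text, max_chars=MAX_CHARS):
    """Greedily pack whole sentences into chunks of at most max_chars."""
    chunks, current = [], ""
    for sentence in split_sentences(text):
        # A single sentence longer than the limit is hard-split.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            # Adding this sentence would exceed the limit: start a new chunk.
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


# A tiny limit makes the behaviour visible on a short input.
print(chunk_text("One. Two! Three?", max_chars=9))
```

Packing whole sentences keeps each request under the limit while avoiding cuts in the middle of a sentence, which would otherwise be audible where the audio segments are joined.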

Currently only the OpenAI API is supported, with the intent to add more in the future.

Installation

pip install tts-joinery

Alternatively, use pipx to install it as a standalone tool.

Requires ffmpeg for the audio file processing.

How you install ffmpeg depends on your system. On Linux, use your system package manager. On macOS, brew install ffmpeg should work.
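To confirm that ffmpeg is reachable before running the tool, a quick check like the one below may help. It is a sketch that only looks for an executable named ffmpeg on PATH using the standard library; the helper name ffmpeg_available is hypothetical and is not part of tts-joinery.

```python
import shutil


def ffmpeg_available(binary="ffmpeg"):
    """Return True if an executable with this name is found on PATH."""
    return shutil.which(binary) is not None


if ffmpeg_available():
    print("ffmpeg found")
else:
    print("ffmpeg not found on PATH; install it before running ttsjoin")
```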

Usage

Command-Line Interface (CLI)

The CLI expects to find an OpenAI API key in an OPENAI_API_KEY environment variable, or in a .env file.
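The lookup described above could work along these lines. This is an illustrative sketch only: the order of precedence (environment first, then .env), the simple KEY=value parsing, and the function name find_api_key are assumptions, and the CLI itself may resolve the key differently.

```python
import os
from pathlib import Path


def find_api_key(env_path=".env", name="OPENAI_API_KEY"):
    """Return the key from the environment, else from a .env file, else None."""
    value = os.environ.get(name)
    if value:
        return value
    path = Path(env_path)
    if not path.is_file():
        return None
    for line in path.read_text().splitlines():
        line = line.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, val = line.partition("=")
        if key.strip() == name:
            # Drop optional surrounding quotes.
            return val.strip().strip("'\"")
    return None


key = find_api_key()
print("API key found" if key else "No API key found")
```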

Syntax

ttsjoin [OPTIONS]

Options

--input-file FILENAME   Plaintext file to process into speech, otherwise stdin
--output-file FILENAME  MP3 result, otherwise stdout
--model TEXT            Slug of the text-to-speech model to be used
--service TEXT          API service (currently only supports openai)
--voice TEXT            Slug of the voice to be used
--no-cache BOOLEAN      Disable caching
--help                  Show this message and exit.

Examples

Using an input file and specifying an output file:

ttsjoin --input-file input.txt --output-file output.mp3 --model tts-1 --service openai --voice onyx

Using stdin and stdout with default options:

echo "Your text to be processed" | ttsjoin > output.mp3

Each chunk of text is cached, which improves performance when the same text is processed multiple times. Caching can be disabled:

ttsjoin --input-file input.txt --output-file output.mp3 --no-cache
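One way a per-chunk cache like this can work is to key each audio result by a hash of the chunk text together with the model and voice. The sketch below illustrates that idea only; the cache layout, the function names cache_key and cached_tts, and the synthesize callable are hypothetical, and the library's real caching may differ.

```python
import hashlib
from pathlib import Path


def cache_key(text, model, voice):
    # The same text with a different model or voice must not collide.
    payload = "\x00".join((model, voice, text)).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def cached_tts(text, model, voice, synthesize, cache_dir):
    """Return audio bytes for text, calling synthesize only on a cache miss.

    synthesize is any callable taking the text and returning bytes; here it
    stands in for the real API request.
    """
    cache_dir = Path(cache_dir)
    cache_dir.mkdir(parents=True, exist_ok=True)
    path = cache_dir / f"{cache_key(text, model, voice)}.mp3"
    if path.exists():
        return path.read_bytes()
    audio = synthesize(text)
    path.write_bytes(audio)
    return audio
```

With this scheme, re-running the same input only calls the API for chunks whose text, model, or voice has changed.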

Python Library

You can also use tts-joinery as part of your Python project:

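import os

from joinery.op import JoinOp
from joinery.api.openai import OpenAIApi

# Read the API key from the environment rather than hard-coding it
OPENAI_API_KEY = os.environ['OPENAI_API_KEY']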

tts = JoinOp(
    text='This is only a test!',
    api=OpenAIApi(
        model='tts-1-hd',
        voice='onyx',
        api_key=OPENAI_API_KEY,
    ),
)

tts.process_to_file('output.mp3')

Contributing

Contributions are welcome, particularly support for other TTS APIs. Check the issues beforehand, and feel free to open a PR. Code is formatted with Black.

License

This project is licensed under the MIT License.

Project details

Release history

1.0.1 (this version): Aug 16, 2024
1.0.0: Aug 16, 2024

Download files

Source Distribution

tts_joinery-1.0.1.tar.gz (7.7 kB)

Uploaded Aug 16, 2024 Source

Built Distribution

tts_joinery-1.0.1-py3-none-any.whl (8.1 kB)

Uploaded Aug 16, 2024 Python 3

Hashes for tts_joinery-1.0.1.tar.gz

Algorithm	Hash digest
SHA256	`fd8b0fab355f6404a95e26e1911f3ea410bde76b4b033d975d3b4c179e5606e8`
MD5	`983e1bc3571a70dc9a1971aa4f373b9c`
BLAKE2b-256	`ee2fb45ca9d4162958e4f273bb85d18ea269fba80a6c060f2331c44ee02a0587`

Hashes for tts_joinery-1.0.1-py3-none-any.whl

Algorithm	Hash digest
SHA256	`07c0e5d01d66ddf7bfc9f1da1c1f475cade944d877aaa5122fe2c4f6c739b2d5`
MD5	`72dc29b768b8a6b0b69277e9c6948006`
BLAKE2b-256	`5840e587e0638b61444e9f6180710f42f1fca14dbd1cd945f753ac46d56bcb62`