A powerful and easy-to-use Python library for generating natural-sounding speech using OpenAI text-to-speech capabilities.

Project description

🎙️ openai-tts

License Python Status

A powerful and easy-to-use Python library for generating natural-sounding speech using OpenAI's text-to-speech capabilities.

✨ Features

🔊 Convert text to high-quality speech using OpenAI's TTS API
🎭 Multiple voice options (Alloy, Ash, Ballad, Coral, Echo, Fable, Onyx, Nova, Sage, Shimmer, Verse)
🚀 Concurrent processing for faster generation of audio files
🧩 Modular and extensible architecture for adding new providers
📝 Intelligent sentence splitting for natural-sounding speech
🛠️ Comprehensive error handling and retry mechanism

🚨 Disclaimer

IMPORTANT: This library is developed for educational and research purposes only. It is not affiliated with, endorsed by, or connected to OpenAI in any way. Using this library to circumvent API restrictions, terms of service, or to access services without proper authorization may violate OpenAI's terms of service.

The developers of this library are not responsible for any misuse or violations of terms of service that may result from using this code. Users are solely responsible for ensuring their use of this library complies with all applicable terms of service and laws.

Project Structure

openai-tts/
├── LICENCE
├── README.md
├── requirements.txt
├── setup.py
├── example.py
├── openai_tts/
│   ├── __init__.py
│   ├── config.py
│   ├── utils.py
│   ├── exceptions.py
│   ├── providers/
│   │   ├── __init__.py
│   │   ├── base.py
└───└───└── openai.py

🛠️ Installation

Using PyPI (Recommended)

pip install openai-tts

Clone Locally

git clone https://github.com/sujalrajpoot/openai-tts.git
cd openai-tts
pip install -r requirements.txt

🔍 Dependencies

Python 3.8+
curl-cffi

📋 Quick Start

from openai_tts import OpenaiTTS
from openai_tts.config import VoiceType

# Initialize the TTS engine
tts = OpenaiTTS()

# Generate speech with default settings
text = "Hello world! This is a demonstration of the OpenAI TTS library."
tts.speak(text)  # Saves to default "output.mp3"

# Try different voices
tts.speak(text, voice=VoiceType.ECHO, output_path="echo_voice.mp3")
tts.speak(text, voice=VoiceType.NOVA, output_path="nova_voice.mp3")

# Control verbosity
tts.speak(text, verbose=False, output_path="quiet_output.mp3")

🎯 How It Works

The OpenAI TTS Library operates through a series of sophisticated steps:

Text Preprocessing: The input text is divided into natural sentences using our custom SentenceTokenizer, ensuring that the generated speech will sound natural with appropriate pauses.
Parallel Processing: Each sentence is processed concurrently using a thread pool, maximizing efficiency especially for longer texts.
API Interaction: The library communicates with OpenAI's TTS API, handling authentication, request formatting, and response processing.
Error Handling: Robust retry mechanisms and error handling ensure reliability even when network issues occur.
Output Generation: The audio chunks are assembled in the correct order and saved to the specified output file.

🌟 Voice Options

Choose from a variety of voice options:

📚 Advanced Usage

Custom Configuration

from openai_tts import OpenaiTTS, TTSConfig
from openai_tts.config import VoiceType

# Create custom configuration
config = TTSConfig(
    timeout=30,  # Increase timeout to 30 seconds
    verbose=True,  # Print detailed progress
    output_path="custom.mp3",  # Default output path
    voice=VoiceType.NOVA  # Default voice
)

# Initialize with custom config
tts = OpenaiTTS(config=config)

# Use the configured TTS
tts.speak("This text will be converted using the custom configuration.")

# Override specific settings for a single call
tts.speak(
    "This will use different settings just for this call.",
    voice=VoiceType.ECHO,
    output_path="override.mp3"
)

Error Handling

from openai_tts import OpenaiTTS
from openai_tts.exceptions import TTSException

tts = OpenaiTTS()

try:
    tts.speak("This is a test of error handling.")
except TTSException as e:
    print(f"An error occurred: {e}")

🤝 Contributing

Contributions are welcome! Here's how you can help:

Fork the repository

Create a feature branch:

git checkout -b feature/amazing-feature

Make your changes
Run the tests:
```
python -m unittest discover
```
Commit your changes:
```
git commit -m 'Add amazing feature'
```

Push to your branch:

git push origin feature/amazing-feature

Open a Pull Request

Please ensure your code follows the project's style guide and includes appropriate tests.

📊 Performance Considerations

The library is designed to handle large text inputs efficiently through parallel processing. However, very large texts may still take considerable time to process due to API rate limits and processing requirements.

For optimal performance:

Split very large texts into reasonable chunks before processing
Consider running resource-intensive operations in a background process
Use the verbose=True option to monitor progress during long operations

🔒 Security

This library communicates with external services. Always be mindful of:

The content you're sending to the API
Where you're storing the generated audio files
Who has access to your implementation

💡 Use Cases

🎙️ Content Creation: Generate voiceovers for videos, podcasts, or presentations
📚 Accessibility: Convert written content to audio for accessibility purposes
🤖 Chatbots and Virtual Assistants: Give your applications a voice
🎮 Gaming: Create dynamic dialogue for game characters
📱 Mobile Apps: Add speech capabilities to your applications

❓ FAQ

Q: Is this an official OpenAI library?
A: No, this is an unofficial, community-developed library for educational purposes.

Q: Do I need an OpenAI account to use this?
A: This library uses OpenAI's public TTS interface and does not require an API key.

Q: Can I use this for commercial projects?
A: Please refer to OpenAI's terms of service regarding the usage of their TTS capabilities. This library is for educational purposes only.

Q: How can I improve the speech quality?
A: Try different voices, ensure proper punctuation in your text, and break long paragraphs into natural sentences.

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🌟 If you find this library helpful, please consider starring the repository on GitHub!

📧 Questions or suggestions? Open an issue on GitHub or contact the maintainers.

Project details

Release history Release notifications | RSS feed

This version

2.0.0

Jun 24, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

openai_tts-2.0.0.tar.gz (14.2 kB view details)

Uploaded Jun 24, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

openai_tts-2.0.0-py3-none-any.whl (12.8 kB view details)

Uploaded Jun 24, 2025 Python 3

File details

Details for the file openai_tts-2.0.0.tar.gz.

File metadata

Download URL: openai_tts-2.0.0.tar.gz
Upload date: Jun 24, 2025
Size: 14.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.0

File hashes

Hashes for openai_tts-2.0.0.tar.gz
Algorithm	Hash digest
SHA256	`697a048f80e3ae07e94b836cf87ba4bdc0c2693613f0f02e632b7a5b4d3033b8`
MD5	`8c43d4fdbc8dc57ed56b412be5d301fd`
BLAKE2b-256	`c5449445984338c1b8b94dded6a6bccb364be33211db0d4faefa03a73bde1f1a`

See more details on using hashes here.

File details

Details for the file openai_tts-2.0.0-py3-none-any.whl.

File metadata

Download URL: openai_tts-2.0.0-py3-none-any.whl
Upload date: Jun 24, 2025
Size: 12.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.0

File hashes

Hashes for openai_tts-2.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`71ab291542b9f74a15b3d53196aa8e5147453c5ff2ae44fd70af0a373fb72c00`
MD5	`7bc4053dcc45c77486334323000b47bb`
BLAKE2b-256	`f9bbc5cd80c3684e98d47588e22e5e8b4b43f5cd2d06b41a365f7a2915dff4b3`

See more details on using hashes here.

openai-tts 2.0.0

Navigation

Verified details

Maintainers

Meta

Unverified details

Meta

Project description

🎙️ openai-tts

✨ Features

🚨 Disclaimer

Project Structure

🛠️ Installation

🔍 Dependencies

📋 Quick Start

🎯 How It Works

🌟 Voice Options

📚 Advanced Usage

Custom Configuration

Error Handling

🤝 Contributing

📊 Performance Considerations

🔒 Security

💡 Use Cases

❓ FAQ

📄 License

Project details

Verified details

Maintainers

Meta

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes