Generate speech using various libraries.
Sinapsis Speech
Templates for a wide range of voice generation tasks.
🐍 Installation • 📦 Packages • 🌐 Webapps • 📙 Documentation • 🔍 License
🐍 Installation
[!IMPORTANT] Sinapsis projects require Python 3.10 or higher.
We strongly encourage the use of uv, although any other package manager should work too.
If you need to install uv please see the official documentation.
- Install using your favourite package manager.
Example with uv:
uv pip install sinapsis-elevenlabs --extra-index-url https://pypi.sinapsis.tech
or with raw pip:
pip install sinapsis-elevenlabs --extra-index-url https://pypi.sinapsis.tech
Replace the package name with the one you want to install.
[!TIP] You can also install all the packages within this project:
uv pip install sinapsis-speech[all] --extra-index-url https://pypi.sinapsis.tech
📦 Packages
Each package can be used independently or combined to create more complex workflows. Below is an overview of the available packages:
Elevenlabs
This package provides a suite of templates and utilities for seamlessly integrating, configuring, and running text-to-speech (TTS) and voice generation functionalities powered by ElevenLabs:
- Text-to-speech: Template for converting text into speech using ElevenLabs' voice models.
- Voice generation: Template for generating custom synthetic voices based on user-provided descriptions.
🌐 Webapps
The webapps included in this project showcase the modularity of the templates, in this case for speech generation tasks.
[!IMPORTANT] To run the app you first need to clone this repository:
git clone git@github.com:Sinapsis-ai/sinapsis-speech.git
cd sinapsis-speech
[!NOTE] If you'd like to enable external app sharing in Gradio, set the following environment variable:
export GRADIO_SHARE_APP=True
[!IMPORTANT] The CosyVoice model requires at least 4 GB of RAM to work.
[!IMPORTANT] ElevenLabs requires an API key to run any inference. If you do not have an account, go to the official website and create one. If you already have an account, go to the token page and generate a token.
[!IMPORTANT] Set the API key as an environment variable:
export ELEVENLABS_API_KEY='your-api-key'
[!TIP] The agent configuration can be updated using the AGENT_CONFIG_PATH environment variable.
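Because a missing API key only surfaces once inference is attempted, a small pre-flight check before launching the app can save a round trip. This is a sketch under stated assumptions: `missing_vars` is a hypothetical helper, not part of any Sinapsis package, and it only checks that the variable named in this README is set and non-empty.

```python
import os
from collections.abc import Mapping


def missing_vars(
    env: Mapping[str, str],
    required: tuple[str, ...] = ("ELEVENLABS_API_KEY",),
) -> list[str]:
    """Return the names of required variables that are unset or blank in `env`."""
    return [name for name in required if not env.get(name, "").strip()]


if __name__ == "__main__":
    missing = missing_vars(os.environ)
    if missing:
        print("Missing environment variables: " + ", ".join(missing))
    else:
        print("All required environment variables are set.")
```

Passing the environment in as a mapping keeps the check easy to exercise with a plain dictionary instead of the real process environment.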
🐳 Build with Docker
[!IMPORTANT] This Docker image depends on the sinapsis-nvidia:base image. Please refer to the official Sinapsis instructions to Build with Docker.
- Build the Docker image:
docker compose -f docker/compose.yaml build
- Launch the service:
docker compose -f docker/compose_apps.yaml up -d sinapsis-elevenlabs
- Check the logs:
docker logs -f sinapsis-elevenlabs
- The logs will display the URL to access the webapp, e.g.:
Running on local URL: http://127.0.0.1:7860
- To stop the app:
docker compose -f docker/compose_apps.yaml down sinapsis-elevenlabs
💻 UV
- Sync the virtual environment:
uv sync --frozen
- Install the wheel:
uv pip install sinapsis-speech[all] --extra-index-url https://pypi.sinapsis.tech
- Activate the virtual environment:
source .venv/bin/activate
- Declare PYTHONPATH:
export PYTHONPATH=$PWD/webapps
[!NOTE] If you are not in the sinapsis-speech folder, replace $PWD with the actual path to sinapsis-speech.
- Launch the demo:
python webapps/elevenlabs/elevenlabs_tts_app.py
- Open the displayed URL, e.g.:
Running on local URL: http://127.0.0.1:7860
[!NOTE] The URL may differ, so check the logs for the actual address.
📙 Documentation
Documentation is available on the Sinapsis website.
Tutorials for different projects within Sinapsis are available on the Sinapsis tutorials page.
🔍 License
This project is licensed under the AGPLv3 license, which encourages open collaboration and sharing. For more details, please refer to the LICENSE file.
For commercial use, please refer to our official Sinapsis website for information on obtaining a commercial license.