Skip to main content

Generate speech using various libraries.

Project description



Sinapsis Speech

Templates for a wide range of voice generation tasks.

🐍 Installation📦 Packages🌐 Webapp📙 Documentation🔍 License

🐍 Installation

[!IMPORTANT] Sinapsis projects requires Python 3.10 or higher.

We strongly encourage the use of uv, although any other package manager should work too. If you need to install uv please see the official documentation.

  1. Install using your favourite package manager.

Example with uv:

  uv pip install sinapsis-elevenlabs --extra-index-url https://pypi.sinapsis.tech

or with raw pip:

  pip install sinapsis-elevenlabs --extra-index-url https://pypi.sinapsis.tech

Change the name of the package for the one you want to install.

[!TIP] You can also install all the packages within this project:

  uv pip install sinapsis-speech[all] --extra-index-url https://pypi.sinapsis.tech

📦 Packages

Each package can be used independently or combined to create more complex workflows. Below is an overview of the available packages:

Elevenlabs

This package provides a suite of templates and utilities for seamlessly integrating, configuring, and running text-to-speech (TTS) and voice generation functionalities powered by ElevenLabs:

  • Text-to-speech: Template for converting text into speech using ElevenLabs' voice models.

  • Voice generation: Template for generating custom synthetic voices based on user-provided descriptions.

🌐 Webapps

The webapps included in this project showcase the modularity of the templates, in this case for speech generation tasks.

[!IMPORTANT] To run the app you first need to clone this repository:

git clone git@github.com:Sinapsis-ai/sinapsis-speech.git
cd sinapsis-speech

[!NOTE] If you'd like to enable external app sharing in Gradio, export GRADIO_SHARE_APP=True

[!IMPORTANT] The CosyVoice model requires at least 4GB of ram to work.

[!IMPORTANT] Elevenlabs requires an api key to run any inference. Please go to the official website, create an account. If you already have an account, go to the token page and generate a token.

[!IMPORTANT] set your env var using export ELEVENLABS_API_KEY='your-api-key'

[!TIP] The agent configuration can be updated using the AGENT_CONFIG_PATH environment var.

🐳 Build with Docker

IMPORTANT This docker image depends on the sinapsis-nvidia:base image. Please refer to the official sinapsis instructions to Build with Docker.

  1. Build the Docker image:
docker compose -f docker/compose.yaml build
  1. Launch the service:
docker compose -f docker/compose_apps.yaml up -d sinapsis-elevenlabs
  1. Check the logs
docker logs -f sinapsis-elevenlabs
  1. The logs will display the URL to access the webapp, e.g.,::
Running on local URL:  http://127.0.0.1:7860
  1. To stop the app:
docker compose -f docker/compose_apps.yaml down sinapsis-elevenlabs
💻 UV
  1. Sync the virtual environment:
uv sync --frozen
  1. Install the wheel:
uv pip install sinapsis-speech[all] --extra-index-url https://pypi.sinapsis.tech
  1. Activate the virtual environment:
source .venv/bin/activate
  1. Declare PYTHONPATH
export PYTHONPATH=$PWD/webapps 

NOTE if not located in sinapsis-speech folder, change $PWD for the actual path to sinapsis-speech

  1. Launch the demo:
python webapps/elevenlabs/elevenlabs_tts_app.py
  1. Open the displayed URL, e.g.:
Running on local URL:  http://127.0.0.1:7860

NOTE: The URL can be different, please make sure you check the logs.

📙 Documentation

Documentation is available on the sinapsis website

Tutorials for different projects within sinapsis are available at sinapsis tutorials page

🔍 License

This project is licensed under the AGPLv3 license, which encourages open collaboration and sharing. For more details, please refer to the LICENSE file.

For commercial use, please refer to our official Sinapsis website for information on obtaining a commercial license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sinapsis_speech-0.1.0.tar.gz (47.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sinapsis_speech-0.1.0-py3-none-any.whl (36.1 kB view details)

Uploaded Python 3

File details

Details for the file sinapsis_speech-0.1.0.tar.gz.

File metadata

  • Download URL: sinapsis_speech-0.1.0.tar.gz
  • Upload date:
  • Size: 47.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.16

File hashes

Hashes for sinapsis_speech-0.1.0.tar.gz
Algorithm Hash digest
SHA256 a39772ce6491c9532b5590569e21d4be890d71a685672de53042ca2b8ba2a52e
MD5 2b070884d8459eee4af38dd30605d8aa
BLAKE2b-256 8568c74915b69b11d1ac920b780b8b3bd67ccaba8924f9f1aa6f42638fb98579

See more details on using hashes here.

File details

Details for the file sinapsis_speech-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for sinapsis_speech-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 cb9a7b3c4b750ec30ae1d31efa50fd3b28942d9d479e72bbf115f713f23a1806
MD5 45d1ad0155681f087f387b62629ae05f
BLAKE2b-256 6741eb66494005983b0df9e4592e87c47b2d99913daf87c0fb293dfccb504ee2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page