
A simple wrapper for Cartesia Sonic TTS

Project description

Cartesia Voice Manager

A wrapper around the official Cartesia API that makes its text-to-speech functionality more convenient to use. It also includes a Gradio interface for interactive use (currently in Russian).

TODO

  • Package and publish to PyPI
  • Translate Gradio interface to English
  • Implement voice mixing functionality in Gradio interface

Features

  • Easy-to-use wrapper for Cartesia API
  • Voice management (listing, creating custom voices, getting voice info)
  • Text-to-speech generation with various controls:
    • Language selection (auto-detect or manual)
    • Speed control
    • Emotion control
    • Text improvement options
  • Gradio web interface for interactive use (currently in Russian)
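
As a rough illustration of the voice listing feature, the sketch below filters a list of voice records by language, ownership, and name. The `Voice` record, its fields, and `filter_voices` are hypothetical names chosen for this example; they are not taken from the package and may differ from what it actually exposes.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class Voice:
    # Hypothetical record: field names are assumptions, not the package's API.
    id: str
    name: str
    language: str
    is_custom: bool = False


def filter_voices(
    voices: Iterable[Voice],
    language: Optional[str] = None,
    custom_only: bool = False,
    name_contains: Optional[str] = None,
) -> List[Voice]:
    """Return the voices that match every filter that was supplied."""
    result = []
    for voice in voices:
        if language is not None and voice.language != language:
            continue
        if custom_only and not voice.is_custom:
            continue
        if name_contains is not None and name_contains.lower() not in voice.name.lower():
            continue
        result.append(voice)
    return result
```

Filters that are left at their defaults are ignored, so calling `filter_voices(voices)` with no other arguments returns every voice.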

Functionality

  1. Voice Management

    • List available voices with filtering options
    • Create custom voices from audio files
    • Get detailed information about voices
  2. Text-to-Speech

    • Generate speech from text with selected voice
    • Control speech speed (very slow to very fast)
    • Add emotions to speech (happiness, sadness, anger, surprise, curiosity)
    • Adjust emotion intensity
    • Automatically improve input text for better TTS results
  3. Gradio Interface

    • User-friendly web interface for all of the features above
    • Real-time updates and previews
    • Easy configuration of TTS parameters
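
To make the speed and emotion controls more concrete, here is a minimal sketch of how such settings could be validated and collected before a request is sent. The function name `build_controls`, the intensity labels, and the shape of the returned dictionary are all assumptions made for illustration; the package and the Cartesia API may represent these settings differently.

```python
from typing import Dict, Optional

# Speed labels from the description above ("very slow" to "very fast").
SPEEDS = ("very slow", "slow", "normal", "fast", "very fast")

# Emotions named in the description above.
EMOTIONS = ("happiness", "sadness", "anger", "surprise", "curiosity")

# Hypothetical intensity labels; the real scale is not documented here.
INTENSITIES = ("low", "medium", "high")


def build_controls(
    speed: str = "normal",
    emotions: Optional[Dict[str, str]] = None,
) -> dict:
    """Validate the settings and return them as a plain dictionary."""
    if speed not in SPEEDS:
        raise ValueError(f"unknown speed: {speed!r}")

    tags = []
    for emotion, intensity in (emotions or {}).items():
        if emotion not in EMOTIONS:
            raise ValueError(f"unknown emotion: {emotion!r}")
        if intensity not in INTENSITIES:
            raise ValueError(f"unknown intensity: {intensity!r}")
        tags.append(f"{emotion}:{intensity}")

    # Sort so the result does not depend on the order the caller used.
    return {"speed": speed, "emotion": sorted(tags)}
```

Validating up front means a typo such as `"hapiness"` fails immediately with a clear message instead of being sent to the API.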

Note

This project is still a work in progress. Some features may be incomplete or subject to change. Contributions and suggestions are welcome!

Download files

Source Distribution

sonic_wrapper-0.1.0.tar.gz (12.4 kB)

Uploaded Source

Built Distribution

sonic_wrapper-0.1.0-py3-none-any.whl (9.8 kB)

Uploaded Python 3

File details

Details for the file sonic_wrapper-0.1.0.tar.gz.

File metadata

  • Download URL: sonic_wrapper-0.1.0.tar.gz
  • Size: 12.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.27.2

File hashes

Hashes for sonic_wrapper-0.1.0.tar.gz
  • SHA256: 75178663bfa66877a99bc0c3354461be2a1a4343a2d17c5085b361dd19ca9228
  • MD5: b4bb3d0e5cf3d3a7e90b4d3862b76115
  • BLAKE2b-256: 414ac954fe47b5539bc7236784b5945984de93be2c70d410db4823e81712b9f7
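
A downloaded file can be checked against the SHA256 digest listed above with Python's standard `hashlib` module. The sketch below is one way to do it; the helper names `sha256_of` and `verify` are made up for this example, and the file path in the usage note assumes the archive was saved to the current directory.

```python
import hashlib


def sha256_of(path: str, chunk_size: int = 65536) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: str, expected_hex: str) -> bool:
    """Return True if the file's digest matches the expected value."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, `verify("sonic_wrapper-0.1.0.tar.gz", "75178663bfa66877a99bc0c3354461be2a1a4343a2d17c5085b361dd19ca9228")` should return `True` for an intact copy of the source distribution.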

File details

Details for the file sonic_wrapper-0.1.0-py3-none-any.whl.

File hashes

Hashes for sonic_wrapper-0.1.0-py3-none-any.whl
  • SHA256: d7b3493394e7e5b0e5ca5816b716d064d5d12736d74c77051d8f96fd7fadcbb5
  • MD5: 0ab1c9d342becdfa550f78096bb19670
  • BLAKE2b-256: e4209dd5304ae4c9d8004d61f51e6ab5d4af0f7a2c6b9bce007ab9d867b47537
