Skip to main content

Synthese vocale par concatenation de diphones WORLD — francais

Project description

lectura-tts-diphone

Synthese vocale francaise par concatenation de diphones dans le domaine WORLD.

Installation

# Sans dependances (import seul)
pip install lectura-tts-diphone

# Inference locale (pyworld + numpy + scipy)
pip install "lectura-tts-diphone[local]"

# Avec G2P integre (texte → audio)
pip install "lectura-tts-diphone[all]"

Utilisation

Depuis du texte (necessite lectura-g2p)

from lectura_tts_diphone import synthetiser

audio = synthetiser("Bonjour le monde")
# audio: numpy array float32, 44100 Hz

Depuis des phonemes IPA

from lectura_tts_diphone import creer_engine

engine = creer_engine()
audio = engine.synthesize_groups([
    {"phones": ["b", "ɔ̃", "ʒ", "u", "ʁ"], "boundary": "none"},
    {"phones": ["l", "ə", "m", "ɔ̃", "d"], "boundary": "period"},
])

Controles prosodiques

Parametre Defaut Description
duration_scale 1.0 Vitesse globale (>1 = plus lent)
pause_scale 1.0 Duree des pauses inter-groupes
macro_expressivity 2.0 Gestes prosodiques (0=neutre, 4=exagere)
micro_expressivity 5.0 Micro-variations (0=robot, 10=tres expressif)
spectral_contrast 1.5 Contraste spectral (1.0=off, 2.0=fort)
prosody_style "auto" "declaratif", "question", "exclamation", "suspensif", "neutre"
seed None Graine pour micro-prosodie reproductible

Modes de synthese

  • FLUIDE : lecture naturelle, enchainement continu
  • MOT_A_MOT : lecture mot par mot avec pauses
  • SYLLABES : lecture syllabe par syllabe

Architecture

Texte → [G2P] → Phonemes IPA → Diphone chain
                                      ↓
                              WORLD params (F0 + SP + AP)
                                      ↓
                              Stretch + Concat (overlap)
                                      ↓
                              Prosodie (F0 contour + durees)
                                      ↓
                              GV compensation (contraste spectral)
                                      ↓
                              pw.synthesize → Audio 44100 Hz

Les diphones sont des parametres WORLD (F0 + spectral envelope + aperiodicity) extraits du corpus SIWIS et moyennes par type de transition phonetique.

Emplacements des modeles

Recherche dans l'ordre :

  1. Parametre models_dir explicite
  2. $LECTURA_MODELS_DIR/tts_diphone/
  3. ~/.lectura/models/tts_diphone/
  4. Modeles embarques dans le package

Fichier requis : diphones.dpk.gz (ou .dpk.gz.enc chiffre) Fichier optionnel : diphone_statistics.pkl

Licence

Double licence : AGPL-3.0 (code) + Licence Commerciale (modeles).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lectura_tts_diphone-1.3.3.tar.gz (37.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

lectura_tts_diphone-1.3.3-py3-none-any.whl (36.5 kB view details)

Uploaded Python 3

File details

Details for the file lectura_tts_diphone-1.3.3.tar.gz.

File metadata

  • Download URL: lectura_tts_diphone-1.3.3.tar.gz
  • Upload date:
  • Size: 37.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for lectura_tts_diphone-1.3.3.tar.gz
Algorithm Hash digest
SHA256 3ce1c0b913adb693454949dbeb1dbd948416aa89f00455593b03967b103d2227
MD5 e75c978c31d25aca6e8b0c53110eac36
BLAKE2b-256 c544680d8cdaeb3685b91b85101c57c02cdf9898a0c814b727dbff38520ad863

See more details on using hashes here.

File details

Details for the file lectura_tts_diphone-1.3.3-py3-none-any.whl.

File metadata

File hashes

Hashes for lectura_tts_diphone-1.3.3-py3-none-any.whl
Algorithm Hash digest
SHA256 07c15712c31cdde170c0b6841e5e2b01cafcf8cc85226b6abe49b89339cd3c76
MD5 8b00edbda07ac6a72ef23bdaf48598e4
BLAKE2b-256 4f7ed8c130cf444bd5539a9810a6b0215efdcfe3dadde63b6e1bd9a603a6824c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page