Pipeline STT complet du francais — audio vers texte (CTC + P2G)
Project description
Lectura STT — Pipeline STT complet du francais
Pipeline de transcription automatique du francais : audio vers texte. Chaine le decodeur CTC (audio → phones IPA) avec le pipeline P2G (phones → orthographe).
Installation
# Mode minimal (CTC uniquement, transcription phonetique)
pip install lectura-stt
# Avec pipeline P2G complet (formules + noms propres)
pip install lectura-stt[p2g]
# Avec backend ONNX (inference locale rapide)
pip install lectura-stt[onnx]
# Avec support micro
pip install lectura-stt[micro]
Exemple
import numpy as np
from lectura_stt import creer_engine
engine = creer_engine()
# Charger un fichier WAV
import wave
with wave.open("bonjour.wav", "rb") as wf:
sr = wf.getframerate()
audio = np.frombuffer(
wf.readframes(wf.getnframes()), dtype=np.int16
).astype(np.float32) / 32768.0
result = engine.transcrire(audio, sr=sr)
print(result.ipa) # "b ɔ̃ ʒ u ʁ | l ə | m ɔ̃ d ."
print(result.texte) # "Bonjour le monde."
Architecture
Audio 16kHz mono
|
v
[lectura-ctc] --> IPA phones "b ɔ̃ ʒ u ʁ | l ə | m ɔ̃ d ."
|
v
[_parse_ctc] --> mots IPA ["bɔ̃ʒuʁ", "lə", "mɔ̃d"] + ponctuation ["."]
|
v
[lectura-p2g] --> ortho ["bonjour", "le", "monde"]
|
v
[_assembler] --> "Bonjour le monde."
Licence
AGPL-3.0-or-later — voir LICENCE.txt. Licence commerciale disponible — voir LICENCE-COMMERCIALE.md.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file lectura_stt-1.0.0.tar.gz.
File metadata
- Download URL: lectura_stt-1.0.0.tar.gz
- Upload date:
- Size: 21.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e40022765902c1c99e442a484088bc3f34dd63939434296e3d560f655b499686
|
|
| MD5 |
6af557f4c11e8ad34851089cba4903b0
|
|
| BLAKE2b-256 |
58be8e92745f09395b263a150b57f943dbd355252223756c5621ead9e23a2a14
|
File details
Details for the file lectura_stt-1.0.0-py3-none-any.whl.
File metadata
- Download URL: lectura_stt-1.0.0-py3-none-any.whl
- Upload date:
- Size: 20.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
370c150841743abba60e93b5c7915270ff9923f946cdacdca8c99c98573767e6
|
|
| MD5 |
5f95fcd8664baf5bc02abc9de7340890
|
|
| BLAKE2b-256 |
11baf339a82a82b93f021f6adefbf8dccd735f49bb7937c4b8edb259b92e9ef2
|