Skip to main content

Decodeur phonetique CTC du francais — audio vers phones IPA (CNN-BiGRU-CTC)

Project description

lectura-ctc — Decodeur phonetique CTC du francais

Transcription audio vers phonemes IPA via un modele CNN-BiGRU-CTC medium (10.6M params, PER ~4.34%).

Le modele medium supporte les sigles, formules (nombres, dates, calculs) et la parole courante grace a un vocabulaire etendu de 59 tokens et un fine-tuning specialise.

Installation

# Avec backend ONNX (recommande)
pip install lectura-ctc[onnx]

# Sans ONNX (mode API uniquement)
pip install lectura-ctc

Utilisation

import numpy as np
from lectura_ctc import creer_engine

engine = creer_engine()

# Audio PCM float32 mono 16kHz
audio = np.zeros(16000, dtype=np.float32)  # 1 seconde de silence
result = engine.transcrire(audio)
print(result)  # chaine IPA : "b ɔ̃ ʒ u ʁ | l ə | m ɔ̃ d"

Modele

Caracteristique Valeur
Architecture CNN [48, 96] + BiGRU 384x4 + CTC head
Parametres 10.6M
Performance PER ~4.34% (formules v2)
Vocabulaire 59 tokens (46 phones IPA + liaisons + ponctuation + speciaux)
Entrainement Pre-train corpus complet + fine-tune formules/sigles multispeaker

Backends

Backend Dependance Latence Modele
ONNX Runtime onnxruntime ~10 ms/s audio phone_ctc_int8.onnx (38 Mo)
API aucune ~100 ms/s audio serveur Lectura

Parametres audio

  • Sample rate : 16 kHz
  • Format : PCM float32 mono
  • Mel : 80 bins, n_fft=512, hop=160, win=400

Licence

Double licence : AGPL-3.0 (libre) + Licence Commerciale (payante).

Les modeles ONNX sont distribues separement — voir MODEL_LICENCE.md.

Auteur

Max Carriere — lec-tu-ra.com

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lectura_ctc-2.0.0.tar.gz (25.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

lectura_ctc-2.0.0-py3-none-any.whl (25.0 kB view details)

Uploaded Python 3

File details

Details for the file lectura_ctc-2.0.0.tar.gz.

File metadata

  • Download URL: lectura_ctc-2.0.0.tar.gz
  • Upload date:
  • Size: 25.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for lectura_ctc-2.0.0.tar.gz
Algorithm Hash digest
SHA256 7af5e8d291dae3391d2ac41a9ae168535e25ff939086111610188811c8ca7a60
MD5 d168b295f76fcd5f3b1ebe2512f052ba
BLAKE2b-256 46eb473412dfa59847afb83ae30b9fedc948d6b2ef296640e91af86e39bf568f

See more details on using hashes here.

File details

Details for the file lectura_ctc-2.0.0-py3-none-any.whl.

File metadata

  • Download URL: lectura_ctc-2.0.0-py3-none-any.whl
  • Upload date:
  • Size: 25.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for lectura_ctc-2.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 14b5fb80bc76cd791ac9f83aab544f94e36eab94443c09750a0cb7f0afde6fe0
MD5 2c1de85baaf030bf2fbb82c8944aba80
BLAKE2b-256 e7bbddfa93e83a63fa3a605661da5a7a5b9d6305ecf817cc15ed28a231baf9d4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page