Automatic Speech Recognition (ASR) SDK for Nigerian languages Yoruba, Igbo, Hausa, and Nigerian-accented English using CTranslate2-converted Whisper models
Project description
NaijaLingo ASR SDK
ASR SDK for Nigerian languages using CTranslate2-converted Whisper models.
Install
pip install naijalingo-asr[audio]
Quickstart
from naijalingo_asr import transcribe
text = transcribe("/path/to/audio.wav", language="yo")
print(text)
text = transcribe("/path/to/igbo_audio.wav", language="ig")
print(text)
CLI
naijalingo-asr --audio_path /path/to/audio.wav --language yo
CLI options
--language(required): Language code (yo,ig,ha,en).--device(optional):auto(default),cpu, orcuda.--compute-type(optional):auto(default),float16,int8,int8_float16.--beam-size(optional): Beam search size (default:5).--no-vad(optional flag): Disable VAD filter (enabled by default).--temperature(optional): Sampling temperature (default:0.0).--initial-prompt(optional): Context prompt to prime decoding.--log-level(optional):DEBUG,INFO,WARNING(default),ERROR.
Examples:
# GPU with float16
naijalingo-asr --audio_path /path/to/audio.wav --language yo \
--device cuda --compute-type float16 --beam-size 5 --log-level INFO
# CPU with int8
naijalingo-asr --audio_path /path/to/audio.wav --language ig \
--device cpu --compute-type int8 --no-vad
# With an initial prompt context
naijalingo-asr --audio_path /path/to/audio.wav --language ha \
--initial-prompt "Medical conversation in Hausa"
Docker
Docker support has been removed. Use the CLI or Python API instead.
Supported languages
- yo: Yoruba
- ig: Igbo
- ha: Hausa
- en: Nigerian-accented English
Notes
- Uses faster-whisper (CTranslate2 backend)
- Accepts file paths (mp3/wav/m4a/etc.) via librosa, or a numpy array (mono 16k)
- No external ffmpeg required when installing with extras:
[audio] - Task is transcription only; set
task="transcribe"and the language code.
Logging
Set via CLI --log-level INFO or env NAIJALINGO_ASR_LOG=INFO.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file naijalingo_asr-0.1.2.tar.gz.
File metadata
- Download URL: naijalingo_asr-0.1.2.tar.gz
- Upload date:
- Size: 8.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b0fe118e7796fb228bcdf3b41b4c081b7f70c87601825ee383c86195aec4a7a7
|
|
| MD5 |
8338ca9532d4302fc55d6b4c458ea7a5
|
|
| BLAKE2b-256 |
e1caa6f0bf630d6fd4312c4dd85224b54d918cbc0654b178e59b216aa1db0a55
|
File details
Details for the file naijalingo_asr-0.1.2-py3-none-any.whl.
File metadata
- Download URL: naijalingo_asr-0.1.2-py3-none-any.whl
- Upload date:
- Size: 8.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a4a3520cd762886f50e25fd2f8e1b763c3dfb29f3200b32aa01518565e2a2e8e
|
|
| MD5 |
153b6c5433329064a52db200e9e197fa
|
|
| BLAKE2b-256 |
da33e9e073ff46dfd0053b553bdb29f670b7f5b24215d6d96054044f4fc36574
|