Skip to main content

Automatic Speech Recognition (ASR) SDK for Nigerian languages Yoruba, Igbo, Hausa, and Nigerian-accented English using CTranslate2-converted Whisper models

Project description

NaijaLingo ASR SDK

ASR SDK for Nigerian languages using CTranslate2-converted Whisper models.

Install

pip install naijalingo-asr[audio]

Quickstart

from naijalingo_asr import transcribe

text = transcribe("/path/to/audio.wav", language="yo")
print(text)

text = transcribe("/path/to/igbo_audio.wav", language="ig")
print(text)

CLI

naijalingo-asr --audio_path /path/to/audio.wav --language yo

CLI options

  • --language (required): Language code (yo, ig, ha, en).
  • --device (optional): auto (default), cpu, or cuda.
  • --compute-type (optional): auto (default), float16, int8, int8_float16.
  • --beam-size (optional): Beam search size (default: 5).
  • --no-vad (optional flag): Disable VAD filter (enabled by default).
  • --temperature (optional): Sampling temperature (default: 0.0).
  • --initial-prompt (optional): Context prompt to prime decoding.
  • --log-level (optional): DEBUG, INFO, WARNING (default), ERROR.

Examples:

# GPU with float16
naijalingo-asr --audio_path /path/to/audio.wav --language yo \
  --device cuda --compute-type float16 --beam-size 5 --log-level INFO

# CPU with int8
naijalingo-asr --audio_path /path/to/audio.wav --language ig \
  --device cpu --compute-type int8 --no-vad

# With an initial prompt context
naijalingo-asr --audio_path /path/to/audio.wav --language ha \
  --initial-prompt "Medical conversation in Hausa"

Docker

Docker support has been removed. Use the CLI or Python API instead.

Supported languages

  • yo: Yoruba
  • ig: Igbo
  • ha: Hausa
  • en: Nigerian-accented English

Notes

  • Uses faster-whisper (CTranslate2 backend)
  • Accepts file paths (mp3/wav/m4a/etc.) via librosa, or a numpy array (mono 16k)
  • No external ffmpeg required when installing with extras: [audio]
  • Task is transcription only; set task="transcribe" and the language code.

Logging

Set via CLI --log-level INFO or env NAIJALINGO_ASR_LOG=INFO.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

naijalingo_asr-0.1.2.tar.gz (8.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

naijalingo_asr-0.1.2-py3-none-any.whl (8.9 kB view details)

Uploaded Python 3

File details

Details for the file naijalingo_asr-0.1.2.tar.gz.

File metadata

  • Download URL: naijalingo_asr-0.1.2.tar.gz
  • Upload date:
  • Size: 8.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for naijalingo_asr-0.1.2.tar.gz
Algorithm Hash digest
SHA256 b0fe118e7796fb228bcdf3b41b4c081b7f70c87601825ee383c86195aec4a7a7
MD5 8338ca9532d4302fc55d6b4c458ea7a5
BLAKE2b-256 e1caa6f0bf630d6fd4312c4dd85224b54d918cbc0654b178e59b216aa1db0a55

See more details on using hashes here.

File details

Details for the file naijalingo_asr-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: naijalingo_asr-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 8.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for naijalingo_asr-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 a4a3520cd762886f50e25fd2f8e1b763c3dfb29f3200b32aa01518565e2a2e8e
MD5 153b6c5433329064a52db200e9e197fa
BLAKE2b-256 da33e9e073ff46dfd0053b553bdb29f670b7f5b24215d6d96054044f4fc36574

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page