Skip to main content

Quantized IndicConformer ASR for multiple Indian languages

Project description

Indic ASR

A quantized automatic speech recognition (ASR) system for multiple Indic languages using the IndicConformer architecture.

Installation

CPU-only Installation (Recommended for limited resources)

pip install --extra-index-url https://download.pytorch.org/whl/cpu indic-asr

GPU Installation

pip install indic-asr

For uv users:

uv pip install --extra-index-url https://download.pytorch.org/whl/cpu indic-asr

Quick Start

from indic_asr import IndicConformerTranscriber

# Initialize (downloads model automatically)
transcriber = IndicConformerTranscriber()

# Transcribe audio
text = transcriber.transcribe_ctc("audio.wav", "hi")  # Hindi
print(text)

Supported Languages

  • Hindi (hi)
  • Bengali (bn)
  • Telugu (te)
  • Marathi (mr)
  • Tamil (ta)
  • Gujarati (gu)
  • Kannada (kn)
  • Malayalam (ml)
  • Odia (or)
  • Punjabi (pa)
  • Assamese (as)

Features

  • Quantized Models: INT8 quantization for efficient CPU inference
  • Multiple Languages: Support for 11 Indic languages
  • Two Modes: CTC and RNN-T decoding
  • Auto Download: Models download automatically on first use
  • ONNX Runtime: Optimized inference with ONNX

Usage

CTC Mode (Faster)

text = transcriber.transcribe_ctc("audio.wav", "hi")

RNN-T Mode (More Accurate)

text = transcriber.transcribe_rnnt("audio.wav", "hi")

Audio Requirements

  • Format: WAV, MP3, FLAC, etc.
  • Sample Rate: Auto-resampled to 16kHz
  • Channels: Mono (auto-converted)

Performance

  • CPU Inference: ~50-100x real-time on modern CPUs
  • Memory: ~200-500MB per inference
  • Model Size: ~500MB (downloaded on first use)

License

MIT License

Citation

If you use this in your research, please cite:

@misc{indic-asr-2025,
  title={Indic ASR: Quantized Conformer for Indic Languages},
  author={Verma, Atharva},
  year={2025}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

indic_asr_onnx-0.1.0.tar.gz (5.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

indic_asr_onnx-0.1.0-py3-none-any.whl (5.9 kB view details)

Uploaded Python 3

File details

Details for the file indic_asr_onnx-0.1.0.tar.gz.

File metadata

  • Download URL: indic_asr_onnx-0.1.0.tar.gz
  • Upload date:
  • Size: 5.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for indic_asr_onnx-0.1.0.tar.gz
Algorithm Hash digest
SHA256 1491e20772b45bf1011afd2943c657c93630af025922b4541b53146d7374d899
MD5 22a91d19f25b27982d3e44c2899df52b
BLAKE2b-256 6dad33f0e9cf9dacff3cbf4973b51c13dc94897c3c50838877d07192062e6dc8

See more details on using hashes here.

File details

Details for the file indic_asr_onnx-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: indic_asr_onnx-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 5.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for indic_asr_onnx-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b10e4853c277646d10e728708f9ae25b39518f90323f73a9819cf531eb131d80
MD5 b134324c5b351a2629b0c4f950a72327
BLAKE2b-256 9cad7104bc56aeeaa66ef3c84a20f31ef091913911dc7dcfb50aba87f5f428b5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page