Quantized IndicConformer ASR for multiple Indian languages
Project description
Indic ASR
A quantized automatic speech recognition (ASR) system for multiple Indic languages using the IndicConformer architecture.
Installation
CPU-only Installation (Recommended for limited resources)
pip install --extra-index-url https://download.pytorch.org/whl/cpu indic-asr
GPU Installation
pip install indic-asr
For uv users:
uv pip install --extra-index-url https://download.pytorch.org/whl/cpu indic-asr
Quick Start
from indic_asr import IndicConformerTranscriber
# Initialize (downloads model automatically)
transcriber = IndicConformerTranscriber()
# Transcribe audio
text = transcriber.transcribe_ctc("audio.wav", "hi") # Hindi
print(text)
Supported Languages
- Hindi (hi)
- Bengali (bn)
- Telugu (te)
- Marathi (mr)
- Tamil (ta)
- Gujarati (gu)
- Kannada (kn)
- Malayalam (ml)
- Odia (or)
- Punjabi (pa)
- Assamese (as)
Features
- Quantized Models: INT8 quantization for efficient CPU inference
- Multiple Languages: Support for 11 Indic languages
- Two Modes: CTC and RNN-T decoding
- Auto Download: Models download automatically on first use
- ONNX Runtime: Optimized inference with ONNX
Usage
CTC Mode (Faster)
text = transcriber.transcribe_ctc("audio.wav", "hi")
RNN-T Mode (More Accurate)
text = transcriber.transcribe_rnnt("audio.wav", "hi")
Audio Requirements
- Format: WAV, MP3, FLAC, etc.
- Sample Rate: Auto-resampled to 16kHz
- Channels: Mono (auto-converted)
Performance
- CPU Inference: ~50-100x real-time on modern CPUs
- Memory: ~200-500MB per inference
- Model Size: ~500MB (downloaded on first use)
License
MIT License
Citation
If you use this in your research, please cite:
@misc{indic-asr-2025,
title={Indic ASR: Quantized Conformer for Indic Languages},
author={Verma, Atharva},
year={2025}
}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
indic_asr_onnx-0.1.0.tar.gz
(5.7 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file indic_asr_onnx-0.1.0.tar.gz.
File metadata
- Download URL: indic_asr_onnx-0.1.0.tar.gz
- Upload date:
- Size: 5.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1491e20772b45bf1011afd2943c657c93630af025922b4541b53146d7374d899
|
|
| MD5 |
22a91d19f25b27982d3e44c2899df52b
|
|
| BLAKE2b-256 |
6dad33f0e9cf9dacff3cbf4973b51c13dc94897c3c50838877d07192062e6dc8
|
File details
Details for the file indic_asr_onnx-0.1.0-py3-none-any.whl.
File metadata
- Download URL: indic_asr_onnx-0.1.0-py3-none-any.whl
- Upload date:
- Size: 5.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b10e4853c277646d10e728708f9ae25b39518f90323f73a9819cf531eb131d80
|
|
| MD5 |
b134324c5b351a2629b0c4f950a72327
|
|
| BLAKE2b-256 |
9cad7104bc56aeeaa66ef3c84a20f31ef091913911dc7dcfb50aba87f5f428b5
|