Typhoon ASR
A user-friendly Python package for Thai speech recognition using the Typhoon ASR model.
Installation
pip install typhoon-asr
Quick Start
Python API
from typhoon_asr import transcribe
# Basic transcription
result = transcribe("audio.wav")
print(result['text'])
# With word timestamps (estimated)
result = transcribe("audio.wav", with_timestamps=True)
print(result['text'])
for ts in result['timestamps']:
    print(f"[{ts['start']:.2f}s - {ts['end']:.2f}s] {ts['word']}")
# Specify device
result = transcribe("audio.wav", device="cuda") # or "cpu", "auto"
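The timestamp loop above can be wrapped in a small reusable helper. This is a minimal sketch, assuming each entry in result['timestamps'] is a dict with the 'word', 'start', and 'end' keys used in the Quick Start; the helper name timestamps_to_lines and the sample data are hypothetical and not part of the package.

```python
from typing import Dict, List


def timestamps_to_lines(timestamps: List[Dict]) -> List[str]:
    """Format word timestamp entries as '[start - end] word' lines."""
    lines = []
    for ts in timestamps:
        lines.append(f"[{ts['start']:.2f}s - {ts['end']:.2f}s] {ts['word']}")
    return lines


# Hypothetical sample shaped like result['timestamps']; not real model output.
sample = [
    {"word": "สวัสดี", "start": 0.0, "end": 0.48},
    {"word": "ครับ", "start": 0.48, "end": 0.9},
]

for line in timestamps_to_lines(sample):
    print(line)
```

In real use, the list would come from transcribe("audio.wav", with_timestamps=True)['timestamps'] instead of the sample.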
Command Line
# Basic usage
typhoon-asr audio.wav
# With timestamps
typhoon-asr audio.wav --with-timestamps
# Specify device
typhoon-asr audio.wav --device cuda
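The same commands can also be driven from a script. The sketch below only assembles the argument list from the flags shown above (--with-timestamps and --device) and hands it to subprocess; build_command and run_cli are hypothetical helper names, and the format of the CLI's output is not documented here, so it is returned as raw text.

```python
import subprocess
from typing import List, Optional


def build_command(audio_path: str, with_timestamps: bool = False,
                  device: Optional[str] = None) -> List[str]:
    """Build the typhoon-asr argument list from the documented flags."""
    cmd = ["typhoon-asr", audio_path]
    if with_timestamps:
        cmd.append("--with-timestamps")
    if device is not None:
        cmd.extend(["--device", device])
    return cmd


def run_cli(audio_path: str, with_timestamps: bool = False,
            device: Optional[str] = None) -> str:
    """Run the CLI and return its stdout unparsed (output format is an unknown)."""
    completed = subprocess.run(
        build_command(audio_path, with_timestamps, device),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return completed.stdout


print(build_command("audio.wav", with_timestamps=True, device="cuda"))
```

Passing the arguments as a list avoids shell quoting problems with file names that contain spaces.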
Supported Formats
.wav, .mp3, .m4a, .flac, .ogg, .aac, .webm
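When processing a folder of mixed files, it can help to filter by extension before calling the package. The following is a sketch built only from the list above; is_supported is a hypothetical helper, and the case-insensitive match is an assumption, since the package may validate files differently.

```python
from pathlib import Path

# Extensions copied from the Supported Formats list.
SUPPORTED_EXTENSIONS = {".wav", ".mp3", ".m4a", ".flac", ".ogg", ".aac", ".webm"}


def is_supported(path: str) -> bool:
    """Return True if the file extension appears in the supported list.

    Assumption: matching is case-insensitive (".WAV" counts as ".wav").
    """
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS


candidates = ["interview.wav", "notes.txt", "clip.MP3"]
print([name for name in candidates if is_supported(name)])
```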
API Reference
transcribe(input_file, model_name="scb10x/typhoon-asr-realtime", with_timestamps=False, device="auto")
Parameters:
- input_file (str): Path to the audio file
- model_name (str): Hugging Face model identifier
- with_timestamps (bool): Generate estimated word timestamps
- device (str): Processing device ("auto", "cpu", or "cuda")
Returns:
dict with keys:
- text: Transcribed text
- timestamps: List of word timestamps (if enabled)
- processing_time: Processing duration in seconds
- audio_duration: Audio duration in seconds
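Because the result carries both processing_time and audio_duration, a real-time factor (processing time divided by audio length) can be derived from it. This is a minimal sketch, assuming both values are plain numbers in seconds as described above; real_time_factor is a hypothetical helper and the sample dict is made up for illustration.

```python
from typing import Dict, Optional


def real_time_factor(result: Dict) -> Optional[float]:
    """Return processing_time / audio_duration, or None for empty audio.

    Values below 1.0 mean transcription ran faster than real time.
    """
    duration = result["audio_duration"]
    if duration <= 0:
        return None
    return result["processing_time"] / duration


# Hypothetical result dict; not real measurements.
sample_result = {"text": "", "processing_time": 2.0, "audio_duration": 10.0}
print(real_time_factor(sample_result))
```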
Requirements
- Python ≥ 3.8
- CUDA (optional, for GPU acceleration)
License
Apache Software License 2.0
Download files
Source Distribution
typhoon_asr-0.1.1.tar.gz (4.0 kB)
Built Distribution
typhoon_asr-0.1.1-py3-none-any.whl (4.6 kB)
File details
Details for the file typhoon_asr-0.1.1.tar.gz.
File metadata
- Download URL: typhoon_asr-0.1.1.tar.gz
- Upload date:
- Size: 4.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.4
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 9a2cad67801346c19cf3637085a7c0545d751ef908d6e792ebe3c41603a4258a |
| MD5 | c66d86aeb726a3095d948ef498a64982 |
| BLAKE2b-256 | 2a3185e72d3d357231d59fa0bcbd1a8175ba41cc14a81b25ecde60d52bb02f79 |
File details
Details for the file typhoon_asr-0.1.1-py3-none-any.whl.
File metadata
- Download URL: typhoon_asr-0.1.1-py3-none-any.whl
- Upload date:
- Size: 4.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.4
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 9316bc8972e56cff07ee4e2cba41629cd14c38974ae7351025c9641659b1e596 |
| MD5 | c7f67944810a9cd4ae711f942bd27035 |
| BLAKE2b-256 | a6cdcc52df8ee0b44b3175247181e180c9d925ecfe1598af61b315b334e88bec |
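A manually downloaded file can be checked against the SHA256 digests listed for the two 0.1.1 files. The sketch below uses only the standard library; sha256_of and verify are hypothetical helper names, and the lookup assumes the file keeps its original name on disk.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file details for version 0.1.1.
EXPECTED_SHA256 = {
    "typhoon_asr-0.1.1.tar.gz":
        "9a2cad67801346c19cf3637085a7c0545d751ef908d6e792ebe3c41603a4258a",
    "typhoon_asr-0.1.1-py3-none-any.whl":
        "9316bc8972e56cff07ee4e2cba41629cd14c38974ae7351025c9641659b1e596",
}


def sha256_of(path: str, chunk_size: int = 65536) -> str:
    """Hash a file in chunks so large files are not loaded into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: str) -> bool:
    """Return True only if the file name is known and its digest matches."""
    expected = EXPECTED_SHA256.get(Path(path).name)
    return expected is not None and sha256_of(path) == expected
```

Calling verify("typhoon_asr-0.1.1.tar.gz") on a downloaded copy returns False if the file was altered or renamed.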