FASR: Fast Automatic Speech Recognition
Project description
🪐 项目: fasr
Fast Auto Speech Recognition
📋 简介
fasr是一款快速且易于使用的python库,它源于FunASR,专注于推理性能,目标是成为一个工业级别的python语音识别推理库。
📋 安装
fasr可以通过直接通过pip安装,但是如果需要使用gpu,需要手动安装pytorch和onnxruntime-gpu
pip install fasr --upgrade
📋 使用
- 下载模型
fasr prepare online # 实时语音识别模型
fasr prepare offline # 离线语音识别模型
- 构建离线语音识别pipeline
from fasr import AudioPipeline
# 语音识别pipeline: 端点检测->语音识别->添加标点
asr = AudioPipeline().add_pipe('detector').add_pipe('recognizer').add_pipe('sentencizer')
# 运行pipeline并获取文本
url = "https://xxxxx.mp3"
audio = asr(url)
for channel in audio.channels:
print(channel.text)
# 批次处理
urls = get_urls()
audios = asr.run(urls)
# 获取文本
for audio in audios:
for channel in audio.channels:
print(channel.text)
- 构建实时语音识别pipeline
from fasr import AudioPipeline, Audio
asr = AudioPipeline().add_pipe("online_detector").add_pipe("online_recognizer")
# 模拟流式接收音频流
waveform, sample_rate, is_last = get_waveform_from_stream()
audio_chunk = Audio().append_channel()
audio_chunk.is_last = ia_last
audio_chunk.channels[0].waveform = waveform
audio_chunk.channels[0].sample_rate = sample_rate
for token in asr(audio_chunk).channels[0].stream:
print(token.text)
# 清除cache,恢复组件初始状态
asr.get_pipe("online_recognizer").component.reset()
asr.get_pipe("online_detector").component.reset()
📋 性能对比
双通道音频
vad->asr->punc
测试结果
cpu: Intel(R) Xeon(R) Silver 4210 CPU @ 2.20GHz (wpai cpu 2核)
gpu: rtx6000 (wpai vgpu 20)
| 框架 | 耗时 | 推理速度 | 加速比 |
|---|---|---|---|
| funasr | 368.8s | 46.34 | 1.0 |
| fasr | 153.92s | 111.03 | 2.4 |
vad
测试结果
cpu: Intel(R) Xeon(R) Silver 4210 CPU @ 2.20GHz (wpai cpu 2核)
gpu: rtx6000 (wpai vgpu 20)
| 框架 | 耗时 | 推理速度 | 加速比 |
|---|---|---|---|
| funasr | 219.8s | 77.75 | 1.0 |
| fasr | 86.32s | 197.98 | 2.55 |
单通道音频
pipeline
vad->asr->punc
测试结果
cpu: Intel(R) Xeon(R) Silver 4210 CPU @ 2.20GHz (wpai cpu 2核)
gpu: rtx6000 (wpai vgpu 20)
| 框架 | 耗时 | 推理速度 | 加速比 |
|---|---|---|---|
| funasr | 123.8s | 22.05 | 1.0 |
| fasr | 59.04s | 46.24 | 2.1 |
vad
测试结果
cpu: Intel(R) Xeon(R) Silver 4210 CPU @ 2.20GHz (wpai cpu 2核)
gpu: rtx6000 (wpai vgpu 20)
| 框架 | 耗时 | 推理速度 | 加速比 |
|---|---|---|---|
| funasr | 59.26s | 46.07 | 1.0 |
| fasr | 36.84s | 74.1 | 1.61 |
AISHELL
pipeline
vad->asr->punc
测试结果
cpu: Intel(R) Xeon(R) Silver 4210 CPU @ 2.20GHz (wpai cpu 2核)
gpu: rtx6000 (wpai vgpu 20)
| 框架 | 耗时 | 推理速度 | 加速比 |
|---|---|---|---|
| funasr | 123.8s | 18.65 | 1.0 |
| fasr | 59.04s | 32.71 | 1.8 |
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file fasr-0.2.9.tar.gz.
File metadata
- Download URL: fasr-0.2.9.tar.gz
- Upload date:
- Size: 5.8 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.6.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
757d259f8f2d2b9004429614ab4ee6b2923da2900ceaf672c18d50ed59f745ea
|
|
| MD5 |
8fa3612d63b2d732d41d7c61260df881
|
|
| BLAKE2b-256 |
2f3bf7e04e904e9b3227ce82de3dce6323ecd1a7baf214d89d6013e39ea38097
|
File details
Details for the file fasr-0.2.9-py3-none-any.whl.
File metadata
- Download URL: fasr-0.2.9-py3-none-any.whl
- Upload date:
- Size: 5.8 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.6.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
46ee7673b1ab6b43938c515a51334fe7acb605d163691d54a0e35189963d41b9
|
|
| MD5 |
938a1670c6ddc36782b0c8d429b5a51d
|
|
| BLAKE2b-256 |
a80da8c63a675dcafda68436c2d9ca25873bf1560cf2db7f610e78b7bb19c89b
|