Parallel CBS Transformer Based ASR
Project description
fujielab-asr-parallel-cbs
Parallel CBS Transformer based ASR system built on ESPnet
Overview
fujielab-asr-parallel-cbs is an automatic speech recognition (ASR) system based on ESPnet, featuring a parallelizable Contextual Block Streaming (CBS) Transformer.
Features
- Implementation of a parallel CBS Transformer Encoder extending the ESPnet framework
- Supports online and streaming ASR inference
Installation
PyPI Installation
You can install the package directly from PyPI:
pip install fujielab-asr-parallel-cbs
Local Installation
- Install the required Python packages:
pip install -e .
- If there are additional dependencies, please refer to
pyproject.toml.
Usage
Example: Running Inference
You can perform inference from an audio file using examples/run_streaming_asr.py:
python examples/run_streaming_asr.py
It will automatically download the pre-trained model from Hugging Face Hub and sample audio files from CSJ (Corpus of Spontaneous Japanese) official site.
Directory Structure
espnet_ext/: ESPnet extension implementationespnet/: Extensions for ESPnet1espnet2/: Extensions for ESPnet2 (ASR, transducer, joint network, etc.)
examples/: Sample audio and inference scriptswarprnnt_pytorch/: Dummy module for warprnnt_pytorch
License
This repository is released under the Apache License 2.0.
Acknowledgements
This project is based on the ESPnet framework and incorporates contributions from various open-source projects. We thank the ESPnet team and contributors for their work.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file fujielab_asr_parallel_cbs-0.0.1.tar.gz.
File metadata
- Download URL: fujielab_asr_parallel_cbs-0.0.1.tar.gz
- Upload date:
- Size: 42.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cc34b646ac05d00f4504aa41de9a26e8563af7cea0923ba9f9184329c2d72f76
|
|
| MD5 |
1fe09cdf249daf2f859ecc67f2665650
|
|
| BLAKE2b-256 |
6e16b29337ff450b54804287631182d2d3382f85ec25029ad15240fdf69ce911
|
File details
Details for the file fujielab_asr_parallel_cbs-0.0.1-py3-none-any.whl.
File metadata
- Download URL: fujielab_asr_parallel_cbs-0.0.1-py3-none-any.whl
- Upload date:
- Size: 41.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cadc0906b947f011b17be13bcfc8073fec7393511ede0183e7d64b84422aac7d
|
|
| MD5 |
88824fb84b8975c29fe650c902697706
|
|
| BLAKE2b-256 |
68eeab9bbaa77ab400a94d4fa052d09e5c8eccd9e721875746fbc373f2f1b2ce
|