A library to perform automatic speech recognition with huggingface transformers.
Project description
Elpis Core Library
The Core Elpis Library, providing a quick api to :hugs: transformers for automatic-speech-recognition.
You can use the library to:
- Perform standalone inference using a pretrained HFT model.
- Fine tune a pretrained ASR model on your own dataset.
- Generate text and Elan files from inference results for further analysis.
Documentation
Documentation for the library can be be found here.
Dependencies
While we try to be as machine-independant as possible, there are some dependencies you should be aware of when using this library:
- Processing datasets (
elpis.datasets.processing
) requireslibrosa
, which depends on havinglibsndfile
installed on your computer. If you're using elpis within a docker container, you may have to manually installlibsndfile
. - Transcription (
elpis.transcription.transcribe
) requiresffmpeg
if your audio you're attempting to transcribe needs to be resampled before it can be used. The default sample rate we assume is 16khz. - The preprocessing flow (
elpis.datasets.preprocessing
) is free of external dependencies.
Installation
You can install the elpis library with:
pip3 install elpis
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
elpis-0.1.1.tar.gz
(12.8 kB
view hashes)
Built Distribution
elpis-0.1.1-py3-none-any.whl
(16.3 kB
view hashes)