Skip to main content

Scripts for preparing language data for use with Kaldi ASR

Project description

# CoEDL Kaldi Helpers <img src="docs/img/kh.png" align="right"/>

A set of scripts to use in preparing a corpus for speech-to-text processing with the [Kaldi](http://kaldi-asr.org/)
Automatic Speech Recognition Library.

Read about [setting up Docker](https://github.com/CoEDL/elpis/wiki/2018-summer-workshop-preparation) to run all this.

For more information about data requirements, see the
[data guide](https://github.com/CoEDL/elpis/wiki/2018-summer-workshop-preparation).

## Requirements
This pipeline relies on Python 3.6 and several open-source Python packages (listed [here](./requirements.txt)).
It also assumes you have Kaldi, [sox](http://sox.sourceforge.net/) and [task](https://taskfile.org/) installed. We
highly recommend using [our docker image](https://github.com/CoEDL/elpis/wiki/2018-summer-workshop-preparation).

## Tasks
This library uses the [task](https://taskfile.org) tool to run the more complex processes automatically. Once
you've set up Kaldi Helpers, you can run the various pipeline tasks we've developed (or out of the box in the docker
image). You can read about these tasks [here](https://github.com/CoEDL/elpis/wiki/tasks).

## Workflow
<p align="center">
<img src="docs/img/elpis-pipeline.svg"/>
</p>

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kaldi_helpers-0.22.tar.gz (22.1 kB view details)

Uploaded Source

Built Distribution

kaldi_helpers-0.22-py3-none-any.whl (34.6 kB view details)

Uploaded Python 3

File details

Details for the file kaldi_helpers-0.22.tar.gz.

File metadata

  • Download URL: kaldi_helpers-0.22.tar.gz
  • Upload date:
  • Size: 22.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.29.1 CPython/3.7.0

File hashes

Hashes for kaldi_helpers-0.22.tar.gz
Algorithm Hash digest
SHA256 110d1087be2b89a25ec63bbec5b0d43db45597c36f9b67ad58b5dd5355ef218f
MD5 7470936aa955ec0864b41e6fd184d4d9
BLAKE2b-256 9e054a5e2724c981b7ac1c29f82683fa68edabe0871a85c109f16172c94d27e7

See more details on using hashes here.

File details

Details for the file kaldi_helpers-0.22-py3-none-any.whl.

File metadata

  • Download URL: kaldi_helpers-0.22-py3-none-any.whl
  • Upload date:
  • Size: 34.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.29.1 CPython/3.7.0

File hashes

Hashes for kaldi_helpers-0.22-py3-none-any.whl
Algorithm Hash digest
SHA256 1c2b01c69964636c20c453cbfccb656085447a7ad440a5ceece70bb96e1a3fbd
MD5 3b6ccd0edf7e9fa72d15be06f859d67c
BLAKE2b-256 699c05e3569346e67bb0aceeaf6a45fc1fbba0480e2f05965c729d232b2bee70

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page