Glue code connecting Kaldi data and Python.

kaldi-io-for-python

'Glue' code connecting Kaldi data and Python.

Supported data types

  • Vector (integer)
  • Vector (float, double)
  • Matrix (float, double)
  • Posterior (posteriors, nnet1 training targets, confusion networks, ...)

Examples

Reading features from an scp file:
import kaldi_io
# 'file' is the path to a feature scp file
for key, mat in kaldi_io.read_mat_scp(file):
  ...  # key is the utterance id, mat is a numpy matrix
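For orientation, each line of a Kaldi scp file pairs a key with the location of its data, commonly an ark path followed by a byte offset. The sketch below splits such a line using only the standard library. It is an illustration, not the code kaldi_io uses: the function name `parse_scp_line`, the key `utt001`, and the path are hypothetical, and range specifiers and piped commands are deliberately not handled.

```python
def parse_scp_line(line):
    """Split one scp line into (key, path, offset); offset may be None."""
    # The key is everything up to the first run of whitespace.
    key, rxspec = line.strip().split(None, 1)
    path, offset = rxspec, None
    # An optional ':<byte offset>' suffix points into an ark file.
    head, sep, tail = rxspec.rpartition(':')
    if sep and tail.isdigit():
        path, offset = head, int(tail)
    return key, path, offset


# Hypothetical scp line, for illustration only.
print(parse_scp_line("utt001 data/feats.ark:7"))
```

In practice `kaldi_io.read_mat_scp` does this work for you; the sketch only shows what the reader has to recover from each line.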
Writing a feature ark to a file or stream:
import kaldi_io
# 'feats' maps each key to its feature matrix
with open(ark_file, 'wb') as f:
  for key, mat in feats.items():  # items() works in Python 2 and 3
    kaldi_io.write_mat(f, mat, key=key)
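To make the ark layout less opaque, here is a rough standard-library sketch of how one binary float32 matrix entry is commonly laid out in a Kaldi ark: the key and a space, a binary marker, the `FM ` token, then row and column counts each preceded by a size byte, then the raw values. This reflects my understanding of the format and is not taken from the kaldi_io sources. The helper names are hypothetical, and little-endian byte order is an assumption. Use `kaldi_io.write_mat` for real data.

```python
import io
import struct


def write_float_matrix(fd, key, rows):
    """Write one float32 matrix (a list of equal-length rows) as an ark entry."""
    n_rows, n_cols = len(rows), len(rows[0])
    fd.write((key + ' ').encode('latin1'))          # key, then a space
    fd.write(b'\x00B')                              # binary-mode marker
    fd.write(b'FM ')                                # token: float32 matrix
    fd.write(b'\x04' + struct.pack('<i', n_rows))   # size byte + int32 rows
    fd.write(b'\x04' + struct.pack('<i', n_cols))   # size byte + int32 cols
    for row in rows:
        fd.write(struct.pack('<%df' % n_cols, *row))


def read_float_matrix(fd):
    """Read back one entry written by write_float_matrix."""
    key = b''
    ch = fd.read(1)
    while ch not in (b' ', b''):
        key += ch
        ch = fd.read(1)
    if fd.read(2) != b'\x00B' or fd.read(3) != b'FM ':
        raise ValueError('not a binary float32 matrix')
    # Each dimension is a size byte (skipped) followed by an int32.
    n_rows = struct.unpack('<i', fd.read(5)[1:])[0]
    n_cols = struct.unpack('<i', fd.read(5)[1:])[0]
    rows = [list(struct.unpack('<%df' % n_cols, fd.read(4 * n_cols)))
            for _ in range(n_rows)]
    return key.decode('latin1'), rows


# Round trip through an in-memory stream; the key is hypothetical.
buf = io.BytesIO()
write_float_matrix(buf, 'utt001', [[1.0, 2.0], [3.0, 4.5]])
buf.seek(0)
print(read_float_matrix(buf))
```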
Writing features as 'ark,scp' through a pipeline with 'copy-feats':
import kaldi_io
ark_scp_output = 'ark:| copy-feats --compress=true ark:- ark,scp:data/feats2.ark,data/feats2.scp'
# 'feats' maps each key to its feature matrix
with kaldi_io.open_or_fd(ark_scp_output, 'wb') as f:
  for key, mat in feats.items():
    kaldi_io.write_mat(f, mat, key=key)
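The output string in the pipeline example is a Kaldi-style specifier: an optional `ark:` prefix, then either a file name or a command attached with `|`. A leading `|` means the data is piped into the command, and a trailing `|` means data is read from the command's output. The sketch below classifies such strings in a simplified way. It is not the logic of `kaldi_io.open_or_fd`, and the function name `classify_spec` is hypothetical.

```python
def classify_spec(spec):
    """Return (kind, target) where kind is 'output-pipe', 'input-pipe' or 'file'."""
    # Drop a plain 'ark:' or 'scp:' prefix; option lists such as 'ark,t:'
    # are not handled in this simplified sketch.
    for prefix in ('ark:', 'scp:'):
        if spec.startswith(prefix):
            spec = spec[len(prefix):]
            break
    spec = spec.strip()
    if spec.startswith('|'):
        return 'output-pipe', spec[1:].strip()
    if spec.endswith('|'):
        return 'input-pipe', spec[:-1].strip()
    return 'file', spec


spec = 'ark:| copy-feats --compress=true ark:- ark,scp:data/feats2.ark,data/feats2.scp'
print(classify_spec(spec))
```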

Install

  • from PyPI:
pip install kaldi_io
  • from sources:
git clone https://github.com/vesis84/kaldi-io-for-python.git <kaldi-io-dir>
cd <kaldi-io-dir>
pip install -r requirements.txt
pip install --editable .

Note: it is recommended to set the KALDI_ROOT environment variable (export KALDI_ROOT=<some_kaldi_dir>). Pipe-based I/O can then call Kaldi binaries.
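Before relying on pipe-based I/O, it can help to check that the binary you intend to call is reachable under KALDI_ROOT. The sketch below assumes the usual Kaldi source layout, where compiled tools sit in `src/*bin` directories (for example `copy-feats` in `src/featbin`). The helper name `find_kaldi_binary` is hypothetical and is not part of kaldi_io.

```python
import os


def find_kaldi_binary(name, kaldi_root=None):
    """Return the path of a Kaldi binary under KALDI_ROOT, or None if not found."""
    kaldi_root = kaldi_root or os.environ.get('KALDI_ROOT')
    if not kaldi_root:
        return None
    src = os.path.join(kaldi_root, 'src')
    if not os.path.isdir(src):
        return None
    # Assumption: tools live in directories such as src/bin or src/featbin.
    for sub in sorted(os.listdir(src)):
        if not sub.endswith('bin'):
            continue
        candidate = os.path.join(src, sub, name)
        if os.path.isfile(candidate) and os.access(candidate, os.X_OK):
            return candidate
    return None


# Prints None when KALDI_ROOT is unset or the tool is missing.
print(find_kaldi_binary('copy-feats'))
```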

Unit tests

(note: these are not included in the PyPI package)

Unit tests are started this way:

./run_tests.sh

or by:

python3 -m unittest discover -s tests -t .
python2 -m unittest discover -s tests -t .

License

Apache License, Version 2.0 ('LICENSE-2.0.txt')

Community

  • accepting pull requests with extensions on GitHub
  • accepting feedback via GitHub 'Issues' in the repo

Download files

Source Distribution

kaldi_io-0.9.8.tar.gz (44.0 kB), source

Built Distributions

kaldi_io-0.9.8-py3-none-any.whl (16.7 kB), Python 3

kaldi_io-0.9.8-py2-none-any.whl (16.7 kB), Python 2
