Skip to main content

kaldi-io for Tensorflow

Project description

A python package: provide a custom tensorflow dataset for kaldi io

Python is its wrapper, C++ is its backend implemention. It depends on two things:

Through [kaldi-io lib](https://github.com/open-speech/kaldi-io.git), it is able to:
- direct read from kaldi rspecifier(scp, ark, in text or binary, just as [kaldi](https://github.com/kaldi-asr/kaldi))
- support multiple feature transforms:
- delta
- cmvn
- splice
- sampling
- compute fast: kaldi Matrix|Vector with blas math lib is used

Through [tensorflow dataset](https://www.tensorflow.org/guide/datasets), it is able to:
- shuffling
- batching at frame or utt level
- bucketing with input sequence lengths
- and all other tensorflow native dataset manipulations and features (parellel, prefetch, ..)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tf_kaldi_io-0.3.0.tar.gz (15.4 kB view details)

Uploaded Source

File details

Details for the file tf_kaldi_io-0.3.0.tar.gz.

File metadata

  • Download URL: tf_kaldi_io-0.3.0.tar.gz
  • Upload date:
  • Size: 15.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.18.4 setuptools/39.1.0 requests-toolbelt/0.8.0 tqdm/4.23.3 CPython/3.6.3

File hashes

Hashes for tf_kaldi_io-0.3.0.tar.gz
Algorithm Hash digest
SHA256 22978a4f94b17e3b7e4b001772268e109582d971ffcab06bc8dc4388100d3d43
MD5 afcadd12e27a466c3791d659cdd204d6
BLAKE2b-256 05c78246d865e5d4f523e70b67c3d6eca2fab8595dabe91b33f20012c6a01a62

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page