A pure Python I/O interface for accessing Kaldi data
Kaldi Python IO
A Python (3.6+) wrapper for reading and writing Kaldi data.
Supported Types
- Kaldi's binary archives (*.ark)
- Kaldi's scripts (alignments & features, *.scp)
- Kaldi nnet3 data examples in binary (*.egs)
Install
python setup.py install
or
pip install kaldi-python-io
Usage
- ArchiveReader and AlignArchiveReader
from kaldi_python_io import ArchiveReader, AlignArchiveReader

# Archive readers support sequential access only
ark_reader = ArchiveReader("copy-feats ark:foo.ark ark:- |")
for key, _ in ark_reader:
    print(key)
ali_reader = AlignArchiveReader("gunzip -c foo.ali.gz |")
for key, _ in ali_reader:
    print(key)
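Both readers above are given a command that ends in `|`, which in Kaldi's convention means "read from the output of this command" rather than from a file on disk. The sketch below illustrates that convention in plain Python. `open_rspecifier` is a hypothetical helper written for this example; it is not part of kaldi-python-io and does not claim to mirror how the library handles pipes internally.

```python
import subprocess


def open_rspecifier(spec):
    """Open a Kaldi-style input: a shell command if it ends with '|', else a file.

    Hypothetical helper for illustration only; not the kaldi-python-io API.
    """
    spec = spec.strip()
    if spec.endswith("|"):
        # Drop the trailing pipe, run the rest through the shell, and
        # expose the command's stdout as a binary stream.
        proc = subprocess.Popen(spec[:-1], shell=True, stdout=subprocess.PIPE)
        return proc.stdout
    # No trailing pipe: treat the string as an ordinary file path.
    return open(spec, "rb")


stream = open_rspecifier("echo hello |")
data = stream.read()
stream.close()
print(data)
```

A stream opened this way can only be read front to back, which is consistent with the archive readers allowing sequential access only.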
- Nnet3EgsReader
from kaldi_python_io import Nnet3EgsReader

# Egs readers support sequential access only
egs_reader = Nnet3EgsReader("foo.egs")
for key, _ in egs_reader:
    print(key)
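- ArchiveWriter
import numpy as np
from kaldi_python_io import ArchiveWriter

# Write ten random 100 x 20 matrices to foo.ark and index them in foo.scp
with ArchiveWriter("foo.ark", "foo.scp") as writer:
    for i in range(10):
        mat = np.random.rand(100, 20)
        writer.write(f"mat-{i}", mat)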
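To make the contents of a binary archive less opaque, here is a minimal sketch of how a single float matrix entry is laid out in Kaldi's binary format: the key, a space, the binary marker `\0B`, the type token `FM `, then each dimension as a one-byte size marker followed by a little-endian int32, then the row-major float32 data. The functions `encode_float_matrix` and `decode_float_matrix` are hypothetical names for this example; this is not the code ArchiveWriter uses, and it covers only uncompressed single-precision matrices.

```python
import struct


def encode_float_matrix(key, rows):
    """Encode one (key, matrix) entry in Kaldi's binary float-matrix layout.

    Illustrative sketch only; not the kaldi-python-io implementation.
    """
    num_rows, num_cols = len(rows), len(rows[0])
    flat = [value for row in rows for value in row]
    # Key, separating space, binary marker "\0B", then the type token "FM ".
    header = key.encode() + b" " + b"\0B" + b"FM "
    # Each int32 dimension is preceded by a one-byte size marker (4).
    dims = struct.pack("<bibi", 4, num_rows, 4, num_cols)
    body = struct.pack(f"<{len(flat)}f", *flat)
    return header + dims + body


def decode_float_matrix(blob):
    """Decode a single entry produced by encode_float_matrix."""
    key, rest = blob.split(b" ", 1)
    if rest[:5] != b"\0BFM ":
        raise ValueError("expected a binary float matrix")
    _, num_rows, _, num_cols = struct.unpack("<bibi", rest[5:15])
    flat = struct.unpack(f"<{num_rows * num_cols}f", rest[15:])
    matrix = [list(flat[r * num_cols:(r + 1) * num_cols])
              for r in range(num_rows)]
    return key.decode(), matrix


blob = encode_float_matrix("mat-0", [[1.0, 2.5], [-3.0, 0.25]])
key, matrix = decode_float_matrix(blob)
print(key, matrix)
```

The values in the example are exactly representable in float32, so the round trip returns them unchanged; arbitrary Python floats would come back rounded to single precision.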
- ScriptReader and AlignScriptReader
from kaldi_python_io import ScriptReader, AlignScriptReader

# Script readers support both sequential and random access
scp_reader = ScriptReader("shuf foo.scp | head -n 2")
for key, mat in scp_reader:
    print(f"{key}: {mat.shape}")
ali_reader = AlignScriptReader("foo.ali.scp")
for key, ali in ali_reader:
    print(f"{key}: {ali.shape}")
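Script files can be indexed randomly because each line of a Kaldi `.scp` file typically maps a key to a location of the form `path:offset`, so a reader can seek straight to the entry it needs. The sketch below shows the idea with a stand-in byte file. `parse_scp_line` and `read_at` are hypothetical helpers for this example, not functions from kaldi-python-io, and the sketch handles only the plain `key path:offset` form (no range specifiers).

```python
import os
import tempfile


def parse_scp_line(line):
    """Split a 'key path:offset' line into (key, path, offset)."""
    key, target = line.strip().split(None, 1)
    # Split on the last colon so paths that contain colons still work.
    path, _, offset = target.rpartition(":")
    return key, path, int(offset)


# Stand-in for an archive: three 4-byte records laid out back to back.
with tempfile.NamedTemporaryFile(delete=False) as f:
    f.write(b"AAAABBBBCCCC")
    ark_path = f.name

scp_lines = [f"utt1 {ark_path}:0", f"utt2 {ark_path}:4", f"utt3 {ark_path}:8"]

# Build the key -> (path, offset) index once.
index = {}
for line in scp_lines:
    key, path, offset = parse_scp_line(line)
    index[key] = (path, offset)


def read_at(key, num_bytes=4):
    """Seek directly to the record for `key` and read it."""
    path, offset = index[key]
    with open(path, "rb") as f:
        f.seek(offset)
        return f.read(num_bytes)


chunk = read_at("utt3")
os.remove(ark_path)
print(chunk)
```

With such an index in memory, any key can be fetched without scanning the entries before it, which is what distinguishes script-based access from streaming an archive.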
Source Distribution
kaldi-python-io-1.2.2.tar.gz (8.8 kB)
File details
Details for the file kaldi-python-io-1.2.2.tar.gz.
File metadata
- Download URL: kaldi-python-io-1.2.2.tar.gz
- Upload date:
- Size: 8.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.22.0 setuptools/50.3.0.post20201006 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.5
File hashes
Algorithm | Hash digest
---|---
SHA256 | 4ebb4029c6c58296cc0abf96edff02832ba341d290ed37624a8d00105f0f7c00
MD5 | d69d674b9e310b347f3f7e20d8213dce
BLAKE2b-256 | 8045e3e542ffa8970ebd782fcece35e2295de9c60e8c396c2c1a403410d1b24e