FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar

Project description

k2

The vision of k2 is to seamlessly integrate Finite State Automaton (FSA) and Finite State Transducer (FST) algorithms into autograd-based machine learning toolkits like PyTorch and TensorFlow. For speech recognition applications, this should make it easy to interpolate and combine various training objectives such as cross-entropy, CTC and MMI, and to jointly optimize a speech recognition system with multiple decoding passes, including lattice rescoring and confidence estimation. We hope k2 will have many other applications as well.

One of the key algorithms that we have implemented is pruned composition of a generic FSA with a "dense" FSA (i.e. one that corresponds to log-probs of symbols at the output of a neural network). This can be used as a fast implementation of decoding for ASR, and for CTC and LF-MMI training. This will not give a direct advantage in Word Error Rate over existing technology; the point is to do this in a much more general and extensible framework that allows further development of ASR technology.
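To make the idea of pruned composition with a dense FSA concrete, here is a minimal pure-Python sketch. It is not k2's implementation or API: the `Arc` record, the function name `pruned_best_score`, and the convention that state 0 is the start state are all assumptions made for illustration. It walks a small FSA frame by frame against a matrix of per-frame log-probs and drops any hypothesis that falls more than `beam` below the best one on that frame. The real algorithm runs in parallel on the GPU and produces a lattice rather than a single score.

```python
import math
from collections import namedtuple

# Hypothetical arc record for this sketch: source state, destination state,
# symbol index, and log-weight of the arc itself.
Arc = namedtuple("Arc", "src dst label score")


def pruned_best_score(arcs, final_state, log_probs, beam):
    """Best total log-score of a path that consumes one symbol per frame.

    log_probs[t][s] is the network's log-prob of symbol s on frame t (the
    "dense" side). Hypotheses scoring more than `beam` below the best
    hypothesis on the same frame are discarded.
    """
    active = {0: 0.0}  # state -> best log-score so far; state 0 is the start
    for frame in log_probs:
        nxt = {}
        for arc in arcs:
            if arc.src in active:
                score = active[arc.src] + arc.score + frame[arc.label]
                if score > nxt.get(arc.dst, -math.inf):
                    nxt[arc.dst] = score
        if not nxt:
            return -math.inf
        cutoff = max(nxt.values()) - beam
        active = {s: v for s, v in nxt.items() if v >= cutoff}
    return active.get(final_state, -math.inf)


# Toy graph: stay in state 0 on symbol 0, move to state 1 on symbol 1,
# then stay in state 1 on symbol 1.
toy_arcs = [Arc(0, 0, 0, 0.0), Arc(0, 1, 1, 0.0), Arc(1, 1, 1, 0.0)]
toy_log_probs = [
    [math.log(0.9), math.log(0.1)],
    [math.log(0.2), math.log(0.8)],
    [math.log(0.3), math.log(0.7)],
]
best = pruned_best_score(toy_arcs, final_state=1, log_probs=toy_log_probs, beam=1.0)
```

A narrower beam keeps fewer states alive per frame, which is where the speed comes from; the cost is that the true best path can be pruned if it looks poor early on.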

Implementation

A few key points on our implementation strategy.

Most of the code is in C++ and CUDA. We implement a templated class Ragged, which is quite like TensorFlow's RaggedTensor (actually we came up with the design independently, and were later told that TensorFlow was using the same ideas). Despite a close similarity at the level of data structures, the design is quite different from TensorFlow and PyTorch. Most of the time we don't use composition of simple operations, but rely on C++11 lambdas defined directly in the C++ implementations of algorithms. The code in these lambdas operates directly on data pointers and, if the backend is CUDA, it can run in parallel for each element of a tensor. (The C++ and CUDA code is mixed together and the CUDA kernels get instantiated via templates.)
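As a rough illustration of the ragged data structure described above, the following Python sketch stores a list of variable-length rows as one flat `values` list plus a `row_splits` index. The class and method names here are assumptions chosen for the sketch (the `row_splits` naming follows the convention used by TensorFlow's RaggedTensor); the actual class is a C++ template and supports more than two axes.

```python
class Ragged:
    """Minimal two-axis ragged array.

    Row i occupies values[row_splits[i]:row_splits[i + 1]], so rows may have
    different lengths (including zero) while the data stays contiguous.
    """

    def __init__(self, rows):
        self.values = [v for row in rows for v in row]
        self.row_splits = [0]
        for row in rows:
            self.row_splits.append(self.row_splits[-1] + len(row))

    def num_rows(self):
        return len(self.row_splits) - 1

    def row(self, i):
        return self.values[self.row_splits[i]:self.row_splits[i + 1]]

    def row_ids(self):
        """For every element, the index of the row that contains it.

        With this lookup each element can be processed on its own, without
        walking the rows sequentially.
        """
        ids = []
        for i in range(self.num_rows()):
            ids.extend([i] * (self.row_splits[i + 1] - self.row_splits[i]))
        return ids


r = Ragged([[1, 2], [], [3, 4, 5]])
```

Keeping the data flat is what allows a per-element function to be launched over `values` directly, with `row_ids` telling each element which row it belongs to.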

It is difficult to adequately describe what we are doing with these Ragged objects without going in detail through the code. The algorithms look very different from the way you would code them on CPU because of the need to avoid sequential processing. We are using coding patterns that make the most expensive parts of the computations "embarrassingly parallelizable"; the only somewhat nontrivial CUDA operations are generally reduction-type operations such as exclusive-prefix-sum, for which we use NVIDIA's CUB library. Our design is not too specific to NVIDIA hardware, and the bulk of the code we write is fairly normal-looking C++; the nontrivial CUDA programming is mostly done via the CUB library, parts of which we wrap with our own convenient interface.
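The coding pattern described above can be sketched in plain Python. The example below filters a list while preserving order: every step is an independent per-element operation except one exclusive prefix sum, which assigns each surviving element its output position. The function names are hypothetical, and this `exclusive_sum` returns one more entry than its input so that the last entry is the total (a convenience assumed for this sketch, not a statement about the CUB interface).

```python
from itertools import accumulate


def exclusive_sum(xs):
    """Return [0, xs[0], xs[0] + xs[1], ...]; the final entry is sum(xs)."""
    return [0] + list(accumulate(xs))


def compact(values, keep):
    """Keep values[i] where keep(values[i]) is true, preserving order."""
    # Step 1: one independent test per element.
    flags = [1 if keep(v) else 0 for v in values]
    # Step 2: the only reduction-type step; pos[i] is where element i would land.
    pos = exclusive_sum(flags)
    # Step 3: one independent write per element; no two writes hit the same slot.
    out = [None] * pos[-1]
    for i, v in enumerate(values):
        if flags[i]:
            out[pos[i]] = v
    return out


kept = compact([5, -1, 7, -3, 2], lambda v: v > 0)
```

The same prefix sum turns a list of row lengths into row boundaries, which is why it shows up so often when working with ragged data.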

The Finite State Automaton object is then implemented as a Ragged tensor templated on a specific data type (a struct representing an arc in the automaton).
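A small Python sketch of this layout, under stated assumptions: the `Arc` field names and the helper functions below are illustrative and are not taken from the k2 headers. The arcs are sorted by source state and stored flat, and a `row_splits` list indexed by state says where each state's outgoing arcs begin and end.

```python
from collections import namedtuple

# Hypothetical arc struct for this sketch.
Arc = namedtuple("Arc", "src_state dest_state label score")


def build_fsa(arcs, num_states):
    """Return (row_splits, sorted_arcs).

    The arcs leaving state s are sorted_arcs[row_splits[s]:row_splits[s + 1]].
    """
    sorted_arcs = sorted(arcs, key=lambda a: a.src_state)  # stable sort
    counts = [0] * num_states
    for arc in sorted_arcs:
        counts[arc.src_state] += 1
    row_splits = [0]
    for c in counts:
        row_splits.append(row_splits[-1] + c)
    return row_splits, sorted_arcs


def arcs_leaving(fsa, state):
    row_splits, sorted_arcs = fsa
    return sorted_arcs[row_splits[state]:row_splits[state + 1]]


fsa = build_fsa(
    [Arc(1, 2, -1, 0.0), Arc(0, 1, 3, -0.5), Arc(0, 0, 2, -1.0)],
    num_states=3,
)
```

With this representation, "rows" are states and "elements" are arcs, so any per-arc computation can be applied to the flat arc list directly.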

Autograd

If you look at the code as it exists now, you won't find any references to autograd. The design is quite different from that of TensorFlow and PyTorch (which is why we didn't simply extend one of those toolkits). Instead of making autograd come from the bottom up (by making individual operations differentiable) we are implementing it from the top down, which is much more efficient in this case (and will tend to have better roundoff properties).

An example: suppose we are finding the best path of an FSA, and we need derivatives. We implement this by recording, for each arc in the output best path, which input arc it corresponds to. (For more complex algorithms, an arc in the output might correspond to a sum of probabilities of a list of input arcs.) We can make this compatible with PyTorch/TensorFlow autograd at the Python level by, for example, defining a Function class in PyTorch that remembers this relationship between the arcs and does the appropriate (sparse) operations to propagate back the derivatives w.r.t. the weights.
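The best-path example can be sketched without any toolkit. In the pure-Python illustration below, the forward pass returns the best score together with an arc map (the indices of the input arcs on the best path), and the backward pass scatters the incoming gradient onto exactly those arcs. The function names, the `(src, dst, score)` tuple layout, and the requirement that every arc satisfies src < dst are assumptions made for this sketch; in PyTorch the two functions would roughly correspond to the forward and backward methods of a `torch.autograd.Function`.

```python
import math


def best_path(arcs, num_states):
    """Return (total_score, arc_map) for the best path from state 0 to the
    last state. arcs is a list of (src, dst, score) with src < dst."""
    best = [-math.inf] * num_states
    back = [None] * num_states  # index of the best arc entering each state
    best[0] = 0.0
    # Visit arcs in order of source state, so best[src] is final when used.
    for idx in sorted(range(len(arcs)), key=lambda i: arcs[i][0]):
        src, dst, score = arcs[idx]
        if best[src] + score > best[dst]:
            best[dst] = best[src] + score
            back[dst] = idx
    if best[-1] == -math.inf:
        raise ValueError("final state is not reachable")
    arc_map = []
    state = num_states - 1
    while state != 0:
        arc_map.append(back[state])
        state = arcs[back[state]][0]
    arc_map.reverse()
    return best[-1], arc_map


def best_path_backward(arc_map, num_arcs, grad_total=1.0):
    """Gradient of the best-path score w.r.t. every input arc score: a sparse
    scatter of grad_total onto the arcs recorded in arc_map."""
    grad = [0.0] * num_arcs
    for idx in arc_map:
        grad[idx] += grad_total
    return grad


demo_arcs = [(0, 1, -1.0), (0, 1, -0.5), (1, 2, -0.2), (0, 2, -1.0)]
total, arc_map = best_path(demo_arcs, num_states=3)
grads = best_path_backward(arc_map, len(demo_arcs))
```

Nothing inside the search itself needs to be differentiable: the arc map alone carries enough information to route the derivatives back to the input weights.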

Current state of the code

We have wrapped all of the C++ code for Python using pybind11 and have finished the integration with PyTorch.

We are currently writing speech recognition recipes using k2, which are hosted in a separate repository. Please see https://github.com/k2-fsa/icefall.

Plans after initial release

We are currently trying to make k2 ready for production use (see the branch v2.0-pre).

Quick start

Want to try it out without installing anything? We have set up a Google Colab notebook. You can find more Colab notebooks using k2 in speech recognition at https://icefall.readthedocs.io/en/latest/recipes/librispeech/conformer_ctc.html.

Release history

This version

1.24.1

Apr 30, 2023

1.24.0

Apr 27, 2023

1.23.4

Jan 30, 2023

1.23.3

Jan 5, 2023

1.23.2

Nov 25, 2022

1.23.1

Nov 24, 2022

1.22

Nov 10, 2022

1.21

Oct 7, 2022

1.20

Sep 24, 2022

1.19

Aug 23, 2022

1.18

Aug 15, 2022

1.17

Jul 4, 2022

1.16

Jun 19, 2022

1.15.1

Apr 18, 2022

1.15

Apr 18, 2022

1.14

Mar 16, 2022

1.13

Jan 29, 2022

1.12

Jan 25, 2022

1.11

Nov 29, 2021

1.10

Nov 2, 2021

1.9

Sep 20, 2021

1.9.dev20211024 pre-release

Oct 24, 2021

1.8

Sep 14, 2021

1.7

Sep 8, 2021

1.6

Aug 24, 2021

1.5

Aug 23, 2021

1.4

Aug 21, 2021

1.3

Jul 30, 2021

1.2

Jul 9, 2021

1.1

Jul 6, 2021

1.1.dev20210706 pre-release

Jul 6, 2021

1.0

Jun 17, 2021

1.0.dev20210619 pre-release

Jun 19, 2021

1.0.dev20210618 pre-release

Jun 18, 2021

1.0.dev20210617 pre-release

Jun 17, 2021

0.3.5

Jun 8, 2021

0.3.5.dev20210608 pre-release

Jun 8, 2021

0.3.5.dev20210606 pre-release

Jun 6, 2021

0.3.5.dev20210605 pre-release

Jun 5, 2021

0.3.4.dev20210515 pre-release

May 15, 2021

0.3.4.dev20210512 pre-release

May 12, 2021

0.3.4.dev20210511 pre-release

May 11, 2021

0.3.3.dev20210501 pre-release

May 1, 2021

0.3.3.dev20210426 pre-release

Apr 26, 2021

0.3.3.dev20210425 pre-release

Apr 25, 2021

0.3.3.dev20210421 pre-release

Apr 21, 2021

0.3.3.dev20210415 pre-release

Apr 15, 2021

0.3.3.dev20210411 pre-release

Apr 11, 2021

0.3.3.dev20210409 pre-release

Apr 9, 2021

0.3.3.dev20210331 pre-release

Mar 31, 2021

0.3.3.dev20210328 pre-release

Mar 28, 2021

0.3.3.dev20210321 pre-release

Mar 21, 2021

0.3.3.dev20210309 pre-release

Mar 9, 2021

0.3.3.dev20210305 pre-release

Mar 5, 2021

0.3.3.dev20210302 pre-release

Mar 2, 2021

0.3.3.dev20210224 pre-release

Feb 24, 2021

0.3.3.dev20210222 pre-release

Feb 22, 2021

0.3.3.dev20210218 pre-release

Feb 18, 2021

0.3.3.dev20210209 pre-release

Feb 9, 2021

0.3.3.dev20210206 pre-release

Feb 6, 2021

0.3.3.dev20210205 pre-release

Feb 5, 2021

0.3.2.dev20210204 pre-release

Feb 4, 2021

0.3.1.dev20210204 pre-release

Feb 4, 2021

0.3.1.dev20210127 pre-release

Jan 27, 2021

0.3.1.dev20210121 pre-release

Jan 21, 2021

0.3.0

Jan 21, 2021

0.1.3.dev20210111 pre-release

Jan 11, 2021

0.1.2

Dec 28, 2020

0.1.2.dev20210111 pre-release

Jan 11, 2021

0.1.1.dev20201216 pre-release

Dec 16, 2020

0.1.1.dev20201212 pre-release

Dec 12, 2020

0.1.1.dev20201210 pre-release

Dec 10, 2020

0.1.1.dev20201208 pre-release

Dec 8, 2020

0.1.1.dev20201130 pre-release

Nov 30, 2020

0.1.1.dev20201118 pre-release

Nov 18, 2020

0.1.1.dev20201116 pre-release

Nov 16, 2020

0.1

Nov 16, 2020

0.0.10.dev20201110 pre-release

Nov 10, 2020

0.0.9.dev20201109 pre-release

Nov 9, 2020

0.0.8.dev20201109 pre-release

Nov 9, 2020

0.0.7.dev20201028 pre-release

Oct 28, 2020

0.0.6.dev20201028 pre-release

Oct 28, 2020

0.0.5.dev20201028 pre-release

Oct 28, 2020

0.0.5.dev20201027 pre-release

Oct 27, 2020

0.0.4.dev20201027 pre-release

Oct 26, 2020

0.0.3.dev20201027 pre-release

Oct 26, 2020

0.0.3.dev20201025 pre-release

Oct 25, 2020

0.0.3.dev20201020 pre-release

Oct 20, 2020

0.0.3.dev20201019 pre-release

Oct 19, 2020

0.0.2.dev0 pre-release

Oct 17, 2020

0.0.1.dev0 pre-release yanked

Oct 17, 2020

Reason this release was yanked:

fix a bug

Download files

Source Distributions

No source distribution files are available for this release.

Built Distributions

k2-1.24.1-cp310-cp310-macosx_10_15_x86_64.whl (2.4 MB)

Uploaded Apr 30, 2023; CPython 3.10, macOS 10.15+ x86-64

k2-1.24.1-cp39-cp39-macosx_10_15_x86_64.whl (2.4 MB)

Uploaded Apr 30, 2023; CPython 3.9, macOS 10.15+ x86-64

k2-1.24.1-cp38-cp38-macosx_10_15_x86_64.whl (2.4 MB)

Uploaded Apr 30, 2023; CPython 3.8, macOS 10.15+ x86-64

k2-1.24.1-cp37-cp37m-macosx_10_15_x86_64.whl (2.4 MB)

Uploaded Apr 30, 2023; CPython 3.7m, macOS 10.15+ x86-64

File details

Details for the file k2-1.24.1-cp310-cp310-macosx_10_15_x86_64.whl.

File metadata

Download URL: k2-1.24.1-cp310-cp310-macosx_10_15_x86_64.whl
Upload date: Apr 30, 2023
Size: 2.4 MB
Tags: CPython 3.10, macOS 10.15+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.10.8

File hashes

Hashes for k2-1.24.1-cp310-cp310-macosx_10_15_x86_64.whl
Algorithm	Hash digest
SHA256	`d17d0c7115eb500f51025cd95048ae15b75d12a603776e0ad2d588622ac09bc6`
MD5	`bc3eeb75b9381121cdb297385c6a408c`
BLAKE2b-256	`42b59fc13bcab6fa65ee3e9faf6b9d033518838a3daf2a8ba9280f230c49bba8`

File details

Details for the file k2-1.24.1-cp39-cp39-macosx_10_15_x86_64.whl.

File metadata

Download URL: k2-1.24.1-cp39-cp39-macosx_10_15_x86_64.whl
Upload date: Apr 30, 2023
Size: 2.4 MB
Tags: CPython 3.9, macOS 10.15+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.14

File hashes

Hashes for k2-1.24.1-cp39-cp39-macosx_10_15_x86_64.whl
Algorithm	Hash digest
SHA256	`b2c5bed70032d731ccdf48b8b7056febcb70311866552305a762a6f96292b139`
MD5	`22b7d8f01e8c38ed6c6f4e92a648458a`
BLAKE2b-256	`389f879e1f47428b829526d3bd0afc9e6cfc461044dadc7b0a549d503b8cd3d1`

File details

Details for the file k2-1.24.1-cp38-cp38-macosx_10_15_x86_64.whl.

File metadata

Download URL: k2-1.24.1-cp38-cp38-macosx_10_15_x86_64.whl
Upload date: Apr 30, 2023
Size: 2.4 MB
Tags: CPython 3.8, macOS 10.15+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.8.14

File hashes

Hashes for k2-1.24.1-cp38-cp38-macosx_10_15_x86_64.whl
Algorithm	Hash digest
SHA256	`36e76e57f535fef4e2371844bc7c9eae533efa9cc05dc1fae8f6bc265993c258`
MD5	`beb1354087f34d772f5f4fe75453385a`
BLAKE2b-256	`642126f06140a67404b9aad6d0f9349db659c7deec4744774ad67ab7a0ff76ec`

File details

Details for the file k2-1.24.1-cp37-cp37m-macosx_10_15_x86_64.whl.

File metadata

Download URL: k2-1.24.1-cp37-cp37m-macosx_10_15_x86_64.whl
Upload date: Apr 30, 2023
Size: 2.4 MB
Tags: CPython 3.7m, macOS 10.15+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.7.15

File hashes

Hashes for k2-1.24.1-cp37-cp37m-macosx_10_15_x86_64.whl
Algorithm	Hash digest
SHA256	`a182c34c8b2973bd5042d9de9a14d5364ba8d6d6a82875602270ff76635de35b`
MD5	`74eb625486b1803534ea6e3c6e0b278e`
BLAKE2b-256	`c291d7e0c2f2411f7566dd0c919bc857d5e88fde33750bd379a53f908f2670c5`
