Almost State-of-the-art Automatic Speech Recognition using Tensorflow 2

These details have not been verified by PyPI

Project links

Homepage

Project description

TensorFlowASR :zap:

python tensorflow

Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2

TensorFlowASR implements some automatic speech recognition architectures such as DeepSpeech2, Jasper, RNN Transducer, ContextNet, Conformer, etc. These models can be converted to TFLite to reduce memory and computation for deployment :smile:

What's New?

What's New?
Table of Contents
:yum: Supported Models
- Baselines
- Publications
Installation
Training & Testing Tutorial
Features Extraction
Augmentations
TFLite Convertion
Pretrained Models
Corpus Sources
- English
- Vietnamese
How to contribute
References & Credits
Contact

:yum: Supported Models

Baselines

Transducer Models (End2end models using RNNT Loss for training, currently supported Conformer, ContextNet, Streaming Transducer)
CTCModel (End2end models using CTC Loss for training, currently supported DeepSpeech2, Jasper)

Publications

Conformer Transducer (Reference: https://arxiv.org/abs/2005.08100) See examples/models/transducer/conformer
ContextNet (Reference: http://arxiv.org/abs/2005.03191) See examples/models/transducer/contextnet
RNN Transducer (Reference: https://arxiv.org/abs/1811.06621) See examples/models/transducer/rnnt
Deep Speech 2 (Reference: https://arxiv.org/abs/1512.02595) See examples/models/ctc/deepspeech2
Jasper (Reference: https://arxiv.org/abs/1904.03288) See examples/models/ctc/jasper

Installation

For training and testing, you should use git clone for installing necessary packages from other authors (ctc_decoders, rnnt_loss, etc.)

Installing from source (recommended)

git clone https://github.com/TensorSpeech/TensorFlowASR.git
cd TensorFlowASR
# Tensorflow 2.x (with 2.x.x >= 2.5.1)
pip3 install ".[tf2.x]" # or ".[tf2.x-gpu]"

For anaconda3:

conda create -y -n tfasr tensorflow-gpu python=3.8 # tensorflow if using CPU, this makes sure conda install all dependencies for tensorflow
conda activate tfasr
pip install -U tensorflow-gpu # upgrade to latest version of tensorflow
git clone https://github.com/TensorSpeech/TensorFlowASR.git
cd TensorFlowASR
# Tensorflow 2.x (with 2.x.x >= 2.5.1)
pip3 install ".[tf2.x]" # or ".[tf2.x-gpu]"

Installing via PyPi

# Tensorflow 2.x (with 2.x >= 2.3)
pip3 install "TensorFlowASR[tf2.x]" # or pip3 install "TensorFlowASR[tf2.x-gpu]"

Installing for development

git clone https://github.com/TensorSpeech/TensorFlowASR.git
cd TensorFlowASR
pip3 install -e ".[dev]"
pip3 install -e ".[tf2.x]" # or ".[tf2.x-gpu]" or ".[tf2.x-apple]" for apple m1 machine

Install for Apple Sillicon

Due to tensorflow-text is not built for Apple Sillicon, we need to install it with the prebuilt wheel file from sun1638650145/Libraries-and-Extensions-for-TensorFlow-for-Apple-Silicon

git clone https://github.com/TensorSpeech/TensorFlowASR.git
cd TensorFlowASR
pip3 install -e "." # or pip3 install -e ".[dev] for development # or pip3 install "TensorFlowASR[dev]" from PyPi
pip3 install tensorflow~=2.14.0 # change minor version if you want

Do this after installing TensorFlowASR with tensorflow above

TF_VERSION="$(python3 -c 'import tensorflow; print(tensorflow.__version__)')" && \
TF_VERSION_MAJOR="$(echo $TF_VERSION | cut -d'.' -f1,2)" && \
PY_VERSION="$(python3 -c 'import platform; major, minor, patch = platform.python_version_tuple(); print(f"{major}{minor}");')" && \
URL="https://github.com/sun1638650145/Libraries-and-Extensions-for-TensorFlow-for-Apple-Silicon" && \
pip3 install "${URL}/releases/download/v${TF_VERSION_MAJOR}/tensorflow_text-${TF_VERSION_MAJOR}.0-cp${PY_VERSION}-cp${PY_VERSION}-macosx_11_0_arm64.whl"

Running in a container

docker-compose up -d

Training & Testing Tutorial

For training, please read tutorial_training
For testing, please read tutorial_testing

FYI: Keras builtin training uses infinite dataset, which avoids the potential last partial batch.

See examples for some predefined ASR models and results

Features Extraction

See features_extraction

Augmentations

See augmentations

TFLite Convertion

After converting to tflite, the tflite model is like a function that transforms directly from an audio signal to text and tokens

See tflite_convertion

Pretrained Models

Go to drive

Corpus Sources

English

Name	Source	Hours
LibriSpeech	LibriSpeech	970h
Common Voice	https://commonvoice.mozilla.org	1932h

Vietnamese

Name	Source	Hours
Vivos	https://ailab.hcmus.edu.vn/vivos	15h
InfoRe Technology 1	InfoRe1 (passwd: BroughtToYouByInfoRe)	25h
InfoRe Technology 2 (used in VLSP2019)	InfoRe2 (passwd: BroughtToYouByInfoRe)	415h

How to contribute

Fork the project
Install for development
Create a branch
Make a pull request to this repo

References & Credits

Contact

Huy Le Nguyen

Email: nlhuy.cs.16@gmail.com

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

2.1.0

Jun 9, 2024

2.0.1

May 19, 2024

2.0.0

May 4, 2024

1.0.3

Mar 12, 2022

1.0.2

Nov 7, 2021

1.0.1

May 17, 2021

1.0.0

Apr 17, 2021

0.8.3

Apr 10, 2021

0.8.2

Apr 6, 2021

0.8.1

Mar 17, 2021

0.8.0

Mar 9, 2021

0.7.8

Feb 24, 2021

0.7.7

Feb 21, 2021

0.7.6

Feb 19, 2021

0.7.5

Feb 16, 2021

0.7.4

Feb 13, 2021

0.7.3

Feb 12, 2021

0.7.2

Feb 7, 2021

0.7.1

Jan 31, 2021

0.7.0

Jan 23, 2021

0.6.4

Jan 15, 2021

0.6.3

Jan 6, 2021

0.6.2

Dec 27, 2020

0.6.1

Dec 27, 2020

0.6.0

Dec 27, 2020

0.5.5

Dec 25, 2020

0.5.4

Dec 24, 2020

0.5.3

Dec 20, 2020

0.5.2

Dec 19, 2020

0.5.1

Dec 19, 2020

0.5.0

Dec 17, 2020

0.4.5

Dec 15, 2020

0.4.3

Dec 12, 2020

0.4.2

Dec 11, 2020

0.4.1

Dec 9, 2020

0.4.0

Dec 6, 2020

0.3.2

Dec 4, 2020

0.3.1

Nov 22, 2020

0.3.0

Nov 14, 2020

0.2.10

Nov 3, 2020

0.2.9

Oct 31, 2020

0.2.8

Oct 20, 2020

0.2.7

Oct 18, 2020

0.2.6

Oct 16, 2020

0.2.5

Oct 15, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

TensorFlowASR-2.1.0.tar.gz (88.1 kB view details)

Uploaded Jun 9, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

TensorFlowASR-2.1.0-py3-none-any.whl (134.1 kB view details)

Uploaded Jun 9, 2024 Python 3

File details

Details for the file TensorFlowASR-2.1.0.tar.gz.

File metadata

Download URL: TensorFlowASR-2.1.0.tar.gz
Upload date: Jun 9, 2024
Size: 88.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.0 CPython/3.10.14

File hashes

Hashes for TensorFlowASR-2.1.0.tar.gz
Algorithm	Hash digest
SHA256	`447a538bdbc40e2927f0009c1073f52834d07b33a5b67e4a8d09c5044b0bcff9`
MD5	`03cf96db8838bee0e87792c567e26db3`
BLAKE2b-256	`d13540ce897804cfb0549c1b836f0a9d11f0604da3d9d7341e295cb9dc2c58e5`

See more details on using hashes here.

File details

Details for the file TensorFlowASR-2.1.0-py3-none-any.whl.

File metadata

Download URL: TensorFlowASR-2.1.0-py3-none-any.whl
Upload date: Jun 9, 2024
Size: 134.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.0 CPython/3.10.14

File hashes

Hashes for TensorFlowASR-2.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d78e4e570b6bba5b72debbd394dbeae5b9bc8375895092209f7fdc09960f1ba9`
MD5	`df90e4342a7bf25be9fab410fb8884c7`
BLAKE2b-256	`986ca270b1bdaa0b03f0df3f072dbe7b709b54bf9833659772573f5b0317dc88`

See more details on using hashes here.

TensorFlowASR 2.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

TensorFlowASR :zap:

Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2

What's New?

Table of Contents

:yum: Supported Models

Baselines

Publications

Installation

Installing from source (recommended)

Installing via PyPi

Installing for development

Install for Apple Sillicon

Running in a container

Training & Testing Tutorial

Features Extraction

Augmentations

TFLite Convertion

Pretrained Models

Corpus Sources

English

Vietnamese

How to contribute

References & Credits

Contact

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes