Skip to main content

A speech signal processing library with emphasis on deep learning.

Project description

audlib

Build Status

A speech signal processing library in Python with emphasis on deep learning.

audlib provides a collection of utilities for developing speech-related applications using both signal processing and deep learning. The package offers the following high-level features:

  • Speech signal processing utilities with ready-to-use applications
  • Deep learning architectures for speech processing tasks in PyTorch
  • PyTorch-compatible interface (similar to torchvision) for batch processing
  • A command-line interface with a unix-pipe-like syntax
    • I/O utilities for interfacing with CMUSPHINX

Some use cases of audlib are:

  • Extracting common speech features for your backend
  • Integrating CMUSPHINX with modern deep learning architectures
  • Developing your own deep-learning-based tools for speech tasks
  • Quickly try out speech processors and visualize the spectrogram in command line

audlib focuses on correctness, efficiency, and simplicity. Signal processing functionalities are mathematically checked whenever possible (e.g. constant overlap-add, istft(stft(X))==X). Deep neural networks follow the PyTorch's convention.

Installation

pip install audlib

Developer Installation

In the source directory, install the library with test dependencies:

pip install ".[tests]"

Run test:

python -m pytest tests

Release flow

  1. Bump version in setup.py.
  2. Package release: python setup.py sdist bdist_wheel
  3. Upload release: twine upload --repository-url https://upload.pypi.org/legacy/ dist/*

Usage example

More extensive examples can be found in examples/.

Release history

  • 0.0.1
    • Work in progress
    • First release on PyPI

Authors

Raymond Xia - raymondxia@cmu.edu

Mahmoud Alismail - mahmoudi@andrew.cmu.edu

Shangwu Yao - shangwuyao@gmail.com

Andrew Wu - anwu.andrew@hotmail.com

Feel free to send us any issue you find and question you have.

Contributing

Please contact one of the authors.

License

Distributed under the MIT license. See LICENSE for more information.

Project details


Release history Release notifications

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for audlib, version 0.0.1.1
Filename, size File type Python version Upload date Hashes
Filename, size audlib-0.0.1.1-py3-none-any.whl (169.2 kB) File type Wheel Python version py3 Upload date Hashes View hashes
Filename, size audlib-0.0.1.1.tar.gz (149.5 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page