Skip to main content

Collection of utilities in the department of communications engineering of the UPB

Project description

Paderbox: A collection of utilities for audio / speech processing

Build Status Azure DevOps tests Azure DevOps coverage MIT License

This repository started in late 2014 as an internal development repository for the Communications Engineering Group at Paderborn University, Germany. Over the years it emerged to a collection of IO helper, feature extraction modules and numerous smaller tools adding functionality to Numpy, Pandas, and others.

The main purpose here is to limit code duplication across our other public repositories.

We ensured that most functions/ classes contain Python Docstrings such that automatic tooltips for most functions are supported. It was deliberately decided against a lengthy documentation: most emphasis is put on the Python Docstrings and code readability itself.

Examples

Without going through all functions, we here select two examples which demonstrate why we rely on this very implementation.

Short-time Fourier transform

The Short-time Fourier transform (STFT) is a widely used feature extraction method when dealing with time series such as audio/ speech. Most repositories, including Deep Learning frameworks such as TensorFlow, provide an STFT implementation. However, it is rarely seen, that these implementations allow an exact reconstruction when applying the STFT followed by an inverse STFT.

Two important issues often overseen are:

  • How do I need to calculate the biorthogonal reconstruction window when using any STFT window function?
  • How much padding depeding on STFT window length, DFT length, and shift is needed to compensate for fade-in, fade-out, and uneven signal length?

Our STFT implementation addresses aforementioned issues, can operate on any number of independent dimensions and is already battle tested in our publications on audio/ speech since 2015. Numerous STFT tests ensure that the code remains stable and in particular test for the aforementioned problems.

Fast access to the IPython audio player

The function paderbox.play.play() is a somewhat elaborated wrapper around IPython.display.Audio. A single function allows to play audio from the waveform, from the STFT signal, and from file. It therefore serves as a great tool within Jupyter Notebooks and helps for quick inspection of simulation results.

Installation

Install it from PyPI with pip

pip install paderbox`[all]`

The [all] flag is optional and indicates to install all dependencies. Remove it, when you want to have the minimal dependencies.

Alternatively, you can clone this repository and install it as follows

git clone https://github.com/fgnt/paderbox.git
cd paderbox
pip install --editable .[all]

How to cite?

There is no clear way how to cite this repository for research. However, we would be grateful for direct imports from this repository if you use, e.g., the STFT. We are also fine when you copy the code as long as it remains visible where you copied the code from.

If you use one of our other repositories relying on this work we would be thankful if you respect citation hints for that repository.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

paderbox-0.0.3.tar.gz (154.4 kB view details)

Uploaded Source

File details

Details for the file paderbox-0.0.3.tar.gz.

File metadata

  • Download URL: paderbox-0.0.3.tar.gz
  • Upload date:
  • Size: 154.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.20.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.8

File hashes

Hashes for paderbox-0.0.3.tar.gz
Algorithm Hash digest
SHA256 891b68d0a1aad8c52fcdce8d3ec0ae49e577ffd5eee0c15b6ef2b284f1d460bd
MD5 f01674c9e97a8c248273b8fcabb52fc5
BLAKE2b-256 db9c01f376a1ff9727f4649b1b9273d9ad5441872488d256db37fb466f028722

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page