Skip to main content

A simple framework for room acoustics and audio processing in Python.

Project description

https://travis-ci.org/LCAV/pyroomacoustics.svg?branch=pypi-release Documentation Status

Summary

Pyroomacoustics is a software package aimed at the rapid development and testing of audio array processing algorithms. The content of the package can be divided into three main components:

  1. Intuitive Python object-oriented interface to quickly construct different simulation scenarios involving multiple sound sources and microphones in 2D and 3D rooms;

  2. Fast C implementation of the image source model for general polyhedral rooms to efficiently generate room impulse responses and simulate the propagation between sources and receivers;

  3. Reference implementations of popular algorithms for STFT, beamforming, direction finding, adaptive filtering, source separation, and single channel denoising.

Together, these components form a package with the potential to speed up the time to market of new algorithms by significantly reducing the implementation overhead in the performance evaluation step. Please refer to this notebook for a demonstration of the different components of this package.

Room Acoustics Simulation

Consider the following scenario.

Suppose, for example, you wanted to produce a radio crime drama, and it so happens that, according to the scriptwriter, the story line absolutely must culminate in a satanic mass that quickly degenerates into a violent shootout, all taking place right around the altar of the highly reverberant acoustic environment of Oxford’s Christ Church cathedral. To ensure that it sounds authentic, you asked the Dean of Christ Church for permission to record the final scene inside the cathedral, but somehow he fails to be convinced of the artistic merit of your production, and declines to give you permission. But recorded in a conventional studio, the scene sounds flat. So what do you do?

—Schnupp, Nelken, and King, Auditory Neuroscience, 2010

Faced with this difficult situation, pyroomacoustics can save the day by simulating the environment of the Christ Church cathedral!

At the core of the package is a room impulse response (RIR) generator based on the image source model that can handle

  • Convex and non-convex rooms

  • 2D/3D rooms

Both a pure Python implementation and a C accelerator are included for maximum speed and compatibility.

The philosophy of the package is to abstract all necessary elements of an experiment using an object-oriented programming approach. Each of these elements is represented using a class and an experiment can be designed by combining these elements just as one would do in a real experiment.

Let’s imagine we want to simulate a delay-and-sum beamformer that uses a linear array with four microphones in a shoe box shaped room that contains only one source of sound. First, we create a room object, to which we add a microphone array object, and a sound source object. Then, the room object has methods to compute the RIR between source and receiver. The beamformer object then extends the microphone array class and has different methods to compute the weights, for example delay-and-sum weights. See the example below to get an idea of what the code looks like.

The Room class also allows one to process sound samples emitted by sources, effectively simulating the propagation of sound between sources and microphones. At the input of the microphones composing the beamformer, an STFT (short time Fourier transform) engine allows to quickly process the signals through the beamformer and evaluate the output.

Reference Implementations

In addition to its core image source model simulation, pyroomacoustics also contains a number of reference implementations of popular audio processing algorithms for

We use an object-oriented approach to abstract the details of specific algorithms, making them easy to compare. Each algorithm can be tuned through optional parameters. We have tried to pre-set values for the tuning parameters so that a run with the default values will in general produce reasonable results.

Datasets

In an effort to simplify the use of datasets, we provide a few wrappers that allow to quickly load and sort through some popular speech corpora. At the moment we support the following.

For more details, see the doc.

Quick Install

Install the package with pip:

pip install pyroomacoustics

A cookiecutter is available that generates a working simulation script for a few 2D/3D scenarios:

# if necessary install cookiecutter
pip install cookiecutter

# create the simulation script
cookiecutter gh:fakufaku/cookiecutter-pyroomacoustics-sim

# run the newly created script
python <chosen_script_name>.py

Dependencies

The minimal dependencies are:

numpy
scipy>=0.18.0
Cython

where Cython is only needed to benefit from the compiled accelerated simulator. The simulator itself has a pure Python counterpart, so that this requirement could be ignored, but is much slower.

On top of that, some functionalities of the package depend on extra packages:

samplerate   # for resampling signals
matplotlib   # to create graphs and plots
sounddevice  # to play sound samples
mir_eval     # to evaluate performance of source separation in examples

The requirements.txt file lists all packages necessary to run all of the scripts in the examples folder.

This package is mainly developed under Python 3.5. We try as much as possible to keep things compatible with Python 2.7 and run tests and builds under both. However, the tests code coverage is far from 100% and it might happen that we break some things in Python 2.7 from time to time. We apologize in advance for that.

Under Linux and Mac OS, the compiled accelerators require a valid compiler to be installed, typically this is GCC. When no compiler is present, the package will still install but default to the pure Python implementation which is much slower. On Windows, we provide pre-compiled Python Wheels for Python 3.5 and 3.6.

Example

Here is a quick example of how to create and visual the response of a beamformer in a room.

import numpy as np
import matplotlib.pyplot as plt
import pyroomacoustics as pra

# Create a 4 by 6 metres shoe box room
room = pra.ShoeBox([4,6])

# Add a source somewhere in the room
room.add_source([2.5, 4.5])

# Create a linear array beamformer with 4 microphones
# with angle 0 degrees and inter mic distance 10 cm
R = pra.linear_2D_array([2, 1.5], 4, 0, 0.1)
room.add_microphone_array(pra.Beamformer(R, room.fs))

# Now compute the delay and sum weights for the beamformer
room.mic_array.rake_delay_and_sum_weights(room.sources[0][:1])

# plot the room and resulting beamformer
room.plot(freq=[1000, 2000, 4000, 8000], img_order=0)
plt.show()

More examples

A couple of detailed demos with illustrations are available.

A comprehensive set of examples covering most of the functionalities of the package can be found in the examples folder of the GitHub repository.

Authors

  • Robin Scheibler

  • Ivan Dokmanić

  • Sidney Barthe

  • Eric Bezzam

  • Hanjie Pan

How to contribute

If you would like to contribute, please clone the repository and send a pull request.

For more details, see our CONTRIBUTING page.

Academic publications

This package was developed to support academic publications. The package contains implementations for DOA algorithms and acoustic beamformers introduced in the following papers.

  • H. Pan, R. Scheibler, I. Dokmanic, E. Bezzam and M. Vetterli. FRIDA: FRI-based DOA estimation for arbitrary array layout, ICASSP 2017, New Orleans, USA, 2017.

  • I. Dokmanić, R. Scheibler and M. Vetterli. Raking the Cocktail Party, in IEEE Journal of Selected Topics in Signal Processing, vol. 9, num. 5, p. 825 - 836, 2015.

  • R. Scheibler, I. Dokmanić and M. Vetterli. Raking Echoes in the Time Domain, ICASSP 2015, Brisbane, Australia, 2015.

If you use this package in your own research, please cite our paper describing it.

R. Scheibler, E. Bezzam, I. Dokmanić, Pyroomacoustics: A Python package for audio room simulations and array processing algorithms, Proc. IEEE ICASSP, Calgary, CA, 2018.

License

Copyright (c) 2014-2018 EPFL-LCAV

Permission is hereby granted, free of charge, to any person obtaining a copy of
this software and associated documentation files (the "Software"), to deal in
the Software without restriction, including without limitation the rights to
use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies
of the Software, and to permit persons to whom the Software is furnished to do
so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyroomacoustics-0.1.22.tar.gz (269.1 kB view details)

Uploaded Source

Built Distributions

pyroomacoustics-0.1.22-cp37-cp37m-win_amd64.whl (263.8 kB view details)

Uploaded CPython 3.7m Windows x86-64

pyroomacoustics-0.1.22-cp37-cp37m-win32.whl (249.6 kB view details)

Uploaded CPython 3.7m Windows x86

pyroomacoustics-0.1.22-cp37-cp37m-macosx_10_7_x86_64.whl (270.1 kB view details)

Uploaded CPython 3.7m macOS 10.7+ x86-64

pyroomacoustics-0.1.22-cp36-cp36m-win_amd64.whl (263.9 kB view details)

Uploaded CPython 3.6m Windows x86-64

pyroomacoustics-0.1.22-cp36-cp36m-win32.whl (249.6 kB view details)

Uploaded CPython 3.6m Windows x86

pyroomacoustics-0.1.22-cp36-cp36m-macosx_10_7_x86_64.whl (269.8 kB view details)

Uploaded CPython 3.6m macOS 10.7+ x86-64

pyroomacoustics-0.1.22-cp35-cp35m-win_amd64.whl (262.2 kB view details)

Uploaded CPython 3.5m Windows x86-64

pyroomacoustics-0.1.22-cp35-cp35m-win32.whl (248.1 kB view details)

Uploaded CPython 3.5m Windows x86

pyroomacoustics-0.1.22-cp35-cp35m-macosx_10_6_x86_64.whl (267.6 kB view details)

Uploaded CPython 3.5m macOS 10.6+ x86-64

File details

Details for the file pyroomacoustics-0.1.22.tar.gz.

File metadata

  • Download URL: pyroomacoustics-0.1.22.tar.gz
  • Upload date:
  • Size: 269.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.5.6

File hashes

Hashes for pyroomacoustics-0.1.22.tar.gz
Algorithm Hash digest
SHA256 c71802e0bb4a6a15adfbdd133ed8cfcb4b642c4ae985115b5739478f8456aa4e
MD5 29dc89b60c6b5f4f9bf160b975665488
BLAKE2b-256 fb134921889ad7c1bd6cf675a7ab47952d96c546f3b9fbe7da9a5258ca6e2ab3

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.1.22-cp37-cp37m-win_amd64.whl.

File metadata

  • Download URL: pyroomacoustics-0.1.22-cp37-cp37m-win_amd64.whl
  • Upload date:
  • Size: 263.8 kB
  • Tags: CPython 3.7m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.1

File hashes

Hashes for pyroomacoustics-0.1.22-cp37-cp37m-win_amd64.whl
Algorithm Hash digest
SHA256 2d2e5566a29deacd9cbbb1a6875fff0b91045bcb059d327554f9805c1222a99e
MD5 712e83d9094b26a9e53c471edaf4bc42
BLAKE2b-256 05bbf121cd8718481d60d64ed9bc617c6e725847bce42b2bb0e5a94aa6e4b279

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.1.22-cp37-cp37m-win32.whl.

File metadata

  • Download URL: pyroomacoustics-0.1.22-cp37-cp37m-win32.whl
  • Upload date:
  • Size: 249.6 kB
  • Tags: CPython 3.7m, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.1

File hashes

Hashes for pyroomacoustics-0.1.22-cp37-cp37m-win32.whl
Algorithm Hash digest
SHA256 09d53f45f2e174f7b26e98380defac14a381bbe2c10753a3c5cc66cbaf5b8835
MD5 6e57b890087424bdc1ba64424cf5dc09
BLAKE2b-256 b6298128553f8ca4d1bc77a9cbabd6b7fd2f3adbdd91b41ab53e2378f3cb56eb

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.1.22-cp37-cp37m-macosx_10_7_x86_64.whl.

File metadata

  • Download URL: pyroomacoustics-0.1.22-cp37-cp37m-macosx_10_7_x86_64.whl
  • Upload date:
  • Size: 270.1 kB
  • Tags: CPython 3.7m, macOS 10.7+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.3

File hashes

Hashes for pyroomacoustics-0.1.22-cp37-cp37m-macosx_10_7_x86_64.whl
Algorithm Hash digest
SHA256 755ed29e2c29b79155513e3afd2188894496ba12077354c11dec21532f836cfd
MD5 f941ea88d24ba51f1d26db108d94480e
BLAKE2b-256 e12f256107e114fc44493883c0e4004e6833f86ddaa1cbe6ca875cf4bdec0b75

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.1.22-cp36-cp36m-win_amd64.whl.

File metadata

  • Download URL: pyroomacoustics-0.1.22-cp36-cp36m-win_amd64.whl
  • Upload date:
  • Size: 263.9 kB
  • Tags: CPython 3.6m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.1

File hashes

Hashes for pyroomacoustics-0.1.22-cp36-cp36m-win_amd64.whl
Algorithm Hash digest
SHA256 73e44a17c87cf3f5b1986aafdb77bc0cd86697092e8aa30fb930ccbb61b8c402
MD5 feefca842a21533b0251ab59cfe1da27
BLAKE2b-256 93897f9f4d90a04391a5fcf1f1f5d225f8c3e82eea95acefe614cc5044e6efae

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.1.22-cp36-cp36m-win32.whl.

File metadata

  • Download URL: pyroomacoustics-0.1.22-cp36-cp36m-win32.whl
  • Upload date:
  • Size: 249.6 kB
  • Tags: CPython 3.6m, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.1

File hashes

Hashes for pyroomacoustics-0.1.22-cp36-cp36m-win32.whl
Algorithm Hash digest
SHA256 2a6923693d0d36601773aa4243f2f99fb26d7e9e6be194f3d581f5a43a07dc2a
MD5 2d28a5c5151c55e6c2efccb2f2739456
BLAKE2b-256 6b1d779ddc47e8e14525ab9a4e8051afb264246b9b7ad2981be6f582b2717f42

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.1.22-cp36-cp36m-macosx_10_7_x86_64.whl.

File metadata

  • Download URL: pyroomacoustics-0.1.22-cp36-cp36m-macosx_10_7_x86_64.whl
  • Upload date:
  • Size: 269.8 kB
  • Tags: CPython 3.6m, macOS 10.7+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.8

File hashes

Hashes for pyroomacoustics-0.1.22-cp36-cp36m-macosx_10_7_x86_64.whl
Algorithm Hash digest
SHA256 911ddc4104c54c9760ef1c8b493fb3e728a716c67c7c0c48d93f84cee2b8131a
MD5 c4d29f1fc1ba96fa9eca6cd949eb7e52
BLAKE2b-256 4af8e5806cf13b9f926db86445adfba32cae57bc2ab1aab7fa0925e7807e57d6

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.1.22-cp35-cp35m-win_amd64.whl.

File metadata

  • Download URL: pyroomacoustics-0.1.22-cp35-cp35m-win_amd64.whl
  • Upload date:
  • Size: 262.2 kB
  • Tags: CPython 3.5m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.1

File hashes

Hashes for pyroomacoustics-0.1.22-cp35-cp35m-win_amd64.whl
Algorithm Hash digest
SHA256 897c2844434c51616a737374a382747fb2c8ddd7a852bdf2f06127b67d2c2334
MD5 b35e9fb9b9ed48bbfb2754a288675cfb
BLAKE2b-256 65d35c2a07993c97ebdce2d07f9e3862336bbfcd931bffd877db51277a8e01c3

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.1.22-cp35-cp35m-win32.whl.

File metadata

  • Download URL: pyroomacoustics-0.1.22-cp35-cp35m-win32.whl
  • Upload date:
  • Size: 248.1 kB
  • Tags: CPython 3.5m, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.1

File hashes

Hashes for pyroomacoustics-0.1.22-cp35-cp35m-win32.whl
Algorithm Hash digest
SHA256 996cdecd42faf07382883453be0e7d95bf9af982ebb4345a6c66ccbe1cb77ef9
MD5 67cee1ffaba11ad0acb5af642f5d4577
BLAKE2b-256 61e59822e710125bc94127e43b676214f5be2cdb2426cd66b8a680385d00fb3d

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.1.22-cp35-cp35m-macosx_10_6_x86_64.whl.

File metadata

  • Download URL: pyroomacoustics-0.1.22-cp35-cp35m-macosx_10_6_x86_64.whl
  • Upload date:
  • Size: 267.6 kB
  • Tags: CPython 3.5m, macOS 10.6+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.5.6

File hashes

Hashes for pyroomacoustics-0.1.22-cp35-cp35m-macosx_10_6_x86_64.whl
Algorithm Hash digest
SHA256 de5146203d906bfb7ad872474c1f23322c568ac10d5785bdadb6fb42e9db60c0
MD5 83f9314aceef17e82d668bb21d4ac531
BLAKE2b-256 d4e77786ffa23859763b083e66fe85a58c102654a7ba162766f045bd5a191b9a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page