Skip to main content

A simple framework for room acoustics and audio processing in Python.

Project description

Pyroomacoustics logo
Documentation Status Test on mybinder Pyroomacoustics discord server

Summary

Pyroomacoustics is a software package aimed at the rapid development and testing of audio array processing algorithms. The content of the package can be divided into three main components:

  1. Intuitive Python object-oriented interface to quickly construct different simulation scenarios involving multiple sound sources and microphones in 2D and 3D rooms;

  2. Fast C++ implementation of the image source model and ray tracing for general polyhedral rooms to efficiently generate room impulse responses and simulate the propagation between sources and receivers;

  3. Reference implementations of popular algorithms for STFT, beamforming, direction finding, adaptive filtering, source separation, and single channel denoising.

Together, these components form a package with the potential to speed up the time to market of new algorithms by significantly reducing the implementation overhead in the performance evaluation step. Please refer to this notebook for a demonstration of the different components of this package.

Room Acoustics Simulation

Consider the following scenario.

Suppose, for example, you wanted to produce a radio crime drama, and it so happens that, according to the scriptwriter, the story line absolutely must culminate in a satanic mass that quickly degenerates into a violent shootout, all taking place right around the altar of the highly reverberant acoustic environment of Oxford’s Christ Church cathedral. To ensure that it sounds authentic, you asked the Dean of Christ Church for permission to record the final scene inside the cathedral, but somehow he fails to be convinced of the artistic merit of your production, and declines to give you permission. But recorded in a conventional studio, the scene sounds flat. So what do you do?

—Schnupp, Nelken, and King, Auditory Neuroscience, 2010

Faced with this difficult situation, pyroomacoustics can save the day by simulating the environment of the Christ Church cathedral!

At the core of the package is a room impulse response (RIR) generator based on the image source model that can handle

  • Convex and non-convex rooms

  • 2D/3D rooms

The core image source model and ray tracing modules are written in C++ for better performance.

The philosophy of the package is to abstract all necessary elements of an experiment using an object-oriented programming approach. Each of these elements is represented using a class and an experiment can be designed by combining these elements just as one would do in a real experiment.

Let’s imagine we want to simulate a delay-and-sum beamformer that uses a linear array with four microphones in a shoe box shaped room that contains only one source of sound. First, we create a room object, to which we add a microphone array object, and a sound source object. Then, the room object has methods to compute the RIR between source and receiver. The beamformer object then extends the microphone array class and has different methods to compute the weights, for example delay-and-sum weights. See the example below to get an idea of what the code looks like.

The Room class also allows one to process sound samples emitted by sources, effectively simulating the propagation of sound between sources and microphones. At the input of the microphones composing the beamformer, an STFT (short time Fourier transform) engine allows to quickly process the signals through the beamformer and evaluate the output.

Reference Implementations

In addition to its core image source model simulation, pyroomacoustics also contains a number of reference implementations of popular audio processing algorithms for

We use an object-oriented approach to abstract the details of specific algorithms, making them easy to compare. Each algorithm can be tuned through optional parameters. We have tried to pre-set values for the tuning parameters so that a run with the default values will in general produce reasonable results.

Datasets

In an effort to simplify the use of datasets, we provide a few wrappers that allow to quickly load and sort through some popular speech corpora. At the moment we support the following.

For more details, see the doc.

Quick Install

Install the package with pip:

pip install pyroomacoustics

A cookiecutter is available that generates a working simulation script for a few 2D/3D scenarios:

# if necessary install cookiecutter
pip install cookiecutter

# create the simulation script
cookiecutter gh:fakufaku/cookiecutter-pyroomacoustics-sim

# run the newly created script
python <chosen_script_name>.py

We have also provided a minimal Dockerfile example in order to install and run the package within a Docker container. Note that you should increase the memory of your containers to 4 GB. Less may also be sufficient, but this is necessary for building the C++ code extension. You can build the container with:

docker build -t pyroom_container .

And enter the container with:

docker run -it pyroom_container:latest /bin/bash

Dependencies

The minimal dependencies are:

numpy
scipy>=0.18.0
Cython
pybind11

where Cython is only needed to benefit from the compiled accelerated simulator. The simulator itself has a pure Python counterpart, so that this requirement could be ignored, but is much slower.

On top of that, some functionalities of the package depend on extra packages:

samplerate   # for resampling signals
matplotlib   # to create graphs and plots
sounddevice  # to play sound samples
mir_eval     # to evaluate performance of source separation in examples

The requirements.txt file lists all packages necessary to run all of the scripts in the examples folder.

This package is mainly developed under Python 3.6. The last supported version for Python 2.7 is 0.4.3.

Under Linux and Mac OS, the compiled accelerators require a valid compiler to be installed, typically this is GCC. When no compiler is present, the package will still install but default to the pure Python implementation which is much slower. On Windows, we provide pre-compiled Python Wheels for Python 3.5 and 3.6.

Example

Here is a quick example of how to create and visualize the response of a beamformer in a room.

import numpy as np
import matplotlib.pyplot as plt
import pyroomacoustics as pra

# Create a 4 by 6 metres shoe box room
room = pra.ShoeBox([4,6])

# Add a source somewhere in the room
room.add_source([2.5, 4.5])

# Create a linear array beamformer with 4 microphones
# with angle 0 degrees and inter mic distance 10 cm
R = pra.linear_2D_array([2, 1.5], 4, 0, 0.1)
room.add_microphone_array(pra.Beamformer(R, room.fs))

# Now compute the delay and sum weights for the beamformer
room.mic_array.rake_delay_and_sum_weights(room.sources[0][:1])

# plot the room and resulting beamformer
room.plot(freq=[1000, 2000, 4000, 8000], img_order=0)
plt.show()

More examples

A couple of detailed demos with illustrations are available.

A comprehensive set of examples covering most of the functionalities of the package can be found in the examples folder of the GitHub repository.

A video introduction to pyroomacoustics.

A video tutorial series on using pyroomacoustics covering many advanced topics.

Authors

  • Robin Scheibler

  • Ivan Dokmanić

  • Sidney Barthe

  • Eric Bezzam

  • Hanjie Pan

How to contribute

If you would like to contribute, please clone the repository and send a pull request.

For more details, see our CONTRIBUTING page.

Academic publications

This package was developed to support academic publications. The package contains implementations for DOA algorithms and acoustic beamformers introduced in the following papers.

  • H. Pan, R. Scheibler, I. Dokmanic, E. Bezzam and M. Vetterli. FRIDA: FRI-based DOA estimation for arbitrary array layout, ICASSP 2017, New Orleans, USA, 2017.

  • I. Dokmanić, R. Scheibler and M. Vetterli. Raking the Cocktail Party, in IEEE Journal of Selected Topics in Signal Processing, vol. 9, num. 5, p. 825 - 836, 2015.

  • R. Scheibler, I. Dokmanić and M. Vetterli. Raking Echoes in the Time Domain, ICASSP 2015, Brisbane, Australia, 2015.

If you use this package in your own research, please cite our paper describing it.

R. Scheibler, E. Bezzam, I. Dokmanić, Pyroomacoustics: A Python package for audio room simulations and array processing algorithms, Proc. IEEE ICASSP, Calgary, CA, 2018.

License

Copyright (c) 2014-2021 EPFL-LCAV

Permission is hereby granted, free of charge, to any person obtaining a copy of
this software and associated documentation files (the "Software"), to deal in
the Software without restriction, including without limitation the rights to
use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies
of the Software, and to permit persons to whom the Software is furnished to do
so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyroomacoustics-0.8.2.tar.gz (35.1 MB view details)

Uploaded Source

Built Distributions

pyroomacoustics-0.8.2-cp312-cp312-win_amd64.whl (34.4 MB view details)

Uploaded CPython 3.12 Windows x86-64

pyroomacoustics-0.8.2-cp312-cp312-macosx_10_13_universal2.whl (34.9 MB view details)

Uploaded CPython 3.12 macOS 10.13+ universal2 (ARM64, x86-64)

pyroomacoustics-0.8.2-cp311-cp311-win_amd64.whl (34.4 MB view details)

Uploaded CPython 3.11 Windows x86-64

pyroomacoustics-0.8.2-cp311-cp311-macosx_10_9_universal2.whl (34.9 MB view details)

Uploaded CPython 3.11 macOS 10.9+ universal2 (ARM64, x86-64)

pyroomacoustics-0.8.2-cp310-cp310-win_amd64.whl (34.4 MB view details)

Uploaded CPython 3.10 Windows x86-64

pyroomacoustics-0.8.2-cp310-cp310-macosx_12_0_x86_64.whl (34.5 MB view details)

Uploaded CPython 3.10 macOS 12.0+ x86-64

pyroomacoustics-0.8.2-cp310-cp310-macosx_10_9_universal2.whl (34.9 MB view details)

Uploaded CPython 3.10 macOS 10.9+ universal2 (ARM64, x86-64)

pyroomacoustics-0.8.2-cp39-cp39-win_amd64.whl (34.4 MB view details)

Uploaded CPython 3.9 Windows x86-64

pyroomacoustics-0.8.2-cp39-cp39-macosx_12_0_x86_64.whl (34.5 MB view details)

Uploaded CPython 3.9 macOS 12.0+ x86-64

pyroomacoustics-0.8.2-cp38-cp38-win_amd64.whl (34.4 MB view details)

Uploaded CPython 3.8 Windows x86-64

pyroomacoustics-0.8.2-cp38-cp38-macosx_12_0_x86_64.whl (34.5 MB view details)

Uploaded CPython 3.8 macOS 12.0+ x86-64

File details

Details for the file pyroomacoustics-0.8.2.tar.gz.

File metadata

  • Download URL: pyroomacoustics-0.8.2.tar.gz
  • Upload date:
  • Size: 35.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.10

File hashes

Hashes for pyroomacoustics-0.8.2.tar.gz
Algorithm Hash digest
SHA256 240ef5e6c6e2ef588f9722a1caa97616b39194b1697244f21754807692f3f83b
MD5 29b74c6602df77c8a662663030f80ed5
BLAKE2b-256 61c4365b4fb7d029f2483582d0d5be7b2bc883e17336e1db827f2c3d86f6e4a6

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.2-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.2-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 7c60d8d12f06e59ace0f348a54fb5089b670a24fbec68e05ab28a000963d7bef
MD5 3ccfcc36f8969e14f1e20db2fe060bb5
BLAKE2b-256 1a2be622e4a23c65464ec536f0e7d00e87550e41ad4075a86ec3ab19ee340007

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.2-cp312-cp312-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.2-cp312-cp312-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 a9dd976c1ce65c5937b9e4e3ddb19dbe0649cf6cef94c70671fab0a565549b2c
MD5 bdfb97337149d8b26b9850de27f8059c
BLAKE2b-256 f2971de3ac9232ad67d80010987f5bfb152b52e698f6cd42c985135fb95191ea

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.2-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.2-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 3a2d915d7defa607d1432a9854acc329ada4ea34fbdf0e59d3ff31ebb72365d9
MD5 2661462c46f2cf694226efe6d398ceba
BLAKE2b-256 3eac5a39b11ea7ba7439f84dc22fc87c1080439a874f40bbd947c9aba63d0673

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.2-cp311-cp311-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.2-cp311-cp311-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 315e84e00c1b46302b790cb4cc9bae141ad58db0606476db629c5585a673d677
MD5 4ec180df45a1f466963c64a988544ea5
BLAKE2b-256 e8bf8dd9763fe99e2fb64b444b472e65c6761f27a6744e92c8cb6ca739eb4a3f

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.2-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.2-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 0f73aeeb80bdaa022e2669e50267479bd589e593e724922100eecd1f98d2ad79
MD5 145c9308dd1764af18ea61f9cf95e55d
BLAKE2b-256 eaed9a0edcb222af2d4bd10a04011d6f3c3d2ba04809ba076934dd69a2ded216

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.2-cp310-cp310-macosx_12_0_x86_64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.2-cp310-cp310-macosx_12_0_x86_64.whl
Algorithm Hash digest
SHA256 31cb775b8ef2dc12bf6291d4128794072bc305669a3dd7e91d086b47580bae49
MD5 dc24f8560fc935bedfeedaaf8f18e12f
BLAKE2b-256 d235f414ac126ea07b4938dbb083bfb1fe20e3440ec896c1bbfa08d107640989

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.2-cp310-cp310-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.2-cp310-cp310-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 527d845ffbea3f84ab04e75689d668060ff3d46c4f6fe1da4857b9a286aad039
MD5 1496a8c4f2d486f1c71e6fe6788d099b
BLAKE2b-256 4994e23f32e5f86bcb799a00d57769e85a423d572f9cb3785880b4098310e94f

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.2-cp39-cp39-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.2-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 c0b855ca9973b44d09e37e3e72c4f27854f993a080f2df73b5f77d2e7cc86558
MD5 0ba6b4d03b7b610f178217b0aebfa383
BLAKE2b-256 240029113bc8193c6ffb761ab7e6a36f83dedff06c35449e5eaf3f614ff9853a

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.2-cp39-cp39-macosx_12_0_x86_64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.2-cp39-cp39-macosx_12_0_x86_64.whl
Algorithm Hash digest
SHA256 07ff11d8eee73b772a4b86bfe6c53239225b4e353b5041cc0aeff064b22eb351
MD5 9e76e9935ffa1bd54627898c7fec05d1
BLAKE2b-256 4f6953eac9fdd03647d8530bd8ebf774070d2101c88d4a1d3a90e985271cee21

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.2-cp38-cp38-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.2-cp38-cp38-win_amd64.whl
Algorithm Hash digest
SHA256 f7233dc0f4eb688ad8ffb8c332710b0b5744c72e5015c516ecfa51a248dcd8d8
MD5 ce7b29c5cc3ad9a2aa67dac98a3a0dd5
BLAKE2b-256 c4a2aacdbb2ce5aa00df708388056492ad60a3c2a2ca1134b34dbe6c4f4a3846

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.2-cp38-cp38-macosx_12_0_x86_64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.2-cp38-cp38-macosx_12_0_x86_64.whl
Algorithm Hash digest
SHA256 9b2f05979f5871f88ddd3277993ab55e63fec94a3b1bc0d6c186d9658684f8fb
MD5 e136b790139fd0603392324699f05ea9
BLAKE2b-256 e63a1a2979647b0c2fd5b71d9b75b0b7d6d3bddbdacd3bbdfa60fcac7fadcbb3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page