Skip to main content

A simple framework for room acoustics and audio processing in Python.

Project description

Pyroomacoustics logo
Documentation Status Test on mybinder Pyroomacoustics discord server

Summary

Pyroomacoustics is a software package aimed at the rapid development and testing of audio array processing algorithms. The content of the package can be divided into three main components:

  1. Intuitive Python object-oriented interface to quickly construct different simulation scenarios involving multiple sound sources and microphones in 2D and 3D rooms;

  2. Fast C++ implementation of the image source model and ray tracing for general polyhedral rooms to efficiently generate room impulse responses and simulate the propagation between sources and receivers;

  3. Reference implementations of popular algorithms for STFT, beamforming, direction finding, adaptive filtering, source separation, and single channel denoising.

Together, these components form a package with the potential to speed up the time to market of new algorithms by significantly reducing the implementation overhead in the performance evaluation step. Please refer to this notebook for a demonstration of the different components of this package.

Room Acoustics Simulation

Consider the following scenario.

Suppose, for example, you wanted to produce a radio crime drama, and it so happens that, according to the scriptwriter, the story line absolutely must culminate in a satanic mass that quickly degenerates into a violent shootout, all taking place right around the altar of the highly reverberant acoustic environment of Oxford’s Christ Church cathedral. To ensure that it sounds authentic, you asked the Dean of Christ Church for permission to record the final scene inside the cathedral, but somehow he fails to be convinced of the artistic merit of your production, and declines to give you permission. But recorded in a conventional studio, the scene sounds flat. So what do you do?

—Schnupp, Nelken, and King, Auditory Neuroscience, 2010

Faced with this difficult situation, pyroomacoustics can save the day by simulating the environment of the Christ Church cathedral!

At the core of the package is a room impulse response (RIR) generator based on the image source model that can handle

  • Convex and non-convex rooms

  • 2D/3D rooms

The core image source model and ray tracing modules are written in C++ for better performance.

The philosophy of the package is to abstract all necessary elements of an experiment using an object-oriented programming approach. Each of these elements is represented using a class and an experiment can be designed by combining these elements just as one would do in a real experiment.

Let’s imagine we want to simulate a delay-and-sum beamformer that uses a linear array with four microphones in a shoe box shaped room that contains only one source of sound. First, we create a room object, to which we add a microphone array object, and a sound source object. Then, the room object has methods to compute the RIR between source and receiver. The beamformer object then extends the microphone array class and has different methods to compute the weights, for example delay-and-sum weights. See the example below to get an idea of what the code looks like.

The Room class also allows one to process sound samples emitted by sources, effectively simulating the propagation of sound between sources and microphones. At the input of the microphones composing the beamformer, an STFT (short time Fourier transform) engine allows to quickly process the signals through the beamformer and evaluate the output.

Reference Implementations

In addition to its core image source model simulation, pyroomacoustics also contains a number of reference implementations of popular audio processing algorithms for

We use an object-oriented approach to abstract the details of specific algorithms, making them easy to compare. Each algorithm can be tuned through optional parameters. We have tried to pre-set values for the tuning parameters so that a run with the default values will in general produce reasonable results.

Datasets

In an effort to simplify the use of datasets, we provide a few wrappers that allow to quickly load and sort through some popular speech corpora. At the moment we support the following.

For more details, see the doc.

Quick Install

Install the package with pip:

pip install pyroomacoustics

A cookiecutter is available that generates a working simulation script for a few 2D/3D scenarios:

# if necessary install cookiecutter
pip install cookiecutter

# create the simulation script
cookiecutter gh:fakufaku/cookiecutter-pyroomacoustics-sim

# run the newly created script
python <chosen_script_name>.py

We have also provided a minimal Dockerfile example in order to install and run the package within a Docker container. Note that you should increase the memory of your containers to 4 GB. Less may also be sufficient, but this is necessary for building the C++ code extension. You can build the container with:

docker build -t pyroom_container .

And enter the container with:

docker run -it pyroom_container:latest /bin/bash

Dependencies

The minimal dependencies are:

numpy
scipy>=0.18.0
Cython
pybind11

where Cython is only needed to benefit from the compiled accelerated simulator. The simulator itself has a pure Python counterpart, so that this requirement could be ignored, but is much slower.

On top of that, some functionalities of the package depend on extra packages:

samplerate   # for resampling signals
matplotlib   # to create graphs and plots
sounddevice  # to play sound samples
mir_eval     # to evaluate performance of source separation in examples

The requirements.txt file lists all packages necessary to run all of the scripts in the examples folder.

This package is mainly developed under Python 3.6. The last supported version for Python 2.7 is 0.4.3.

Under Linux and Mac OS, the compiled accelerators require a valid compiler to be installed, typically this is GCC. When no compiler is present, the package will still install but default to the pure Python implementation which is much slower. On Windows, we provide pre-compiled Python Wheels for Python 3.5 and 3.6.

Example

Here is a quick example of how to create and visualize the response of a beamformer in a room.

import numpy as np
import matplotlib.pyplot as plt
import pyroomacoustics as pra

# Create a 4 by 6 metres shoe box room
room = pra.ShoeBox([4,6])

# Add a source somewhere in the room
room.add_source([2.5, 4.5])

# Create a linear array beamformer with 4 microphones
# with angle 0 degrees and inter mic distance 10 cm
R = pra.linear_2D_array([2, 1.5], 4, 0, 0.1)
room.add_microphone_array(pra.Beamformer(R, room.fs))

# Now compute the delay and sum weights for the beamformer
room.mic_array.rake_delay_and_sum_weights(room.sources[0][:1])

# plot the room and resulting beamformer
room.plot(freq=[1000, 2000, 4000, 8000], img_order=0)
plt.show()

More examples

A couple of detailed demos with illustrations are available.

A comprehensive set of examples covering most of the functionalities of the package can be found in the examples folder of the GitHub repository.

A video introduction to pyroomacoustics.

A video tutorial series on using pyroomacoustics covering many advanced topics.

Authors

  • Robin Scheibler

  • Ivan Dokmanić

  • Sidney Barthe

  • Eric Bezzam

  • Hanjie Pan

How to contribute

If you would like to contribute, please clone the repository and send a pull request.

For more details, see our CONTRIBUTING page.

Academic publications

This package was developed to support academic publications. The package contains implementations for DOA algorithms and acoustic beamformers introduced in the following papers.

  • H. Pan, R. Scheibler, I. Dokmanic, E. Bezzam and M. Vetterli. FRIDA: FRI-based DOA estimation for arbitrary array layout, ICASSP 2017, New Orleans, USA, 2017.

  • I. Dokmanić, R. Scheibler and M. Vetterli. Raking the Cocktail Party, in IEEE Journal of Selected Topics in Signal Processing, vol. 9, num. 5, p. 825 - 836, 2015.

  • R. Scheibler, I. Dokmanić and M. Vetterli. Raking Echoes in the Time Domain, ICASSP 2015, Brisbane, Australia, 2015.

If you use this package in your own research, please cite our paper describing it.

R. Scheibler, E. Bezzam, I. Dokmanić, Pyroomacoustics: A Python package for audio room simulations and array processing algorithms, Proc. IEEE ICASSP, Calgary, CA, 2018.

License

Copyright (c) 2014-2021 EPFL-LCAV

Permission is hereby granted, free of charge, to any person obtaining a copy of
this software and associated documentation files (the "Software"), to deal in
the Software without restriction, including without limitation the rights to
use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies
of the Software, and to permit persons to whom the Software is furnished to do
so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyroomacoustics-0.8.0.tar.gz (35.1 MB view details)

Uploaded Source

Built Distributions

pyroomacoustics-0.8.0-cp312-cp312-win_amd64.whl (580.0 kB view details)

Uploaded CPython 3.12 Windows x86-64

pyroomacoustics-0.8.0-cp312-cp312-macosx_10_13_universal2.whl (1.1 MB view details)

Uploaded CPython 3.12 macOS 10.13+ universal2 (ARM64, x86-64)

pyroomacoustics-0.8.0-cp311-cp311-win_amd64.whl (578.9 kB view details)

Uploaded CPython 3.11 Windows x86-64

pyroomacoustics-0.8.0-cp311-cp311-macosx_10_9_universal2.whl (1.1 MB view details)

Uploaded CPython 3.11 macOS 10.9+ universal2 (ARM64, x86-64)

pyroomacoustics-0.8.0-cp310-cp310-win_amd64.whl (578.2 kB view details)

Uploaded CPython 3.10 Windows x86-64

pyroomacoustics-0.8.0-cp310-cp310-macosx_12_0_x86_64.whl (707.4 kB view details)

Uploaded CPython 3.10 macOS 12.0+ x86-64

pyroomacoustics-0.8.0-cp310-cp310-macosx_10_9_universal2.whl (1.1 MB view details)

Uploaded CPython 3.10 macOS 10.9+ universal2 (ARM64, x86-64)

pyroomacoustics-0.8.0-cp39-cp39-win_amd64.whl (571.1 kB view details)

Uploaded CPython 3.9 Windows x86-64

pyroomacoustics-0.8.0-cp39-cp39-macosx_12_0_x86_64.whl (708.1 kB view details)

Uploaded CPython 3.9 macOS 12.0+ x86-64

pyroomacoustics-0.8.0-cp38-cp38-win_amd64.whl (578.8 kB view details)

Uploaded CPython 3.8 Windows x86-64

pyroomacoustics-0.8.0-cp38-cp38-macosx_12_0_x86_64.whl (707.9 kB view details)

Uploaded CPython 3.8 macOS 12.0+ x86-64

File details

Details for the file pyroomacoustics-0.8.0.tar.gz.

File metadata

  • Download URL: pyroomacoustics-0.8.0.tar.gz
  • Upload date:
  • Size: 35.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.10

File hashes

Hashes for pyroomacoustics-0.8.0.tar.gz
Algorithm Hash digest
SHA256 aaa3804ecfabf48f1c716f9daac3ebb8d57097310f4475ebab74a3c657b03cf6
MD5 4a090a7075a104e886fec8afa5445bfc
BLAKE2b-256 a0c0df7a89614ecf6700f4d68a346f860cf00beb53ff69e8e1b297a817e79700

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.0-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.0-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 0266ccbc6d397a7ca8c3c0346c4cf1d359e6cbd69a8fbee89bcafc67f25041fa
MD5 414277449e8ac5eeb19cb692009e00be
BLAKE2b-256 15724c02262d86e17118ade05341a24cd22ed33fe50231da058f94148ad9e7f8

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.0-cp312-cp312-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.0-cp312-cp312-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 285255293b73de18bf51dc5e5fc5494116c607a2fb22909dab4602fdebf1aca8
MD5 9feb9249db3d21eb7c441e03a79a54cd
BLAKE2b-256 d9786e4bb90432e9870ad6e06d2372e8a502c56262138e6a887d802e05bd5e47

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.0-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.0-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 ad84c38b6fb5a0aff1062d7070356927c5f1590a6169317e3933f09944e192f0
MD5 83af6cf5228a6a3f20bc23b4a4c6cead
BLAKE2b-256 a9d48e6be6436df5e3698dfd2e3565403241aa65d0cc944a1badc3788bcbaf00

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.0-cp311-cp311-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.0-cp311-cp311-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 2ec3bc181e60fc9b8ff85eb373eddb7d11404e86534b4a27c7dcf7e682b11fd9
MD5 d80839b1ede77fbcc254e851ec61a986
BLAKE2b-256 6497bc88e912d88a3d165e02a4075ee7076c2df42db6f0ae38f1fadea1ccd710

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.0-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.0-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 252d3e591bd64f835add9cac12d4fe3b3f00bc60a74159a98d59996677a9e824
MD5 9b804c530d1bcbde62f514d95a803dce
BLAKE2b-256 ced8c3e2b55bfbacc371646c6ecb81c1bdd67507270883de38ef5efc4cfcfeb9

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.0-cp310-cp310-macosx_12_0_x86_64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.0-cp310-cp310-macosx_12_0_x86_64.whl
Algorithm Hash digest
SHA256 15840da439a6d7bfde7b293abc82978491a4266b94da2e54eacb99a487d94bec
MD5 1aa7900177197bc6af79220084e8b6c7
BLAKE2b-256 d34782415a88296b6964a272d22f95ca3ac5a821f988e8a17fd8d074f8e0d50d

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.0-cp310-cp310-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.0-cp310-cp310-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 a6716c21cc293dae7d15bba34c6a40fcb17156de7dd70ef919b8fbf656e77070
MD5 c65ae24d03b54011a57b53ebbddce31d
BLAKE2b-256 f6e3d9645f2377f947ff9b80e50ca06618ee7c9f4c50bcb28c80ed879d088907

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.0-cp39-cp39-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.0-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 816b56b13b73f82585bef7a7dede95c3b6473653724cc20e79eb30ded2c634f2
MD5 e42a1a68d53ee71cada4d4bac9913416
BLAKE2b-256 689ac4e51c33547bf7e68ed2688e9e99a78ceaf60d931265de4e5c4bb890a306

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.0-cp39-cp39-macosx_12_0_x86_64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.0-cp39-cp39-macosx_12_0_x86_64.whl
Algorithm Hash digest
SHA256 f173d32ad174fb51dd9a70a64e809c22582e3b82b80f61e0152f90c767ae3cde
MD5 6021b29a1f23130f45e83f27058415ad
BLAKE2b-256 192c093c1f97f45ce8bc4ee56223c69f20814ed3a16583b6f0a5fbe441cf557e

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.0-cp38-cp38-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.0-cp38-cp38-win_amd64.whl
Algorithm Hash digest
SHA256 c6656f87e5e3cb28c34e4cc37e2e341e8355914293e1843afc3e0f6f12b7c58e
MD5 9973366394b32814b1c9241427e008a3
BLAKE2b-256 506b161c284d108e39e7d7b121867097af3c1ae6378eb5845e071cb4daba7cc4

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.0-cp38-cp38-macosx_12_0_x86_64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.0-cp38-cp38-macosx_12_0_x86_64.whl
Algorithm Hash digest
SHA256 34d3f742ab8070be0a24d582724f45a290d63663b2a3dabfe5f010210e6a5195
MD5 8b2a754219718eb3d2552ff95a3e8c79
BLAKE2b-256 80c5213899d3b1d3ad5ac075afcf68b19cb1f7b637708bf5f89e56b94b480843

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page