Skip to main content

A simple framework for room acoustics and audio processing in Python.

Project description

Pyroomacoustics logo
Documentation Status Test on mybinder Pyroomacoustics discord server

Summary

Pyroomacoustics is a software package aimed at the rapid development and testing of audio array processing algorithms. The content of the package can be divided into three main components:

  1. Intuitive Python object-oriented interface to quickly construct different simulation scenarios involving multiple sound sources and microphones in 2D and 3D rooms;

  2. Fast C++ implementation of the image source model and ray tracing for general polyhedral rooms to efficiently generate room impulse responses and simulate the propagation between sources and receivers;

  3. Reference implementations of popular algorithms for STFT, beamforming, direction finding, adaptive filtering, source separation, and single channel denoising.

Together, these components form a package with the potential to speed up the time to market of new algorithms by significantly reducing the implementation overhead in the performance evaluation step. Please refer to this notebook for a demonstration of the different components of this package.

Room Acoustics Simulation

Consider the following scenario.

Suppose, for example, you wanted to produce a radio crime drama, and it so happens that, according to the scriptwriter, the story line absolutely must culminate in a satanic mass that quickly degenerates into a violent shootout, all taking place right around the altar of the highly reverberant acoustic environment of Oxford’s Christ Church cathedral. To ensure that it sounds authentic, you asked the Dean of Christ Church for permission to record the final scene inside the cathedral, but somehow he fails to be convinced of the artistic merit of your production, and declines to give you permission. But recorded in a conventional studio, the scene sounds flat. So what do you do?

—Schnupp, Nelken, and King, Auditory Neuroscience, 2010

Faced with this difficult situation, pyroomacoustics can save the day by simulating the environment of the Christ Church cathedral!

At the core of the package is a room impulse response (RIR) generator based on the image source model that can handle

  • Convex and non-convex rooms

  • 2D/3D rooms

The core image source model and ray tracing modules are written in C++ for better performance.

The philosophy of the package is to abstract all necessary elements of an experiment using an object-oriented programming approach. Each of these elements is represented using a class and an experiment can be designed by combining these elements just as one would do in a real experiment.

Let’s imagine we want to simulate a delay-and-sum beamformer that uses a linear array with four microphones in a shoe box shaped room that contains only one source of sound. First, we create a room object, to which we add a microphone array object, and a sound source object. Then, the room object has methods to compute the RIR between source and receiver. The beamformer object then extends the microphone array class and has different methods to compute the weights, for example delay-and-sum weights. See the example below to get an idea of what the code looks like.

The Room class also allows one to process sound samples emitted by sources, effectively simulating the propagation of sound between sources and microphones. At the input of the microphones composing the beamformer, an STFT (short time Fourier transform) engine allows to quickly process the signals through the beamformer and evaluate the output.

Reference Implementations

In addition to its core image source model simulation, pyroomacoustics also contains a number of reference implementations of popular audio processing algorithms for

We use an object-oriented approach to abstract the details of specific algorithms, making them easy to compare. Each algorithm can be tuned through optional parameters. We have tried to pre-set values for the tuning parameters so that a run with the default values will in general produce reasonable results.

Datasets

In an effort to simplify the use of datasets, we provide a few wrappers that allow to quickly load and sort through some popular speech corpora. At the moment we support the following.

For more details, see the doc.

Quick Install

Install the package with pip:

pip install pyroomacoustics

A cookiecutter is available that generates a working simulation script for a few 2D/3D scenarios:

# if necessary install cookiecutter
pip install cookiecutter

# create the simulation script
cookiecutter gh:fakufaku/cookiecutter-pyroomacoustics-sim

# run the newly created script
python <chosen_script_name>.py

We have also provided a minimal Dockerfile example in order to install and run the package within a Docker container. Note that you should increase the memory of your containers to 4 GB. Less may also be sufficient, but this is necessary for building the C++ code extension. You can build the container with:

docker build -t pyroom_container .

And enter the container with:

docker run -it pyroom_container:latest /bin/bash

Dependencies

The minimal dependencies are:

numpy
scipy>=0.18.0
Cython
pybind11

where Cython is only needed to benefit from the compiled accelerated simulator. The simulator itself has a pure Python counterpart, so that this requirement could be ignored, but is much slower.

On top of that, some functionalities of the package depend on extra packages:

samplerate   # for resampling signals
matplotlib   # to create graphs and plots
sounddevice  # to play sound samples
mir_eval     # to evaluate performance of source separation in examples

The requirements.txt file lists all packages necessary to run all of the scripts in the examples folder.

This package is mainly developed under Python 3.6. The last supported version for Python 2.7 is 0.4.3.

Under Linux and Mac OS, the compiled accelerators require a valid compiler to be installed, typically this is GCC. When no compiler is present, the package will still install but default to the pure Python implementation which is much slower. On Windows, we provide pre-compiled Python Wheels for Python 3.5 and 3.6.

Example

Here is a quick example of how to create and visualize the response of a beamformer in a room.

import numpy as np
import matplotlib.pyplot as plt
import pyroomacoustics as pra

# Create a 4 by 6 metres shoe box room
room = pra.ShoeBox([4,6])

# Add a source somewhere in the room
room.add_source([2.5, 4.5])

# Create a linear array beamformer with 4 microphones
# with angle 0 degrees and inter mic distance 10 cm
R = pra.linear_2D_array([2, 1.5], 4, 0, 0.1)
room.add_microphone_array(pra.Beamformer(R, room.fs))

# Now compute the delay and sum weights for the beamformer
room.mic_array.rake_delay_and_sum_weights(room.sources[0][:1])

# plot the room and resulting beamformer
room.plot(freq=[1000, 2000, 4000, 8000], img_order=0)
plt.show()

More examples

A couple of detailed demos with illustrations are available.

A comprehensive set of examples covering most of the functionalities of the package can be found in the examples folder of the GitHub repository.

A video introduction to pyroomacoustics.

A video tutorial series on using pyroomacoustics covering many advanced topics.

Authors

  • Robin Scheibler

  • Ivan Dokmanić

  • Sidney Barthe

  • Eric Bezzam

  • Hanjie Pan

How to contribute

If you would like to contribute, please clone the repository and send a pull request.

For more details, see our CONTRIBUTING page.

Academic publications

This package was developed to support academic publications. The package contains implementations for DOA algorithms and acoustic beamformers introduced in the following papers.

  • H. Pan, R. Scheibler, I. Dokmanic, E. Bezzam and M. Vetterli. FRIDA: FRI-based DOA estimation for arbitrary array layout, ICASSP 2017, New Orleans, USA, 2017.

  • I. Dokmanić, R. Scheibler and M. Vetterli. Raking the Cocktail Party, in IEEE Journal of Selected Topics in Signal Processing, vol. 9, num. 5, p. 825 - 836, 2015.

  • R. Scheibler, I. Dokmanić and M. Vetterli. Raking Echoes in the Time Domain, ICASSP 2015, Brisbane, Australia, 2015.

If you use this package in your own research, please cite our paper describing it.

R. Scheibler, E. Bezzam, I. Dokmanić, Pyroomacoustics: A Python package for audio room simulations and array processing algorithms, Proc. IEEE ICASSP, Calgary, CA, 2018.

License

Copyright (c) 2014-2021 EPFL-LCAV

Permission is hereby granted, free of charge, to any person obtaining a copy of
this software and associated documentation files (the "Software"), to deal in
the Software without restriction, including without limitation the rights to
use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies
of the Software, and to permit persons to whom the Software is furnished to do
so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyroomacoustics-0.8.4.tar.gz (35.1 MB view details)

Uploaded Source

Built Distributions

pyroomacoustics-0.8.4-cp312-cp312-win_amd64.whl (34.4 MB view details)

Uploaded CPython 3.12Windows x86-64

pyroomacoustics-0.8.4-cp312-cp312-macosx_10_13_universal2.whl (34.9 MB view details)

Uploaded CPython 3.12macOS 10.13+ universal2 (ARM64, x86-64)

pyroomacoustics-0.8.4-cp311-cp311-win_amd64.whl (34.4 MB view details)

Uploaded CPython 3.11Windows x86-64

pyroomacoustics-0.8.4-cp311-cp311-macosx_10_9_universal2.whl (34.9 MB view details)

Uploaded CPython 3.11macOS 10.9+ universal2 (ARM64, x86-64)

pyroomacoustics-0.8.4-cp310-cp310-win_amd64.whl (34.4 MB view details)

Uploaded CPython 3.10Windows x86-64

pyroomacoustics-0.8.4-cp310-cp310-macosx_13_0_x86_64.whl (34.5 MB view details)

Uploaded CPython 3.10macOS 13.0+ x86-64

pyroomacoustics-0.8.4-cp310-cp310-macosx_10_9_universal2.whl (34.9 MB view details)

Uploaded CPython 3.10macOS 10.9+ universal2 (ARM64, x86-64)

pyroomacoustics-0.8.4-cp39-cp39-win_amd64.whl (34.4 MB view details)

Uploaded CPython 3.9Windows x86-64

pyroomacoustics-0.8.4-cp39-cp39-macosx_13_0_x86_64.whl (34.5 MB view details)

Uploaded CPython 3.9macOS 13.0+ x86-64

pyroomacoustics-0.8.4-cp38-cp38-win_amd64.whl (34.4 MB view details)

Uploaded CPython 3.8Windows x86-64

pyroomacoustics-0.8.4-cp38-cp38-macosx_13_0_x86_64.whl (34.5 MB view details)

Uploaded CPython 3.8macOS 13.0+ x86-64

File details

Details for the file pyroomacoustics-0.8.4.tar.gz.

File metadata

  • Download URL: pyroomacoustics-0.8.4.tar.gz
  • Upload date:
  • Size: 35.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.12

File hashes

Hashes for pyroomacoustics-0.8.4.tar.gz
Algorithm Hash digest
SHA256 cb70e511c41952f0f578355ae261b72acc4b688d995d02780ac0847d0b400d99
MD5 1d9c5c1e0c6dd7d666875a33a677ac73
BLAKE2b-256 bb3a7bc15b0d46e1e2de5f4a6a1a568f52316f003530061e0caff3e3365b32fc

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.4-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.4-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 b00acce7e7501b85ee263efd70bc765f08f0cabf713a289ae8a77b923078aeae
MD5 cb883d1ec96c73c4f52fcad76cffd65a
BLAKE2b-256 178473b83f4d14aa4f789bc17becbddf00d5a269d488a1a13e061101437e431d

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.4-cp312-cp312-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.4-cp312-cp312-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 fac868b720c18ec60f9dbbfe92d2f5dfab1b50b0e7bbce00b35e8f2b37337181
MD5 38c99afa49d403dacf9662ea700875ff
BLAKE2b-256 f602d4fb192a7da1cc11284d3771f6c5b0ec4c77938cce80c4285510c4fabdb3

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.4-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.4-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 313c71e468f9e4a812514ea0811d5625344823b3bb23333420d22939aeb8822d
MD5 04b2616e2dcd872e07a536002b600481
BLAKE2b-256 daa3a7af33408d31707ea30fd5435dff4933fad1c6c7e3fdf3653ca425eb5949

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.4-cp311-cp311-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.4-cp311-cp311-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 f26a5a04e235594783a3c92b7213bbd70b615865061bdf80c148756fd91db764
MD5 baebbff4d227cdee42a467c84a72574b
BLAKE2b-256 17e7c27e250777fc5ddb6037c46203def80a58fad1e45e01ff4ef702c2596931

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.4-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.4-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 9fb6b35f09cdeadb8fb31d55dc5b243c24b4b45beaf5c5fd8fc4ef1a2f0b0843
MD5 645b8728f01bc0f1893b5d686b910a73
BLAKE2b-256 56f5623e516ba8403f98b1ca70f07f9559759a06167ae8a267432d456a4b23c8

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.4-cp310-cp310-macosx_13_0_x86_64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.4-cp310-cp310-macosx_13_0_x86_64.whl
Algorithm Hash digest
SHA256 4ad2450861a33e272f1b6be64f9f2130199bd2b25cec3f06d4104d51122a9893
MD5 e366eafbc1ac6a5eb985009fb1d6efd9
BLAKE2b-256 e6b9162eb75591a8d773084c51dbd0fc89db8438dc51b4322a8091c318a5a15c

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.4-cp310-cp310-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.4-cp310-cp310-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 30cd1fee4b5d07c176561656c4b7687fe18b72acd3fa7d400a08782fbec1cf20
MD5 859352dce2e86474883102474fcd7ba0
BLAKE2b-256 f42ab73810c56a155fc48e51c5389851c1239146ec2178f355dd0d2e04a2692d

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.4-cp39-cp39-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.4-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 53a07b80b876ff9a89848aa1e092871dc685b878b08662f5ce26970f3f12570f
MD5 6c459ca53be394aa6ebab9694c91251b
BLAKE2b-256 c18c3621f0ad47b791a4d92d06bf9cff00862e65cecce3f8ad52aacb766cabf8

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.4-cp39-cp39-macosx_13_0_x86_64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.4-cp39-cp39-macosx_13_0_x86_64.whl
Algorithm Hash digest
SHA256 425a9375ffd8f2130b2e6df5532879812afe47e14d464910f8426a780f466ead
MD5 32e97145a2a56de5d94dad319116b223
BLAKE2b-256 dc086ec8be8714275fdd1126ba595064cb80f379d674ddc002e63870192fe245

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.4-cp38-cp38-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.4-cp38-cp38-win_amd64.whl
Algorithm Hash digest
SHA256 e8f15db65f88e629e402f091973b9368f92199431cc010a228696b53651bd96a
MD5 da3e2e524a83941a6c430e108bc29860
BLAKE2b-256 18f7c2dcaba7233081a2903f1b8ef1081fd1278bae8fd757c89e1f3d8ec53b8b

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.8.4-cp38-cp38-macosx_13_0_x86_64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.8.4-cp38-cp38-macosx_13_0_x86_64.whl
Algorithm Hash digest
SHA256 9f3ee2dee53cf02612fd767250ca701830bcba0261c94bb61b2c29cae1e5b5be
MD5 1a0d9ce360a3c3c7c78498bca974243c
BLAKE2b-256 52700ccc396dfaee30cf8fda89dac3d36e1175d740573259b838058dc1082b67

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page