Skip to main content

A simple framework for room acoustics and audio processing in Python.

Project description

Pyroomacoustics logo
Documentation Status Test on mybinder Pyroomacoustics discord server

Summary

Pyroomacoustics is a software package aimed at the rapid development and testing of audio array processing algorithms. The content of the package can be divided into three main components:

  1. Intuitive Python object-oriented interface to quickly construct different simulation scenarios involving multiple sound sources and microphones in 2D and 3D rooms;

  2. Fast C++ implementation of the image source model and ray tracing for general polyhedral rooms to efficiently generate room impulse responses and simulate the propagation between sources and receivers;

  3. Reference implementations of popular algorithms for STFT, beamforming, direction finding, adaptive filtering, source separation, and single channel denoising.

Together, these components form a package with the potential to speed up the time to market of new algorithms by significantly reducing the implementation overhead in the performance evaluation step. Please refer to this notebook for a demonstration of the different components of this package.

Room Acoustics Simulation

Consider the following scenario.

Suppose, for example, you wanted to produce a radio crime drama, and it so happens that, according to the scriptwriter, the story line absolutely must culminate in a satanic mass that quickly degenerates into a violent shootout, all taking place right around the altar of the highly reverberant acoustic environment of Oxford’s Christ Church cathedral. To ensure that it sounds authentic, you asked the Dean of Christ Church for permission to record the final scene inside the cathedral, but somehow he fails to be convinced of the artistic merit of your production, and declines to give you permission. But recorded in a conventional studio, the scene sounds flat. So what do you do?

—Schnupp, Nelken, and King, Auditory Neuroscience, 2010

Faced with this difficult situation, pyroomacoustics can save the day by simulating the environment of the Christ Church cathedral!

At the core of the package is a room impulse response (RIR) generator based on the image source model that can handle

  • Convex and non-convex rooms

  • 2D/3D rooms

The core image source model and ray tracing modules are written in C++ for better performance.

The philosophy of the package is to abstract all necessary elements of an experiment using an object-oriented programming approach. Each of these elements is represented using a class and an experiment can be designed by combining these elements just as one would do in a real experiment.

Let’s imagine we want to simulate a delay-and-sum beamformer that uses a linear array with four microphones in a shoe box shaped room that contains only one source of sound. First, we create a room object, to which we add a microphone array object, and a sound source object. Then, the room object has methods to compute the RIR between source and receiver. The beamformer object then extends the microphone array class and has different methods to compute the weights, for example delay-and-sum weights. See the example below to get an idea of what the code looks like.

The Room class also allows one to process sound samples emitted by sources, effectively simulating the propagation of sound between sources and microphones. At the input of the microphones composing the beamformer, an STFT (short time Fourier transform) engine allows to quickly process the signals through the beamformer and evaluate the output.

Reference Implementations

In addition to its core image source model simulation, pyroomacoustics also contains a number of reference implementations of popular audio processing algorithms for

We use an object-oriented approach to abstract the details of specific algorithms, making them easy to compare. Each algorithm can be tuned through optional parameters. We have tried to pre-set values for the tuning parameters so that a run with the default values will in general produce reasonable results.

Datasets

In an effort to simplify the use of datasets, we provide a few wrappers that allow to quickly load and sort through some popular speech corpora. At the moment we support the following.

For more details, see the doc.

Quick Install

Install the package with pip:

pip install pyroomacoustics

A cookiecutter is available that generates a working simulation script for a few 2D/3D scenarios:

# if necessary install cookiecutter
pip install cookiecutter

# create the simulation script
cookiecutter gh:fakufaku/cookiecutter-pyroomacoustics-sim

# run the newly created script
python <chosen_script_name>.py

We have also provided a minimal Dockerfile example in order to install and run the package within a Docker container. Note that you should increase the memory of your containers to 4 GB. Less may also be sufficient, but this is necessary for building the C++ code extension. You can build the container with:

docker build -t pyroom_container .

And enter the container with:

docker run -it pyroom_container:latest /bin/bash

Dependencies

The minimal dependencies are:

numpy
scipy>=0.18.0
Cython
pybind11

where Cython is only needed to benefit from the compiled accelerated simulator. The simulator itself has a pure Python counterpart, so that this requirement could be ignored, but is much slower.

On top of that, some functionalities of the package depend on extra packages:

samplerate   # for resampling signals
matplotlib   # to create graphs and plots
sounddevice  # to play sound samples
mir_eval     # to evaluate performance of source separation in examples

The requirements.txt file lists all packages necessary to run all of the scripts in the examples folder.

This package is mainly developed under Python 3.6. The last supported version for Python 2.7 is 0.4.3.

Under Linux and Mac OS, the compiled accelerators require a valid compiler to be installed, typically this is GCC. When no compiler is present, the package will still install but default to the pure Python implementation which is much slower. On Windows, we provide pre-compiled Python Wheels for Python 3.5 and 3.6.

Example

Here is a quick example of how to create and visualize the response of a beamformer in a room.

import numpy as np
import matplotlib.pyplot as plt
import pyroomacoustics as pra

# Create a 4 by 6 metres shoe box room
room = pra.ShoeBox([4,6])

# Add a source somewhere in the room
room.add_source([2.5, 4.5])

# Create a linear array beamformer with 4 microphones
# with angle 0 degrees and inter mic distance 10 cm
R = pra.linear_2D_array([2, 1.5], 4, 0, 0.1)
room.add_microphone_array(pra.Beamformer(R, room.fs))

# Now compute the delay and sum weights for the beamformer
room.mic_array.rake_delay_and_sum_weights(room.sources[0][:1])

# plot the room and resulting beamformer
room.plot(freq=[1000, 2000, 4000, 8000], img_order=0)
plt.show()

More examples

A couple of detailed demos with illustrations are available.

A comprehensive set of examples covering most of the functionalities of the package can be found in the examples folder of the GitHub repository.

A video introduction to pyroomacoustics.

A video tutorial series on using pyroomacoustics covering many advanced topics.

Authors

  • Robin Scheibler

  • Ivan Dokmanić

  • Sidney Barthe

  • Eric Bezzam

  • Hanjie Pan

How to contribute

If you would like to contribute, please clone the repository and send a pull request.

For more details, see our CONTRIBUTING page.

Academic publications

This package was developed to support academic publications. The package contains implementations for DOA algorithms and acoustic beamformers introduced in the following papers.

  • H. Pan, R. Scheibler, I. Dokmanic, E. Bezzam and M. Vetterli. FRIDA: FRI-based DOA estimation for arbitrary array layout, ICASSP 2017, New Orleans, USA, 2017.

  • I. Dokmanić, R. Scheibler and M. Vetterli. Raking the Cocktail Party, in IEEE Journal of Selected Topics in Signal Processing, vol. 9, num. 5, p. 825 - 836, 2015.

  • R. Scheibler, I. Dokmanić and M. Vetterli. Raking Echoes in the Time Domain, ICASSP 2015, Brisbane, Australia, 2015.

If you use this package in your own research, please cite our paper describing it.

R. Scheibler, E. Bezzam, I. Dokmanić, Pyroomacoustics: A Python package for audio room simulations and array processing algorithms, Proc. IEEE ICASSP, Calgary, CA, 2018.

License

Copyright (c) 2014-2021 EPFL-LCAV

Permission is hereby granted, free of charge, to any person obtaining a copy of
this software and associated documentation files (the "Software"), to deal in
the Software without restriction, including without limitation the rights to
use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies
of the Software, and to permit persons to whom the Software is furnished to do
so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyroomacoustics-0.7.7.tar.gz (1.1 MB view details)

Uploaded Source

Built Distributions

pyroomacoustics-0.7.7-cp312-cp312-win_amd64.whl (553.5 kB view details)

Uploaded CPython 3.12 Windows x86-64

pyroomacoustics-0.7.7-cp312-cp312-macosx_10_9_universal2.whl (1.1 MB view details)

Uploaded CPython 3.12 macOS 10.9+ universal2 (ARM64, x86-64)

pyroomacoustics-0.7.7-cp311-cp311-win_amd64.whl (551.8 kB view details)

Uploaded CPython 3.11 Windows x86-64

pyroomacoustics-0.7.7-cp311-cp311-macosx_10_9_universal2.whl (1.1 MB view details)

Uploaded CPython 3.11 macOS 10.9+ universal2 (ARM64, x86-64)

pyroomacoustics-0.7.7-cp310-cp310-win_amd64.whl (551.1 kB view details)

Uploaded CPython 3.10 Windows x86-64

pyroomacoustics-0.7.7-cp310-cp310-macosx_12_0_x86_64.whl (676.1 kB view details)

Uploaded CPython 3.10 macOS 12.0+ x86-64

pyroomacoustics-0.7.7-cp310-cp310-macosx_10_9_universal2.whl (1.1 MB view details)

Uploaded CPython 3.10 macOS 10.9+ universal2 (ARM64, x86-64)

pyroomacoustics-0.7.7-cp39-cp39-win_amd64.whl (544.3 kB view details)

Uploaded CPython 3.9 Windows x86-64

pyroomacoustics-0.7.7-cp39-cp39-macosx_12_0_x86_64.whl (676.9 kB view details)

Uploaded CPython 3.9 macOS 12.0+ x86-64

pyroomacoustics-0.7.7-cp38-cp38-win_amd64.whl (551.8 kB view details)

Uploaded CPython 3.8 Windows x86-64

pyroomacoustics-0.7.7-cp38-cp38-macosx_12_0_x86_64.whl (676.7 kB view details)

Uploaded CPython 3.8 macOS 12.0+ x86-64

File details

Details for the file pyroomacoustics-0.7.7.tar.gz.

File metadata

  • Download URL: pyroomacoustics-0.7.7.tar.gz
  • Upload date:
  • Size: 1.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.9

File hashes

Hashes for pyroomacoustics-0.7.7.tar.gz
Algorithm Hash digest
SHA256 c4a43c26bddec7d44b772b2ef9fdec2b8fb0b43dd55f5975cbb461190368e4d3
MD5 7acbcb5cde196eb9673813f3089c1c45
BLAKE2b-256 ec3bb819469ec48bcae83cb0aa65eebe3c612a6e73178c46b654f66198160025

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.7.7-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.7.7-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 aef005fd884cbb92959d3907eedac24171528cb3cfa2c1aea1bc88467929e6ef
MD5 b1505d4246a60848e7733733d9013ce8
BLAKE2b-256 b23c638246ec38038b055a56ff5f6da398e04b82f963d9f4e1b1c450d7313b36

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.7.7-cp312-cp312-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.7.7-cp312-cp312-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 733ecc51c61427c6e5e35788954963d86472b6ee82689f57bb64a717d959b5a6
MD5 327f5f2651abe62d9133c40fc5375592
BLAKE2b-256 fdb696961d7d46a41721ea6b615b6bbf37734891fef94e0062a172adb0085556

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.7.7-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.7.7-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 7b8b37947b56f1d5cbb1743afb7e9456d50936abfe70b52baa01bf9d1267a7fd
MD5 c2315fd234fe9506ae759dbf5e4ef625
BLAKE2b-256 09512fb7e806cd258975aef6e0ca6a79d0af841a2da709ee21894a8523f0b814

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.7.7-cp311-cp311-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.7.7-cp311-cp311-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 100ac419e9ca34e3728b18e0b312f7afb792870317dceaa34f9730f215bf6e04
MD5 3d6a6ba4d402b7db1c602af7c2f3f830
BLAKE2b-256 856b0f1b04e90d2a8e3ca7dbfdc5d2a729bf439546b4b4698aa1bb29c8f7c448

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.7.7-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.7.7-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 14e686b9713bccc328712181b33289298bedd31a3c323f1ebaba3bdffa1afc80
MD5 a02b98fc7834fcb32994b3c85090b516
BLAKE2b-256 20222ba270473e1ed2827fb336c6e9745ba00d56ba51c15bbe01a0a005730486

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.7.7-cp310-cp310-macosx_12_0_x86_64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.7.7-cp310-cp310-macosx_12_0_x86_64.whl
Algorithm Hash digest
SHA256 e509e329fe78c3283d6d70527a4617eeb4b72eb4581a926b68507567bc53cfb9
MD5 50ac5f3bd231cfc2ac224de5d798a20c
BLAKE2b-256 8c8bdfba8a5b42c7ebf2df8b59886c31b0568d33555dae9fb67ea1c6b8fd2cea

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.7.7-cp310-cp310-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.7.7-cp310-cp310-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 c62f7c3e5445195d6db06b99b2b710aaef733c675313e55fd0098a25a0a33830
MD5 4c458ebee8fc1a220d31a623240e277e
BLAKE2b-256 8c9ce482a7a91d8cd7305cf5027e191162d656ab7d62c12e467d38ceaeab1278

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.7.7-cp39-cp39-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.7.7-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 278b9d660be5218fa6d15e00c8699a8369ef18a2d37ac638f620190311e44cad
MD5 c2eb0d153f951403bde59808503a6bbf
BLAKE2b-256 61fbf1f61156df1a517fe1587d962121f24639669c530a034c2851446ee3c193

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.7.7-cp39-cp39-macosx_12_0_x86_64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.7.7-cp39-cp39-macosx_12_0_x86_64.whl
Algorithm Hash digest
SHA256 f8edc715c219c2fe87e4e75cfd6afb9ad725b0ad32314b6abdb4c76ff3107860
MD5 fb5cb14da20b84667f7b454817e71bca
BLAKE2b-256 0b386ffe4b446e8a5277cf901bceb5fe1d8a73c72c74f3bcd768d7fe8de3e566

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.7.7-cp38-cp38-win_amd64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.7.7-cp38-cp38-win_amd64.whl
Algorithm Hash digest
SHA256 7f49f63bcdc1b36738689f46f7fefb1ac433d8fa07704e13d5361c9e02a0ed4d
MD5 aac6c8d37a7a971a5bf1d06ee31bc0c3
BLAKE2b-256 e378ccac3c827379c050cecf3b340b315f81c80961ebdb0c7b035a4e5b4859d7

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.7.7-cp38-cp38-macosx_12_0_x86_64.whl.

File metadata

File hashes

Hashes for pyroomacoustics-0.7.7-cp38-cp38-macosx_12_0_x86_64.whl
Algorithm Hash digest
SHA256 fb79866b44c2dc9b8178d606466292cb0753e8dd72bc4c7228375d4a7a08ca02
MD5 7efbdb8cad3a4012257d89a0a5e677df
BLAKE2b-256 6146f6f9844d9e75ac4004f1a781244f4844a575a4a8ba26f065333bbfe16262

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page