A simple framework for room acoustics and audio processing in Python.

Summary

Pyroomacoustics is a software package aimed at the rapid development and testing of audio array processing algorithms. The content of the package can be divided into three main components:

  1. Intuitive Python object-oriented interface to quickly construct different simulation scenarios involving multiple sound sources and microphones in 2D and 3D rooms;

  2. Fast C++ implementation of the image source model and ray tracing for general polyhedral rooms to efficiently generate room impulse responses and simulate the propagation between sources and receivers;

  3. Reference implementations of popular algorithms for STFT, beamforming, direction finding, adaptive filtering, source separation, and single channel denoising.

Together, these components form a package with the potential to shorten the time to market of new algorithms by significantly reducing the implementation overhead in the performance evaluation step. Please refer to this notebook for a demonstration of the different components of this package.

Room Acoustics Simulation

Consider the following scenario.

Suppose, for example, you wanted to produce a radio crime drama, and it so happens that, according to the scriptwriter, the story line absolutely must culminate in a satanic mass that quickly degenerates into a violent shootout, all taking place right around the altar of the highly reverberant acoustic environment of Oxford’s Christ Church cathedral. To ensure that it sounds authentic, you asked the Dean of Christ Church for permission to record the final scene inside the cathedral, but somehow he fails to be convinced of the artistic merit of your production, and declines to give you permission. But recorded in a conventional studio, the scene sounds flat. So what do you do?

—Schnupp, Nelken, and King, Auditory Neuroscience, 2010

Faced with this difficult situation, pyroomacoustics can save the day by simulating the environment of the Christ Church cathedral!

At the core of the package is a room impulse response (RIR) generator based on the image source model that can handle

  • Convex and non-convex rooms

  • 2D/3D rooms

The core image source model and ray tracing modules are written in C++ for better performance.
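
To give an idea of what the image source model computes, here is a minimal pure Python sketch for a 2D shoebox room, limited to first-order reflections. It is an illustration only and not the package's C++ implementation: the function names, the single frequency-independent reflection coefficient, and the speed of sound value are all assumptions made for this example.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at room temperature


def first_order_images(source, room_dim):
    """Mirror a 2D source across each of the four walls of a shoebox room.

    The room is assumed to span [0, Lx] x [0, Ly].
    """
    x, y = source
    lx, ly = room_dim
    return [
        (-x, y),          # reflection on the wall x = 0
        (2 * lx - x, y),  # reflection on the wall x = Lx
        (x, -y),          # reflection on the wall y = 0
        (x, 2 * ly - y),  # reflection on the wall y = Ly
    ]


def echo_list(source, mic, room_dim, reflection=0.8):
    """Return sorted (delay in seconds, amplitude) pairs.

    Covers the direct path and the four first-order echoes. The
    reflection coefficient is a hypothetical, frequency-independent value.
    """
    paths = [(source, 1.0)]
    paths += [(image, reflection) for image in first_order_images(source, room_dim)]
    echoes = []
    for position, gain in paths:
        distance = math.dist(position, mic)
        # Each image source behaves like a point source with 1 / (4 pi d) decay
        echoes.append((distance / SPEED_OF_SOUND, gain / (4 * math.pi * distance)))
    return sorted(echoes)


echoes = echo_list(source=(2.5, 4.5), mic=(2.0, 1.5), room_dim=(4.0, 6.0))
for delay, amplitude in echoes:
    print(f"{delay * 1e3:6.2f} ms  amplitude {amplitude:.4f}")
```

Placing one impulse per echo at the corresponding delay gives a crude room impulse response. Higher reflection orders are obtained by mirroring the image sources again.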

The philosophy of the package is to abstract all necessary elements of an experiment using an object-oriented programming approach. Each of these elements is represented using a class and an experiment can be designed by combining these elements just as one would do in a real experiment.

Let’s imagine we want to simulate a delay-and-sum beamformer that uses a linear array with four microphones in a shoebox-shaped room that contains only one source of sound. First, we create a room object, to which we add a microphone array object and a sound source object. The room object then provides methods to compute the RIR between source and receiver. The beamformer object extends the microphone array class and has different methods to compute the weights, for example delay-and-sum weights. See the example below to get an idea of what the code looks like.

The Room class also makes it possible to process sound samples emitted by sources, effectively simulating the propagation of sound between sources and microphones. At the input of the microphones composing the beamformer, an STFT (short-time Fourier transform) engine processes the signals quickly through the beamformer so that the output can be evaluated.

Reference Implementations

In addition to its core image source model simulation, pyroomacoustics also contains reference implementations of popular audio processing algorithms for STFT, beamforming, direction finding, adaptive filtering, source separation, and single channel denoising.

We use an object-oriented approach to abstract the details of specific algorithms, making them easy to compare. Each algorithm can be tuned through optional parameters. We have tried to pre-set values for the tuning parameters so that a run with the default values will in general produce reasonable results.

Datasets

To simplify the use of datasets, we provide a few wrappers for quickly loading and sorting through some popular speech corpora.

For the list of supported corpora and more details, see the doc.

Quick Install

Install the package with pip:

pip install pyroomacoustics

A cookiecutter is available that generates a working simulation script for a few 2D/3D scenarios:

# if necessary install cookiecutter
pip install cookiecutter

# create the simulation script
cookiecutter gh:fakufaku/cookiecutter-pyroomacoustics-sim

# run the newly created script
python <chosen_script_name>.py

We have also provided a minimal Dockerfile example to install and run the package within a Docker container. Building the C++ code extension is memory-intensive, so you should increase the memory of your containers to 4 GB, although less may also be sufficient. You can build the container with:

docker build -t pyroom_container .

And enter the container with:

docker run -it pyroom_container:latest /bin/bash

Dependencies

The minimal dependencies are:

numpy
scipy>=0.18.0
Cython
pybind11

where Cython is only needed to benefit from the compiled accelerated simulator. The simulator also has a pure Python counterpart, so this requirement can be ignored, but the pure Python version is much slower.

On top of that, some functionalities of the package depend on extra packages:

samplerate   # for resampling signals
matplotlib   # to create graphs and plots
sounddevice  # to play sound samples
mir_eval     # to evaluate performance of source separation in examples

The requirements.txt file lists all packages necessary to run all of the scripts in the examples folder.

This package is mainly developed under Python 3.6. The last supported version for Python 2.7 is 0.4.3.

Under Linux and macOS, the compiled accelerators require a working compiler to be installed, typically GCC. When no compiler is present, the package will still install but will default to the pure Python implementation, which is much slower. On Windows, we provide pre-compiled Python Wheels for Python 3.5 and 3.6.

Example

Here is a quick example of how to create and visualize the response of a beamformer in a room.

import numpy as np
import matplotlib.pyplot as plt
import pyroomacoustics as pra

# Create a 4 by 6 metres shoe box room
room = pra.ShoeBox([4,6])

# Add a source somewhere in the room
room.add_source([2.5, 4.5])

# Create a linear array beamformer with 4 microphones
# with angle 0 degrees and inter mic distance 10 cm
R = pra.linear_2D_array([2, 1.5], 4, 0, 0.1)
room.add_microphone_array(pra.Beamformer(R, room.fs))

# Now compute the delay and sum weights for the beamformer
room.mic_array.rake_delay_and_sum_weights(room.sources[0][:1])

# plot the room and resulting beamformer
room.plot(freq=[1000, 2000, 4000, 8000], img_order=0)
plt.show()
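
For readers curious about what delay-and-sum weights do, here is a minimal pure Python sketch of the idea for a single frequency. It is not the implementation behind rake_delay_and_sum_weights: the helper names are hypothetical, propagation attenuation is ignored, and the speed of sound is an assumed value.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value


def linear_array(center, n_mics, spacing):
    """Positions of a horizontal linear array centred on `center` (hypothetical helper)."""
    cx, cy = center
    offset = (n_mics - 1) / 2
    return [(cx + (m - offset) * spacing, cy) for m in range(n_mics)]


def steering_vector(mics, position, freq):
    """Phase at each microphone of a unit-amplitude wave emitted from `position`."""
    return [
        cmath.exp(-2j * math.pi * freq * math.dist(mic, position) / SPEED_OF_SOUND)
        for mic in mics
    ]


def delay_and_sum_weights(mics, source, freq):
    """Weights that compensate the propagation delays and average the channels."""
    return [a / len(mics) for a in steering_vector(mics, source, freq)]


def response(weights, mics, position, freq):
    """Magnitude of the beamformer output w^H a for a source at `position`."""
    a = steering_vector(mics, position, freq)
    return abs(sum(w.conjugate() * a_m for w, a_m in zip(weights, a)))


# Same geometry as the example above: 4 microphones, 10 cm apart
mics = linear_array(center=(2.0, 1.5), n_mics=4, spacing=0.1)
weights = delay_and_sum_weights(mics, source=(2.5, 4.5), freq=1000.0)

on_target = response(weights, mics, (2.5, 4.5), 1000.0)
off_target = response(weights, mics, (3.9, 1.5), 1000.0)
print(f"response towards the source: {on_target:.3f}")
print(f"response towards another point: {off_target:.3f}")
```

The response is largest at the source position because the weights align the phases of all channels there. Elsewhere the channels add up with mismatched phases and partially cancel.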

More examples

A couple of detailed demos with illustrations are available.

A comprehensive set of examples covering most of the functionalities of the package can be found in the examples folder of the GitHub repository.

A video introduction to pyroomacoustics.

A video tutorial series on using pyroomacoustics covering many advanced topics.

Authors

  • Robin Scheibler

  • Ivan Dokmanić

  • Sidney Barthe

  • Eric Bezzam

  • Hanjie Pan

How to contribute

If you would like to contribute, please clone the repository and send a pull request.

For more details, see our CONTRIBUTING page.

Academic publications

This package was developed to support academic publications. The package contains implementations for DOA algorithms and acoustic beamformers introduced in the following papers.

  • H. Pan, R. Scheibler, I. Dokmanić, E. Bezzam and M. Vetterli. FRIDA: FRI-based DOA estimation for arbitrary array layout, ICASSP 2017, New Orleans, USA, 2017.

  • I. Dokmanić, R. Scheibler and M. Vetterli. Raking the Cocktail Party, IEEE Journal of Selected Topics in Signal Processing, vol. 9, no. 5, pp. 825-836, 2015.

  • R. Scheibler, I. Dokmanić and M. Vetterli. Raking Echoes in the Time Domain, ICASSP 2015, Brisbane, Australia, 2015.

If you use this package in your own research, please cite our paper describing it.

R. Scheibler, E. Bezzam, I. Dokmanić, Pyroomacoustics: A Python package for audio room simulations and array processing algorithms, Proc. IEEE ICASSP, Calgary, CA, 2018.

License

Copyright (c) 2014-2021 EPFL-LCAV

Permission is hereby granted, free of charge, to any person obtaining a copy of
this software and associated documentation files (the "Software"), to deal in
the Software without restriction, including without limitation the rights to
use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies
of the Software, and to permit persons to whom the Software is furnished to do
so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.
