Skip to main content

A simple framework for room acoustics and audio processing in Python.

Project description

.. image:: https://github.com/LCAV/pyroomacoustics/raw/master/logo/pyroomacoustics_logo_horizontal.png
:scale: 80 %
:alt: Pyroomacoustics logo
:align: left

------------------------------------------------------------------------------

.. image:: https://travis-ci.org/LCAV/pyroomacoustics.svg?branch=pypi-release
:target: https://travis-ci.org/LCAV/pyroomacoustics
.. image:: https://readthedocs.org/projects/pyroomacoustics/badge/?version=pypi-release
:target: http://pyroomacoustics.readthedocs.io/en/pypi-release/
:alt: Documentation Status

Summary
-------

Pyroomacoustics is a software package aimed at the rapid development
and testing of audio array processing algorithms. The content of the package
can be divided into three main components:

1. Intuitive Python object-oriented interface to quickly construct different simulation scenarios involving multiple sound sources and microphones in 2D and 3D rooms;
2. Fast C implementation of the image source model for general polyhedral rooms to efficiently generate room impulse responses and simulate the propagation between sources and receivers;
3. Reference implementations of popular algorithms for STFT, beamforming, direction finding, adaptive filtering, source separation, and single channel denoising.

Together, these components form a package with the potential to speed up the time to market
of new algorithms by significantly reducing the implementation overhead in the
performance evaluation step. Please refer to `this notebook <http://nbviewer.jupyter.org/github/LCAV/pyroomacoustics/blob/master/notebooks/pyroomacoustics_demo.ipynb>`_
for a demonstration of the different components of this package.

Room Acoustics Simulation
`````````````````````````

Consider the following scenario.

Suppose, for example, you wanted to produce a radio crime drama, and it
so happens that, according to the scriptwriter, the story line absolutely must culminate
in a satanic mass that quickly degenerates into a violent shootout, all taking place
right around the altar of the highly reverberant acoustic environment of Oxford's
Christ Church cathedral. To ensure that it sounds authentic, you asked the Dean of
Christ Church for permission to record the final scene inside the cathedral, but
somehow he fails to be convinced of the artistic merit of your production, and declines
to give you permission. But recorded in a conventional studio, the scene sounds flat.
So what do you do?

-- Schnupp, Nelken, and King, *Auditory Neuroscience*, 2010

Faced with this difficult situation, **pyroomacoustics** can save the day by simulating
the environment of the Christ Church cathedral!

At the core of the package is a room impulse response (RIR) generator based on the
image source model that can handle

* Convex and non-convex rooms
* 2D/3D rooms

Both a pure Python implementation and a C accelerator are included for maximum
speed and compatibility.

The philosophy of the package is to abstract all necessary elements of
an experiment using an object-oriented programming approach. Each of these elements
is represented using a class and an experiment can be designed by combining
these elements just as one would do in a real experiment.

Let's imagine we want to simulate a delay-and-sum beamformer that uses a linear
array with four microphones in a shoe box shaped room that contains only one
source of sound. First, we create a room object, to which we add a microphone
array object, and a sound source object. Then, the room object has methods
to compute the RIR between source and receiver. The beamformer object then extends
the microphone array class and has different methods to compute the weights, for
example delay-and-sum weights. See the example below to get an idea of what the
code looks like.

The `Room` class also allows one to process sound samples emitted by sources,
effectively simulating the propagation of sound between sources and microphones.
At the input of the microphones composing the beamformer, an STFT (short time
Fourier transform) engine allows to quickly process the signals through the
beamformer and evaluate the output.

Reference Implementations
`````````````````````````

In addition to its core image source model simulation, **pyroomacoustics**
also contains a number of reference implementations of popular audio processing
algorithms for

* `Short time Fourier transform <http://pyroomacoustics.readthedocs.io/en/pypi-release/pyroomacoustics.transform.stft.html>`_ (block + online)
* `beamforming <http://pyroomacoustics.readthedocs.io/en/pypi-release/pyroomacoustics.beamforming.html>`_
* `direction of arrival <http://pyroomacoustics.readthedocs.io/en/pypi-release/pyroomacoustics.doa.html>`_ (DOA) finding
* `adaptive filtering <http://pyroomacoustics.readthedocs.io/en/pypi-release/pyroomacoustics.adaptive.html>`_ (NLMS, RLS)
* `blind source separation <http://pyroomacoustics.readthedocs.io/en/pypi-release/pyroomacoustics.bss.html>`_ (AuxIVA, Trinicon, ILRMA, SparseAuxIVA, FastMNMF)
* `single channel denoising <https://pyroomacoustics.readthedocs.io/en/pypi-release/pyroomacoustics.denoise.html>`_ (Spectral Subtraction, Subspace, Iterative Wiener)

We use an object-oriented approach to abstract the details of
specific algorithms, making them easy to compare. Each algorithm can be tuned through optional parameters. We have tried to
pre-set values for the tuning parameters so that a run with the default values
will in general produce reasonable results.

Datasets
````````
In an effort to simplify the use of datasets, we provide a few wrappers that
allow to quickly load and sort through some popular speech corpora. At the
moment we support the following.

* `CMU ARCTIC <http://www.festvox.org/cmu_arctic/>`_
* `TIMIT <https://catalog.ldc.upenn.edu/ldc93s1>`_
* `Google Speech Commands Dataset <https://research.googleblog.com/2017/08/launching-speech-commands-dataset.html>`_

For more details, see the `doc <http://pyroomacoustics.readthedocs.io/en/pypi-release/pyroomacoustics.datasets.html>`_.

Quick Install
-------------

Install the package with pip::

pip install pyroomacoustics

A `cookiecutter <https://github.com/fakufaku/cookiecutter-pyroomacoustics-sim>`_
is available that generates a working simulation script for a few 2D/3D
scenarios::

# if necessary install cookiecutter
pip install cookiecutter

# create the simulation script
cookiecutter gh:fakufaku/cookiecutter-pyroomacoustics-sim

# run the newly created script
python <chosen_script_name>.py

Dependencies
------------

The minimal dependencies are::

numpy
scipy>=0.18.0
Cython

where ``Cython`` is only needed to benefit from the compiled accelerated simulator.
The simulator itself has a pure Python counterpart, so that this requirement could
be ignored, but is much slower.

On top of that, some functionalities of the package depend on extra packages::

samplerate # for resampling signals
matplotlib # to create graphs and plots
sounddevice # to play sound samples
mir_eval # to evaluate performance of source separation in examples

The ``requirements.txt`` file lists all packages necessary to run all of the
scripts in the ``examples`` folder.

This package is mainly developed under Python 3.5. We try as much as possible to keep
things compatible with Python 2.7 and run tests and builds under both. However, the tests
code coverage is far from 100% and it might happen that we break some things in Python 2.7 from
time to time. We apologize in advance for that.

Under Linux and Mac OS, the compiled accelerators require a valid compiler to
be installed, typically this is GCC. When no compiler is present, the package
will still install but default to the pure Python implementation which is much
slower. On Windows, we provide pre-compiled Python Wheels for Python 3.5 and
3.6.

Example
-------

Here is a quick example of how to create and visual the response of a
beamformer in a room.

.. code-block:: python

import numpy as np
import matplotlib.pyplot as plt
import pyroomacoustics as pra

# Create a 4 by 6 metres shoe box room
room = pra.ShoeBox([4,6])

# Add a source somewhere in the room
room.add_source([2.5, 4.5])

# Create a linear array beamformer with 4 microphones
# with angle 0 degrees and inter mic distance 10 cm
R = pra.linear_2D_array([2, 1.5], 4, 0, 0.1)
room.add_microphone_array(pra.Beamformer(R, room.fs))

# Now compute the delay and sum weights for the beamformer
room.mic_array.rake_delay_and_sum_weights(room.sources[0][:1])

# plot the room and resulting beamformer
room.plot(freq=[1000, 2000, 4000, 8000], img_order=0)
plt.show()

More examples
-------------

A couple of `detailed demos with illustrations <https://github.com/LCAV/pyroomacoustics/tree/master/notebooks>`_ are available.

A comprehensive set of examples covering most of the functionalities
of the package can be found in the ``examples`` folder of the `GitHub
repository <https://github.com/LCAV/pyroomacoustics/tree/master/examples>`_.

Authors
-------

* Robin Scheibler
* Ivan Dokmanić
* Sidney Barthe
* Eric Bezzam
* Hanjie Pan

How to contribute
-----------------

If you would like to contribute, please clone the
`repository <http://github.com/LCAV/pyroomacoustics>`_ and send a pull request.

For more details, see our `CONTRIBUTING
<http://pyroomacoustics.readthedocs.io/en/pypi-release/contributing.html>`_
page.

Academic publications
---------------------

This package was developed to support academic publications. The package
contains implementations for DOA algorithms and acoustic beamformers introduced
in the following papers.

* H\. Pan, R. Scheibler, I. Dokmanic, E. Bezzam and M. Vetterli. *FRIDA: FRI-based DOA estimation for arbitrary array layout*, ICASSP 2017, New Orleans, USA, 2017.
* I\. Dokmanić, R. Scheibler and M. Vetterli. *Raking the Cocktail Party*, in IEEE Journal of Selected Topics in Signal Processing, vol. 9, num. 5, p. 825 - 836, 2015.
* R\. Scheibler, I. Dokmanić and M. Vetterli. *Raking Echoes in the Time Domain*, ICASSP 2015, Brisbane, Australia, 2015.

If you use this package in your own research, please cite `our paper describing it <https://arxiv.org/abs/1710.04196>`_.


R\. Scheibler, E. Bezzam, I. Dokmanić, *Pyroomacoustics: A Python package for audio room simulations and array processing algorithms*, Proc. IEEE ICASSP, Calgary, CA, 2018.

License
-------

::

Copyright (c) 2014-2018 EPFL-LCAV

Permission is hereby granted, free of charge, to any person obtaining a copy of
this software and associated documentation files (the "Software"), to deal in
the Software without restriction, including without limitation the rights to
use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies
of the Software, and to permit persons to whom the Software is furnished to do
so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyroomacoustics-0.2.0.tar.gz (170.8 kB view details)

Uploaded Source

Built Distributions

pyroomacoustics-0.2.0-cp37-cp37m-win_amd64.whl (277.5 kB view details)

Uploaded CPython 3.7m Windows x86-64

pyroomacoustics-0.2.0-cp37-cp37m-win32.whl (263.3 kB view details)

Uploaded CPython 3.7m Windows x86

pyroomacoustics-0.2.0-cp37-cp37m-macosx_10_9_x86_64.whl (282.4 kB view details)

Uploaded CPython 3.7m macOS 10.9+ x86-64

pyroomacoustics-0.2.0-cp36-cp36m-win_amd64.whl (277.6 kB view details)

Uploaded CPython 3.6m Windows x86-64

pyroomacoustics-0.2.0-cp36-cp36m-win32.whl (263.3 kB view details)

Uploaded CPython 3.6m Windows x86

pyroomacoustics-0.2.0-cp36-cp36m-macosx_10_7_x86_64.whl (283.7 kB view details)

Uploaded CPython 3.6m macOS 10.7+ x86-64

pyroomacoustics-0.2.0-cp35-cp35m-win_amd64.whl (275.8 kB view details)

Uploaded CPython 3.5m Windows x86-64

pyroomacoustics-0.2.0-cp35-cp35m-win32.whl (261.8 kB view details)

Uploaded CPython 3.5m Windows x86

pyroomacoustics-0.2.0-cp35-cp35m-macosx_10_6_x86_64.whl (281.6 kB view details)

Uploaded CPython 3.5m macOS 10.6+ x86-64

pyroomacoustics-0.2.0-cp27-cp27m-macosx_10_7_x86_64.whl (283.2 kB view details)

Uploaded CPython 2.7m macOS 10.7+ x86-64

File details

Details for the file pyroomacoustics-0.2.0.tar.gz.

File metadata

  • Download URL: pyroomacoustics-0.2.0.tar.gz
  • Upload date:
  • Size: 170.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/2.7.16

File hashes

Hashes for pyroomacoustics-0.2.0.tar.gz
Algorithm Hash digest
SHA256 d070171ae9e1157528db928eb105246ef9a131bc1c74f084fa39991ba22a9584
MD5 785e57c8049fcfd8fe58b2441b2c218a
BLAKE2b-256 78398169694419412cb5b9363ac48eaf9c94bf3c0653adcacb8d6c7fbf8d9a30

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.2.0-cp37-cp37m-win_amd64.whl.

File metadata

  • Download URL: pyroomacoustics-0.2.0-cp37-cp37m-win_amd64.whl
  • Upload date:
  • Size: 277.5 kB
  • Tags: CPython 3.7m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.1

File hashes

Hashes for pyroomacoustics-0.2.0-cp37-cp37m-win_amd64.whl
Algorithm Hash digest
SHA256 db04941ef5075aa720e4ffaafbfcc8c3f36b48d946cd27482a39cdec80147cbe
MD5 ca64cdb7682104cc670e484192817499
BLAKE2b-256 2e4ab4fb28625f8e1638343493dc20d18ccdd09bcc212e681f4a7e6289b074ac

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.2.0-cp37-cp37m-win32.whl.

File metadata

  • Download URL: pyroomacoustics-0.2.0-cp37-cp37m-win32.whl
  • Upload date:
  • Size: 263.3 kB
  • Tags: CPython 3.7m, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.1

File hashes

Hashes for pyroomacoustics-0.2.0-cp37-cp37m-win32.whl
Algorithm Hash digest
SHA256 6aca56e6148d696f31c080f644c6a737a2b560862303208387c17d4b7b0e4ff8
MD5 e0b3989cfd752d57318e546e7067cca7
BLAKE2b-256 56df66eceb11bc0603449ea99719bd3a22d2c63b2dd0bf010916877bee5afea7

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.2.0-cp37-cp37m-macosx_10_9_x86_64.whl.

File metadata

  • Download URL: pyroomacoustics-0.2.0-cp37-cp37m-macosx_10_9_x86_64.whl
  • Upload date:
  • Size: 282.4 kB
  • Tags: CPython 3.7m, macOS 10.9+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.4

File hashes

Hashes for pyroomacoustics-0.2.0-cp37-cp37m-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 f2279c651c9b1c04a80c9d9b002e73658290f4af1c4a2d8ef1754b8f4ca1a66c
MD5 1a9bb746ba6cfd084f04ab8b5f9fae4a
BLAKE2b-256 5fe1de84a2deca42010862ed5cfea271ecc7fb27eb9598146ab94594f0a980e2

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.2.0-cp36-cp36m-win_amd64.whl.

File metadata

  • Download URL: pyroomacoustics-0.2.0-cp36-cp36m-win_amd64.whl
  • Upload date:
  • Size: 277.6 kB
  • Tags: CPython 3.6m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.1

File hashes

Hashes for pyroomacoustics-0.2.0-cp36-cp36m-win_amd64.whl
Algorithm Hash digest
SHA256 e3a891977ef3d9702d70576503bc5f2577444e27ca531221bb6d929e8809ef10
MD5 33fa4d1d7caffd2b1bac31beb392946a
BLAKE2b-256 2d5d72581903dcaba22af022294deb290109424860f15ba951ddbb39a6d0490c

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.2.0-cp36-cp36m-win32.whl.

File metadata

  • Download URL: pyroomacoustics-0.2.0-cp36-cp36m-win32.whl
  • Upload date:
  • Size: 263.3 kB
  • Tags: CPython 3.6m, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.1

File hashes

Hashes for pyroomacoustics-0.2.0-cp36-cp36m-win32.whl
Algorithm Hash digest
SHA256 36a7f2b29ae780501aa91836b64ae49c67025b1a16d5f48ceba987c1048e2a8f
MD5 fff384849a6285d2f60d01b0f7b298dc
BLAKE2b-256 8ac1260e1b850ca88d9cfc77770338cbd271500a85a6902164eb82192bd34c01

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.2.0-cp36-cp36m-macosx_10_7_x86_64.whl.

File metadata

  • Download URL: pyroomacoustics-0.2.0-cp36-cp36m-macosx_10_7_x86_64.whl
  • Upload date:
  • Size: 283.7 kB
  • Tags: CPython 3.6m, macOS 10.7+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.6.9

File hashes

Hashes for pyroomacoustics-0.2.0-cp36-cp36m-macosx_10_7_x86_64.whl
Algorithm Hash digest
SHA256 886462e19c2e0fd343c31cf433074d529d08a6ad900e798ccf9de3215656d48c
MD5 c63017b1a5a518eae5fe3e006e59c9f7
BLAKE2b-256 2448dfc6c3f8600a444ce50e78048b3315ea9ce2a47db66567c3bc85192e89f3

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.2.0-cp35-cp35m-win_amd64.whl.

File metadata

  • Download URL: pyroomacoustics-0.2.0-cp35-cp35m-win_amd64.whl
  • Upload date:
  • Size: 275.8 kB
  • Tags: CPython 3.5m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.1

File hashes

Hashes for pyroomacoustics-0.2.0-cp35-cp35m-win_amd64.whl
Algorithm Hash digest
SHA256 f50ab97eb1a52bf7b99809673ee9bbed97d0eedaf8b36c8251df598910137f6d
MD5 49dfeee7d54a1039a17f4cfdacbb4d60
BLAKE2b-256 2497d30e56fd79a44c0423b36d490ac314ac71924666e15d205383917ee46a1a

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.2.0-cp35-cp35m-win32.whl.

File metadata

  • Download URL: pyroomacoustics-0.2.0-cp35-cp35m-win32.whl
  • Upload date:
  • Size: 261.8 kB
  • Tags: CPython 3.5m, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.1

File hashes

Hashes for pyroomacoustics-0.2.0-cp35-cp35m-win32.whl
Algorithm Hash digest
SHA256 650d238cc66f6cb5f4360a39cb6a012e10d26139a988164686f3f6600e14aa41
MD5 0e9d5cba9c2c3aaf8c2d167354230d66
BLAKE2b-256 ed4753c24d09689a9cf61b152f2426b66f3608e78c481e797c19af10adf9b235

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.2.0-cp35-cp35m-macosx_10_6_x86_64.whl.

File metadata

  • Download URL: pyroomacoustics-0.2.0-cp35-cp35m-macosx_10_6_x86_64.whl
  • Upload date:
  • Size: 281.6 kB
  • Tags: CPython 3.5m, macOS 10.6+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.5.6

File hashes

Hashes for pyroomacoustics-0.2.0-cp35-cp35m-macosx_10_6_x86_64.whl
Algorithm Hash digest
SHA256 bdfc1fb755561661d3e73a60ec03319da8c8c3da77a486238eae0a0f4ae80242
MD5 414c15ad71d4e7db2421df5bff466407
BLAKE2b-256 5f1261c6efb7999412e1dedb483950c4b0da68213c1107dfcdac96798b162ad6

See more details on using hashes here.

File details

Details for the file pyroomacoustics-0.2.0-cp27-cp27m-macosx_10_7_x86_64.whl.

File metadata

  • Download URL: pyroomacoustics-0.2.0-cp27-cp27m-macosx_10_7_x86_64.whl
  • Upload date:
  • Size: 283.2 kB
  • Tags: CPython 2.7m, macOS 10.7+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/2.7.16

File hashes

Hashes for pyroomacoustics-0.2.0-cp27-cp27m-macosx_10_7_x86_64.whl
Algorithm Hash digest
SHA256 b5fb3bd9f48034a6825ed11c1183a015a19012dde3cb7cac7c36106d411efb0a
MD5 e1437ded9ff5cf68fedf162515033cc7
BLAKE2b-256 774ce65462f01e3ac9380dc09eb96ee9d013432119875eefa8e42c485bb82257

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page