Skip to main content

PyTorch-based room impulse response (RIR) simulation toolkit for static and dynamic scenes.

Project description

TorchRIR

A PyTorch-based room impulse response (RIR) simulation toolkit with a clean API and GPU support. This project has been developed with substantial assistance from Codex.

[!WARNING] TorchRIR is under active development and may contain bugs or breaking changes. Please validate results for your use case. If you find bugs or have feature requests, please open an issue. Contributions are welcome.

Installation

pip install torchrir

Library Comparison

Feature torchrir gpuRIR pyroomacoustics rir-generator
๐ŸŽฏ Dynamic Sources โœ… ๐ŸŸก Single moving source ๐ŸŸก Manual loop โŒ
๐ŸŽค Dynamic Microphones โœ… โŒ ๐ŸŸก Manual loop โŒ
๐Ÿ–ฅ๏ธ CPU โœ… โŒ โœ… โœ…
๐Ÿงฎ CUDA โœ… โœ… โŒ โŒ
๐ŸŽ MPS โœ… โŒ โŒ โŒ
๐Ÿ“Š Scene Plot โœ… โŒ โœ… โŒ
๐ŸŽž๏ธ Dynamic Scene GIF โœ… โŒ ๐ŸŸก Manual animation script โŒ
๐Ÿ—‚๏ธ Dataset Build โœ… โŒ โœ… โŒ
๐ŸŽ›๏ธ Signal Processing โŒ Scope out โŒ โœ… โŒ
๐Ÿงฑ Non-shoebox Geometry ๐Ÿšง Candidate โŒ โœ… โŒ
๐ŸŒ Geometric Acoustics ๐Ÿšง Candidate โŒ โœ… โŒ

Legend: โœ… native support, ๐ŸŸก manual setup, ๐Ÿšง candidate (not yet implemented), โŒ unavailable

For detailed notes and equations, see Read the Docs: Library Comparisons.

CUDA CI (GitHub Actions)

  • CUDA tests run in .github/workflows/cuda-ci.yml on a self-hosted runner with labels: self-hosted, linux, x64, cuda.
  • The workflow validates installation via uv sync --group test, checks torch.cuda.is_available(), runs tests/test_device_parity.py with -k cuda, and then tries to install gpuRIR from GitHub.
  • If gpuRIR installs successfully, the workflow runs tests/test_compare_gpurir.py (static + dynamic RIR comparisons). If installation fails, those comparison tests are skipped without failing the whole CUDA CI job.

Examples

  • examples/static.py: fixed sources and microphones with configurable mic count (default: binaural).
    uv run python examples/static.py --plot
  • examples/dynamic_src.py: moving sources, fixed microphones.
    uv run python examples/dynamic_src.py --plot
  • examples/dynamic_mic.py: fixed sources, moving microphones.
    uv run python examples/dynamic_mic.py --plot
  • examples/cli.py: unified CLI for static/dynamic scenes with JSON/YAML configs.
    uv run python examples/cli.py --mode static --plot
  • examples/build_dynamic_dataset.py: small dynamic dataset generation script (CMU ARCTIC / LibriSpeech; fixed room/mics, randomized source motion).
    uv run python examples/build_dynamic_dataset.py --dataset cmu_arctic --num-scenes 4 --num-sources 2
  • torchrir.datasets.dynamic_cmu_arctic: oobss-compatible dynamic CMU ARCTIC builder CLI.
    python -m torchrir.datasets.dynamic_cmu_arctic --cmu-root datasets/cmu_arctic --n-scenes 2 --overwrite-dataset
  • examples/benchmark_device.py: CPU/GPU benchmark for RIR simulation.
    uv run python examples/benchmark_device.py --dynamic

Dataset Notices

Dataset API Quick Guide

  • torchrir.datasets.CmuArcticDataset(root, speaker=..., download=...)
    • Accepted speaker: aew, ahw, aup, awb, axb, bdl, clb, eey, fem, gka, jmk, ksp, ljm, lnh, rms, rxr, slp, slt
    • Invalid speaker raises ValueError.
    • Missing local files with download=False raises FileNotFoundError.
  • torchrir.datasets.LibriSpeechDataset(root, subset=..., speaker=..., download=...)
    • Accepted subset: dev-clean, dev-other, test-clean, test-other, train-clean-100, train-clean-360, train-other-500
    • Invalid subset raises ValueError.
    • Missing subset/speaker paths with download=False raise FileNotFoundError.
  • torchrir.datasets.build_dynamic_cmu_arctic_dataset(...)
    • Builds oobss-compatible scene folders with mixture.wav, source_XX.wav, metadata.json, and source_info.json.
    • Static layout images (room_layout_2d.png, room_layout_3d.png) and optional layout videos (room_layout_2d.mp4, room_layout_3d.mp4) are generated, with source-index annotations by default.
    • Default behavior includes n_sources=3, moving speed range 0.3-0.8 m/s, and motion profile ratios 0-35%, 35-65%, 65-100%.
  • Local-only (no download) example:
    from pathlib import Path
    from torchrir.datasets import CmuArcticDataset, LibriSpeechDataset
    
    cmu = CmuArcticDataset(Path("datasets/cmu_arctic"), speaker="bdl", download=False)
    libri = LibriSpeechDataset(
        Path("datasets/librispeech"),
        subset="train-clean-100",
        speaker="103",
        download=False,
    )
    
  • Full dataset usage details, expected directory layout, and invalid-input handling: Read the Docs: Datasets

Core API Overview

  • Geometry: Room, Source, MicrophoneArray
  • Scene models: StaticScene, DynamicScene (Scene is deprecated)
  • Static RIR: torchrir.sim.simulate_rir
  • Dynamic RIR: torchrir.sim.simulate_dynamic_rir
  • Simulator object: torchrir.sim.ISMSimulator(max_order=..., tmax=... | nsample=...)
  • Dynamic convolution: torchrir.signal.DynamicConvolver
  • Audio I/O:
    • wav-specific: torchrir.io.load_wav, torchrir.io.save_wav, torchrir.io.info_wav
    • backend-supported formats: torchrir.io.load_audio, torchrir.io.save_audio, torchrir.io.info_audio
    • metadata-preserving: torchrir.io.AudioData, torchrir.io.load_audio_data
  • Metadata export: torchrir.io.build_metadata, torchrir.io.save_metadata_json

Module Layout (for contributors)

  • torchrir.sim: simulation backends (ISM implementation lives under torchrir.sim.ism)
  • torchrir.signal: convolution utilities and dynamic convolver
  • torchrir.geometry: array geometries, sampling, trajectories
  • torchrir.viz: plotting and GIF/MP4 animation helpers
    • Default plot style follows SciencePlots Grid (science + grid).
  • torchrir.models: room/scene/result data models
  • torchrir.io: audio I/O and metadata serialization (*_wav for wav-only, *_audio for backend-supported formats)
  • torchrir.util: shared math/tensor/device helpers
  • torchrir.logging: logging utilities
  • torchrir.config: simulation configuration objects

Design Notes

  • Scene typing is explicit: use StaticScene for fixed geometry and DynamicScene for trajectory-based simulation.
  • DynamicScene accepts tensor-like trajectories (e.g., lists) and normalizes them to tensors internally.
  • Scene remains as a backward-compatibility wrapper and emits DeprecationWarning.
  • Scene.validate() performs validation without emitting additional deprecation warnings.
  • ISMSimulator fails fast when max_order or tmax conflicts with the provided SimulationConfig.
  • Model dataclasses are frozen, but tensor payloads remain mutable (shallow immutability).
  • torchrir.load / torchrir.save and torchrir.io.load / save / info are deprecated compatibility aliases.
from torchrir import MicrophoneArray, Room, Source
from torchrir.sim import simulate_rir
from torchrir.signal import DynamicConvolver

room = Room.shoebox(size=[6.0, 4.0, 3.0], fs=16000, beta=[0.9] * 6)
sources = Source.from_positions([[1.0, 2.0, 1.5]])
mics = MicrophoneArray.from_positions([[2.0, 2.0, 1.5]])

rir = simulate_rir(room=room, sources=sources, mics=mics, max_order=6, tmax=0.3)
# For dynamic scenes, compute rirs with torchrir.sim.simulate_dynamic_rir and convolve:
# y = DynamicConvolver(mode="trajectory").convolve(signal, rirs)

For detailed documentation: Read the Docs

Future Work

  • Advanced room geometry pipeline beyond shoebox rooms (e.g., irregular polygons/meshes and boundary handling).
    Motivation: pyroomacoustics#393, pyroomacoustics#405
  • General reflection/path capping controls (e.g., first-K, strongest-K, or energy-threshold-based path selection).
    Motivation: pyroomacoustics#338
  • Microphone hardware response modeling (frequency response, sensitivity, and self-noise).
    Motivation: pyroomacoustics#394
  • Near-field speech source modeling for more realistic close-talk scenarios.
    Motivation: pyroomacoustics#417
  • Integrated 3D spatial response visualization (e.g., array/directivity beam-pattern rendering).
    Motivation: pyroomacoustics#397

Related Libraries

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

torchrir-0.8.1.tar.gz (76.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

torchrir-0.8.1-py3-none-any.whl (81.9 kB view details)

Uploaded Python 3

File details

Details for the file torchrir-0.8.1.tar.gz.

File metadata

  • Download URL: torchrir-0.8.1.tar.gz
  • Upload date:
  • Size: 76.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.4 {"installer":{"name":"uv","version":"0.10.4","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for torchrir-0.8.1.tar.gz
Algorithm Hash digest
SHA256 33eca3dc47808d129c02d8a0e97e30fd5ef2f73609ab76c8f8594f7ff90df5db
MD5 39bf90419b2ff61a6b014182a8f8ed72
BLAKE2b-256 4083b741fcac8cde993aec9c555c5be1bbeab7902cdbf5b16cb9136dcb517ba5

See more details on using hashes here.

File details

Details for the file torchrir-0.8.1-py3-none-any.whl.

File metadata

  • Download URL: torchrir-0.8.1-py3-none-any.whl
  • Upload date:
  • Size: 81.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.4 {"installer":{"name":"uv","version":"0.10.4","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for torchrir-0.8.1-py3-none-any.whl
Algorithm Hash digest
SHA256 3932714910c3aa397c693154281392cc7063ca228cf8f0e4e86a3b41fc959245
MD5 9e1aa50027eb3cec5406e00243233500
BLAKE2b-256 7557792b08562dad09d54723834dea0b935bab43b09a99e893ae4b0428413842

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page