Skip to main content

A benchmarking suite for robust audio watermarking.

Project description

RAW-Bench: Robust Audio Watermarking Benchmark

Accompanying website for the paper:
A Comprehensive Real-World Assessment of Audio Watermarking Algorithms: Will They Survive Neural Codecs?
Yigitcan Özer, Woosung Choi, Joan Serrà, Mayank Kumar Singh, Wei-Hsiang Liao, Yuki Mitsufuji
To appear at Interspeech 2025, Rotterdam, The Netherlands


Abstract

We introduce the Robust Audio Watermarking Benchmark (RAW-Bench), a benchmark for evaluating deep learning-based audio watermarking methods with standardized and systematic comparisons. To simulate real-world usage, we introduce a comprehensive audio attack pipeline with various distortions such as compression, background noise, and reverberation, along with a diverse test dataset including speech, environmental sounds, and music recordings. Evaluating four existing watermarking methods on RAW-Bench reveals two main insights: (i) neural compression techniques pose the most significant challenge, even when algorithms are trained with such compressions; and (ii) training with audio attacks generally improves robustness, although it is insufficient in some cases. Furthermore, we find that specific distortions, such as polarity inversion, time stretching, or reverb, seriously affect certain methods.

🔗 Links

TODO

  • Refactor core codebase
  • Release benchmark evaluation code for reproduction
  • Release fine-tuning & training pipeline (AudioSeal & SilentCipher + attack pipeline)
  • Publish pip package
  • Add documentation and tutorials

📖 Citation

If you find RAW-Bench useful in your research, please cite our paper:

@inproceedings{OezerChoiEtAl25_RAWBench_Interspeech,
 author = {Yigitcan {\"O}zer and Woosung Choi and Joan Serr{\`{a}} and Mayank Kumar Singh and Wei-Hsiang Liao and Yuki Mitsufuji},
 title = {A Comprehensive Real-World Assessment of Audio Watermarking Algorithms: Will They Survive Neural Codecs?},
 booktitle = {Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech)},
 address = {Rotterdam, The Netherlands},
 year = {2025},
 doi = {},
 pages = {}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

raw_bench-0.1.0.tar.gz (51.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

raw_bench-0.1.0-py3-none-any.whl (68.2 kB view details)

Uploaded Python 3

File details

Details for the file raw_bench-0.1.0.tar.gz.

File metadata

  • Download URL: raw_bench-0.1.0.tar.gz
  • Upload date:
  • Size: 51.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.18

File hashes

Hashes for raw_bench-0.1.0.tar.gz
Algorithm Hash digest
SHA256 7848fb40134d97433599a55f3703753fd1ffddcb50b8a328bb93063b4e72212b
MD5 77719de41918646347a387863bfc0a65
BLAKE2b-256 4361661e3a0f0e5adfac0ea3da8164f3923c4715d9b499440d0ad072e11d05bf

See more details on using hashes here.

File details

Details for the file raw_bench-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: raw_bench-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 68.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.18

File hashes

Hashes for raw_bench-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 8f67b94357742a2dd27c4c6e59c50b126431dbd5c1ca784e7b4e7d5a9381b6ab
MD5 10cba0b14ace2032db2bb8cae7e1ab87
BLAKE2b-256 16e8051d492e2b0bbf4e5799117f36e5a8a60e5d1ca22609bfc5155b5d31d87c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page