A benchmarking suite for robust audio watermarking.
Project description
RAW-Bench: Robust Audio Watermarking Benchmark
Accompanying website for the paper:
A Comprehensive Real-World Assessment of Audio Watermarking Algorithms: Will They Survive Neural Codecs?
Yigitcan Özer, Woosung Choi, Joan Serrà, Mayank Kumar Singh, Wei-Hsiang Liao, Yuki Mitsufuji
To appear at Interspeech 2025, Rotterdam, The Netherlands
Abstract
We introduce the Robust Audio Watermarking Benchmark (RAW-Bench), a benchmark for evaluating deep learning-based audio watermarking methods with standardized and systematic comparisons. To simulate real-world usage, we introduce a comprehensive audio attack pipeline with various distortions such as compression, background noise, and reverberation, along with a diverse test dataset including speech, environmental sounds, and music recordings. Evaluating four existing watermarking methods on RAW-Bench reveals two main insights: (i) neural compression techniques pose the most significant challenge, even when algorithms are trained with such compressions; and (ii) training with audio attacks generally improves robustness, although it is insufficient in some cases. Furthermore, we find that specific distortions, such as polarity inversion, time stretching, or reverb, seriously affect certain methods.
🔗 Links
- Paper on arXiv
- How to Prepare Datasets
- How to Reproduce
- pip Package (Coming Soon)
TODO
- Refactor core codebase
- Release benchmark evaluation code for reproduction
- Release fine-tuning & training pipeline (AudioSeal & SilentCipher + attack pipeline)
- Publish pip package
- Add documentation and tutorials
📖 Citation
If you find RAW-Bench useful in your research, please cite our paper:
@inproceedings{OezerChoiEtAl25_RAWBench_Interspeech,
author = {Yigitcan {\"O}zer and Woosung Choi and Joan Serr{\`{a}} and Mayank Kumar Singh and Wei-Hsiang Liao and Yuki Mitsufuji},
title = {A Comprehensive Real-World Assessment of Audio Watermarking Algorithms: Will They Survive Neural Codecs?},
booktitle = {Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech)},
address = {Rotterdam, The Netherlands},
year = {2025},
doi = {},
pages = {}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file raw_bench-0.1.0.tar.gz.
File metadata
- Download URL: raw_bench-0.1.0.tar.gz
- Upload date:
- Size: 51.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.18
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7848fb40134d97433599a55f3703753fd1ffddcb50b8a328bb93063b4e72212b
|
|
| MD5 |
77719de41918646347a387863bfc0a65
|
|
| BLAKE2b-256 |
4361661e3a0f0e5adfac0ea3da8164f3923c4715d9b499440d0ad072e11d05bf
|
File details
Details for the file raw_bench-0.1.0-py3-none-any.whl.
File metadata
- Download URL: raw_bench-0.1.0-py3-none-any.whl
- Upload date:
- Size: 68.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.18
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8f67b94357742a2dd27c4c6e59c50b126431dbd5c1ca784e7b4e7d5a9381b6ab
|
|
| MD5 |
10cba0b14ace2032db2bb8cae7e1ab87
|
|
| BLAKE2b-256 |
16e8051d492e2b0bbf4e5799117f36e5a8a60e5d1ca22609bfc5155b5d31d87c
|