Watermarking and detection for speech audio

Development Status
- 4 - Beta
License
- OSI Approved :: MIT License
Topic
- Scientific/Engineering

Project description

:loud_sound: AudioSeal: Proactive Localized Watermarking

We introduce AudioSeal, a method for localized speech watermarking with state-of-the-art detector speed that does not compromise watermarking robustness. It jointly trains a generator that embeds a watermark in the audio and a detector that detects watermarked fragments in longer audio, even in the presence of editing. AudioSeal achieves state-of-the-art detection performance on both natural and synthetic speech at the sample level (1/16k second resolution), alters signal quality only slightly, and is robust to many types of audio editing. AudioSeal is designed with a fast, single-pass detector that significantly surpasses existing models in speed, achieving detection up to two orders of magnitude faster, which makes it well suited to large-scale and real-time applications.

More details can be found in the paper

:mate: Installation

AudioSeal requires Python >= 3.8, PyTorch >= 1.13.0, omegaconf, julius, and numpy. To install from PyPI:

pip install audioseal

To install from source, clone this repo and install in editable mode:

git clone https://github.com/facebookresearch/audioseal
cd audioseal
pip install -e .
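
A quick way to confirm that an environment meets the requirements listed above is sketched below. The helper name `missing_requirements` is hypothetical and not part of AudioSeal; it only checks the Python version and whether each dependency is importable, and it does not compare the installed PyTorch version against 1.13.0.

```python
import importlib.util
import sys

# Import names of the dependencies listed above (PyTorch is imported as "torch").
REQUIRED_MODULES = ("torch", "omegaconf", "julius", "numpy")


def missing_requirements(modules=REQUIRED_MODULES, min_python=(3, 8)):
    """Return a list of human-readable problems; empty if none were found."""
    problems = []
    if sys.version_info < min_python:
        problems.append("Python >= %d.%d is required" % min_python)
    for name in modules:
        # find_spec returns None when a top-level module cannot be located.
        if importlib.util.find_spec(name) is None:
            problems.append("module not found: " + name)
    return problems


if __name__ == "__main__":
    for problem in missing_requirements():
        print(problem)
```

An empty result means every listed module can be located; it says nothing about whether the installed versions are compatible.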

:gear: Models

We provide the checkpoints for the following models:

AudioSeal Generator. It takes an audio signal (as a waveform) as input and outputs a watermark of the same size as the input, which can be added to the input to watermark it. Optionally, it can also take a 16-bit secret message that will be encoded in the watermark.
AudioSeal Detector. It takes an audio signal (as a waveform) as input and outputs, for each sample of the audio (every 1/16k s), the probability that the input contains a watermark at that sample. Optionally, it may also output the secret message encoded in the watermark.

Note that the message is optional and has no influence on the detection output. It may be used, for instance, to identify a model version (up to $2^{16} = 65536$ possible choices).
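
To make the model-version use case concrete, here is a minimal sketch of packing an integer identifier into 16 bits and recovering it. The function names and the most-significant-bit-first ordering are assumptions chosen for illustration; AudioSeal does not prescribe how the 16 bits are interpreted.

```python
NBITS = 16  # message length stated above


def id_to_bits(value, nbits=NBITS):
    """Encode an integer in [0, 2**nbits) as a list of bits, most significant first."""
    if not 0 <= value < 2 ** nbits:
        raise ValueError("value must fit in %d bits" % nbits)
    return [(value >> shift) & 1 for shift in range(nbits - 1, -1, -1)]


def bits_to_id(bits):
    """Decode a most-significant-first bit list back into an integer."""
    value = 0
    for bit in bits:
        value = (value << 1) | int(bit)
    return value


bits = id_to_bits(42)
print(bits)              # 16 values, each 0 or 1
print(bits_to_id(bits))  # 42
```

To pass such a message to the generator, the list would need to be wrapped in a tensor of shape batch x 16, for example with `torch.tensor([bits])`.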

Note: We are working to release the training code for anyone who wants to build their own watermarker. Stay tuned!

:abacus: Usage

AudioSeal provides a simple API to watermark an audio sample and to detect watermarks in it. Example usage:

import torch
from audioseal import AudioSeal

# `wav` is assumed to be a 16 kHz waveform tensor of shape (batch, channels, samples)

# The model name corresponds to the YAML card file name found in audioseal/cards
model = AudioSeal.load_generator("audioseal_wm_16bits")

# Alternatively, load directly from a checkpoint
# model = Watermarker.from_pretrained(checkpoint_path, device=wav.device)

watermark = model.get_watermark(wav)

# Optional: you can add a 16-bit message to embed in the watermark
# msg = torch.randint(0, 2, (wav.shape[0], model.msg_processor.nbits), device=wav.device)
# watermark = model.get_watermark(wav, message=msg)

watermarked_audio = wav + watermark

detector = AudioSeal.load_detector("audioseal_detector_16bits")

# High-level detection
result, message = detector.detect_watermark(watermarked_audio)

print(result)   # result is a float indicating the probability that the audio is watermarked
print(message)  # message is a binary vector of 16 bits


# Low-level detection
result, message = detector(watermarked_audio)

# result is a tensor of size batch x 2 x frames, indicating the probability (positive and negative) of watermarking for each frame
# A watermarked audio should have result[:, 1, :] > 0.5
print(result[:, 1, :])

# message is a tensor of size batch x 16, indicating the probability of each bit being 1.
# message will be a random tensor if the detector detects no watermarking in the audio
print(message)
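
The low-level output can be post-processed into time ranges and a hard-decision message. The sketch below is not part of the AudioSeal API: `watermarked_segments` and `decode_message` are hypothetical helpers that work on plain Python lists, and the 0.5 threshold simply mirrors the comment above. With real detector output, one would pass something like `result[0, 1, :].tolist()` and `message[0].tolist()`.

```python
SAMPLE_RATE = 16_000  # one detector output per sample at 16 kHz


def watermarked_segments(probs, threshold=0.5, sample_rate=SAMPLE_RATE):
    """Return (start_s, end_s) pairs for runs where probs[i] > threshold."""
    segments = []
    start = None
    for i, p in enumerate(probs):
        if p > threshold and start is None:
            start = i  # a run begins at sample i
        elif p <= threshold and start is not None:
            segments.append((start / sample_rate, i / sample_rate))
            start = None  # the run ended just before sample i
    if start is not None:  # the run reaches the end of the audio
        segments.append((start / sample_rate, len(probs) / sample_rate))
    return segments


def decode_message(bit_probs, threshold=0.5):
    """Round per-bit probabilities to a list of 0/1 values."""
    return [1 if p > threshold else 0 for p in bit_probs]


# Toy input: 1 s clean, 2 s watermarked, 1 s clean.
probs = [0.1] * 16_000 + [0.9] * 32_000 + [0.2] * 16_000
print(watermarked_segments(probs))  # [(1.0, 3.0)]
```

A fixed threshold is the simplest choice; noisy outputs may call for smoothing or a minimum segment length before reporting a range.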

License

The code in this repository is released under the MIT license as found in the LICENSE file.
The model weights in this repository are released under the CC-BY-NC 4.0 license as found in the LICENSE_weights file.

Citation

If you find this repository useful, please consider giving it a star :star: and citing it as:

@article{sanroman2024proactive,
  title={Proactive Detection of Voice Cloning with Localized Watermarking},
  author={San Roman, Robin and Fernandez, Pierre and Elsahar, Hady and D{\'e}fossez, Alexandre and Furon, Teddy and Tran, Tuan},
  journal={arXiv preprint},
  year={2024}
}

Release history

- 0.2.0: Dec 17, 2025
- 0.1.8: Aug 11, 2025
- 0.1.7: Apr 29, 2025
- 0.1.6: Apr 28, 2025
- 0.1.5: Apr 28, 2025
- 0.1.4: Jun 24, 2024
- 0.1.3: Apr 30, 2024
- 0.1.2: Feb 29, 2024
- 0.1.1: Feb 4, 2024 (this version)
- 0.1.0: Jan 31, 2024

Download files

Source Distribution

audioseal-0.1.1.tar.gz (1.9 MB)

Uploaded Feb 4, 2024 Source

Built Distribution

audioseal-0.1.1-py3-none-any.whl (28.5 kB)

Uploaded Feb 4, 2024 Python 3

File details

Details for the file audioseal-0.1.1.tar.gz.

File metadata

Download URL: audioseal-0.1.1.tar.gz
Upload date: Feb 4, 2024
Size: 1.9 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: python-requests/2.31.0

File hashes

Hashes for audioseal-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`1f3f327abab613592713d433a30af3d904c69491a3cc6edd7a4e9925f45c8e18`
MD5	`e6052aa8430999c882e0b6bca3ae5faa`
BLAKE2b-256	`be526a3c6ecf10b8c6ac0fadab87d85eaf0b57808b8228db5eb78980fe0c9d79`

File details

Details for the file audioseal-0.1.1-py3-none-any.whl.

File metadata

Download URL: audioseal-0.1.1-py3-none-any.whl
Upload date: Feb 4, 2024
Size: 28.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: python-requests/2.31.0

File hashes

Hashes for audioseal-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`656d68bbe5991165e13aa9678d6eb33b3235634f9d29fdb9bf6b1522cc1f9d93`
MD5	`a0bc32bc5333df07ac5dce8693ecdc4d`
BLAKE2b-256	`b7c4c7ce198a141a216889cad8e75cd1edc9d8dea65ab2120048a19fc0374a53`
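
The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using Python's standard `hashlib`; the helper names are hypothetical, and the file paths assume the archives were downloaded into the current directory.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's digest with a published one, ignoring letter case."""
    return sha256_of(path) == expected_hex.strip().lower()


# SHA256 digests copied from the tables above.
PUBLISHED_SHA256 = {
    "audioseal-0.1.1.tar.gz":
        "1f3f327abab613592713d433a30af3d904c69491a3cc6edd7a4e9925f45c8e18",
    "audioseal-0.1.1-py3-none-any.whl":
        "656d68bbe5991165e13aa9678d6eb33b3235634f9d29fdb9bf6b1522cc1f9d93",
}
```

For example, `matches_published_hash("audioseal-0.1.1.tar.gz", PUBLISHED_SHA256["audioseal-0.1.1.tar.gz"])` should return `True` for an intact download of the source distribution.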

audioseal 0.1.1
