No project description provided
Project description
masked_prosody_model
A transformer-based model for prosody prediction with masked inputs. This model processes pitch, energy, and voice activity detection features to predict prosodic patterns in speech.
Installation
pip install masked_prosody_model
Note: torch and torchaudio need to be installed separately.
Usage
from masked_prosody_model import MaskedProsodyModel
# Load the pretrained model
model = MaskedProsodyModel.from_pretrained("cdminix/masked_prosody_model")
# Process an audio file and get representations
representation = model.process_audio("some_audio.wav", layer=7) # layer between 0 and 15, 7 was used in the paper
Acknowledgments
This model was trained using Cloud TPUs supplied by Google's TPU Research Cloud (TRC). We thank them for their support.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file masked_prosody_model-0.3.0.tar.gz.
File metadata
- Download URL: masked_prosody_model-0.3.0.tar.gz
- Upload date:
- Size: 242.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.20
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
88bc65e60e887291864804f02ddde5e4e159eb43272bbc91649726cc732b2f4d
|
|
| MD5 |
49d320e6ad19286660021b1a15628bb4
|
|
| BLAKE2b-256 |
ad1f61bfe4938aefc2932ef9780e8ca5ddf5b89043b67a5274336a87c99135b6
|
File details
Details for the file masked_prosody_model-0.3.0-py3-none-any.whl.
File metadata
- Download URL: masked_prosody_model-0.3.0-py3-none-any.whl
- Upload date:
- Size: 7.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.20
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ba353f75978712b51e9b8e11dd7a188aa99b97c03bf3cae62dcf9f9cb7af7411
|
|
| MD5 |
061db4bdcb5e43023da388467e0e5d87
|
|
| BLAKE2b-256 |
03a707308e5fed3398e64e36027f11ded10b5d56426172826a8853cf0e36ad15
|