Skip to main content

No project description provided

Project description

dtokenizer

discretize everything into tokens

Introduction

dtokenizer is a Python library designed to discretize audio files into tokens using various models. It supports models like Hubert and Encodec for tokenization.

Installation

To use dtokenizer, first ensure you have Python and pip installed. Then, install the required dependencies by running:

pip install -r requirements.txt

Usage

Hubert Tokenizer

The Hubert tokenizer can be used to tokenize audio files into discrete tokens and then decode them back. Here's how you can use it:

from dtokenizer.audio.model.hubert_model import HubertTokenizer
import soundfile as sf

ht = HubertTokenizer('hubert_layer6_code100')
code, decodec_stuff = ht.encode_file('./sample2_22k.wav')
wav_values = ht.decode(code)

# Write the decoded audio to a file
sf.write('output.wav', wav_values, 16000)

Encodec Tokenizer

Similarly, the Encodec tokenizer allows for efficient audio file tokenization. Here's an example of its usage:

import torch
from dtokenizer.audio.model.encodec_model import EncodecTokenizer
import torchaudio

et = EncodecTokenizer('encodec_24k_6bps')
code, stuff_for_decode = et.encode_file('./sample2_22k.wav')
wav_values = et.decode(stuff_for_decode)

# Save the decoded audio to a file
torchaudio.save('output.wav', torch.from_numpy(wav_values), 22050)

Contributing

We welcome contributions to the dtokenizer project. Please feel free to submit issues or pull requests.

License

This project is released under the MIT License. See the LICENSE file for more details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dtokenizer-0.0.3.tar.gz (15.1 kB view details)

Uploaded Source

Built Distribution

dtokenizer-0.0.3-py3-none-any.whl (20.2 kB view details)

Uploaded Python 3

File details

Details for the file dtokenizer-0.0.3.tar.gz.

File metadata

  • Download URL: dtokenizer-0.0.3.tar.gz
  • Upload date:
  • Size: 15.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.5

File hashes

Hashes for dtokenizer-0.0.3.tar.gz
Algorithm Hash digest
SHA256 468cca9fde7a73634eda7dc121be6afa74048b68aa063e0395e477f3a336348f
MD5 13d6ce2784f32398b37cae84794494be
BLAKE2b-256 afcae23934a2509dab4a389498b520ef1b1713eb1aab05bec88f9255aa9051e2

See more details on using hashes here.

File details

Details for the file dtokenizer-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: dtokenizer-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 20.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.5

File hashes

Hashes for dtokenizer-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 23608a27c89a49f702ba3ba9be2ed32fe8a77abc254b59154cf43d871d53510f
MD5 42d13cc86f0b8328d20186604819619c
BLAKE2b-256 51496127d0c286131002f17f7a143458886187d20008a726e2b07170bd5a2d27

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page