Skip to main content

Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together

Project description

SpeechMix

Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together.

Installation

pip install

pip install speechmix

Build from source

git clone and cd into this project.

pip install -e .

Name the project(!important)

WANDB_PROJECT=amazing

base

python train.py --speech_model_config wav2vec2 \
--nlp_model_config facebook/bart-base \
--SpeechMixEED \
--dataset librispeech_asr \
--field clean \
--train_split train.100 \
--test_split validation \
--batch 3 \
--grad_accum 20 \
--epoch 30 \
--worker 15 \
--share_layer_ratio 0 \
--down_scale 2 \
--lr 4e-5 \
--warmup_steps 500 \
--wandb \
--notes base

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

speechmix-0.0.31.tar.gz (14.7 kB view details)

Uploaded Source

Built Distribution

speechmix-0.0.31-py3-none-any.whl (11.8 kB view details)

Uploaded Python 3

File details

Details for the file speechmix-0.0.31.tar.gz.

File metadata

  • Download URL: speechmix-0.0.31.tar.gz
  • Upload date:
  • Size: 14.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.9

File hashes

Hashes for speechmix-0.0.31.tar.gz
Algorithm Hash digest
SHA256 b34b4ef4e3a5b2bf1e4d717e0fe2521512b59a4416857e71f6e283692ed1037b
MD5 e5c2b6d2996e2bc7c41ef86c88211a0c
BLAKE2b-256 2a8e8fe7a1d07012b6da6042e644ab8dc01adbc22bc275132e3f2266afecb4c6

See more details on using hashes here.

File details

Details for the file speechmix-0.0.31-py3-none-any.whl.

File metadata

  • Download URL: speechmix-0.0.31-py3-none-any.whl
  • Upload date:
  • Size: 11.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.9

File hashes

Hashes for speechmix-0.0.31-py3-none-any.whl
Algorithm Hash digest
SHA256 0a186de0e0085d69b7ca5693ae633502467fd4bc7f5d51061462dc464264dd59
MD5 dc126cf34a3ae38ff676e65bd9426fc0
BLAKE2b-256 3832098a904e26521eadda6324f2ff79d7d1e8db783111deec6736cc8f2b90f1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page