Skip to main content

Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together

Project description

SpeechMix

Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together.

Installation

pip install

pip install speechmix

Build from source

git clone and cd into this project.

pip install -e .

Name the project(!important)

WANDB_PROJECT=amazing

base

python train.py --speech_model_config wav2vec2 \
--nlp_model_config facebook/bart-base \
--SpeechMixEED \
--dataset librispeech_asr \
--field clean \
--train_split train.100 \
--test_split validation \
--batch 3 \
--grad_accum 20 \
--epoch 30 \
--worker 15 \
--share_layer_ratio 0 \
--down_scale 2 \
--lr 4e-5 \
--warmup_steps 500 \
--wandb \
--notes base

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

speechmix-0.0.29.tar.gz (13.9 kB view details)

Uploaded Source

Built Distribution

speechmix-0.0.29-py3-none-any.whl (10.8 kB view details)

Uploaded Python 3

File details

Details for the file speechmix-0.0.29.tar.gz.

File metadata

  • Download URL: speechmix-0.0.29.tar.gz
  • Upload date:
  • Size: 13.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.9

File hashes

Hashes for speechmix-0.0.29.tar.gz
Algorithm Hash digest
SHA256 9e49d6f423761a3eb244f30e3db42e2cc63ee2620e3cacdc84006c7676d3aec2
MD5 04897cb84c95f071f05cfb1db0a09bc5
BLAKE2b-256 6d1143a0800e5f6b49e6d54277cbfd7c4cc227f55cf6e203e4f1cf7982d4207d

See more details on using hashes here.

File details

Details for the file speechmix-0.0.29-py3-none-any.whl.

File metadata

  • Download URL: speechmix-0.0.29-py3-none-any.whl
  • Upload date:
  • Size: 10.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.9

File hashes

Hashes for speechmix-0.0.29-py3-none-any.whl
Algorithm Hash digest
SHA256 fffcb8efbcf5f671a305687cc65131ae8a4736224ed5940e927a1081d5d0844b
MD5 2869a612747049e09a9b3a7f7b436828
BLAKE2b-256 b73a79e91a2a6e3a59cfdea4eaba777e8dbe6787ccbe0e6c9c16bea24a46af2a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page