SpeechMix

Explore different ways to combine speech models (wav2vec2, HuBERT) with NLP models (BART, T5, GPT).

Installation

pip install

pip install speechmix

Build from source

Clone the repository with git and cd into the project directory, then run:

pip install -e .

Name the project (important!)

WANDB_PROJECT=amazing
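
A note on how the variable reaches the training process: in a POSIX shell, a plain NAME=value assignment on its own line is not inherited by child processes unless the variable is exported. The sketch below illustrates the difference. It assumes train.py (through wandb) reads WANDB_PROJECT from its environment; the sh -c child process only stands in for python train.py.

```shell
# Start clean so an already-exported value does not hide the difference.
unset WANDB_PROJECT

# Plain assignment: visible in this shell only, not in child processes.
WANDB_PROJECT=amazing
unexported=$(sh -c 'printf "%s" "${WANDB_PROJECT:-unset}"')

# Exported: child processes (such as python train.py) inherit it.
export WANDB_PROJECT=amazing
exported=$(sh -c 'printf "%s" "${WANDB_PROJECT:-unset}"')

echo "child sees without export: $unexported"
echo "child sees with export:    $exported"
```

Prefixing a single command (WANDB_PROJECT=amazing python train.py ...) also passes the value, but only to that one command.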

base

python train.py --speech_model_config wav2vec2 \
--nlp_model_config facebook/bart-base \
--SpeechMixEED \
--dataset librispeech_asr \
--field clean \
--train_split train.100 \
--test_split validation \
--batch 3 \
--grad_accum 20 \
--epoch 30 \
--worker 15 \
--share_layer_ratio 0 \
--down_scale 2 \
--lr 4e-5 \
--warmup_steps 500 \
--wandb \
--notes base
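
The command above pairs a small per-step batch with gradient accumulation. As a rough sketch, assuming a single device and the usual meaning of gradient accumulation (one optimizer update every grad_accum steps), the number of samples per update works out as follows. The variable names are illustrative and are not part of train.py.

```shell
# Values copied from the base command above.
batch=3        # --batch
grad_accum=20  # --grad_accum

# Samples contributing to each optimizer update (single-device assumption).
effective_batch=$((batch * grad_accum))
echo "effective batch size: $effective_batch"
```

Under that assumption, lowering --batch to fit in memory while raising --grad_accum by the same factor keeps this product unchanged.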

