SpeechMix

Explore different ways to combine speech models (wav2vec2, HuBERT) with NLP models (BART, T5, GPT).

Installation

pip install

pip install speechmix

Build from source

Clone the repository, cd into the project directory, and install it in editable mode:

pip install -e .

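Name the project (important!)

Set the Weights & Biases project name in the environment before training. Export it so that the training process inherits it:

export WANDB_PROJECT=amazing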
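
When training is launched from a Python script rather than an interactive shell, the variable can be passed explicitly to the child process. The following is a minimal sketch; the helper name `wandb_env` is hypothetical and not part of SpeechMix. It relies only on wandb reading `WANDB_PROJECT` from the environment.

```python
import os


def wandb_env(project, base_env=None):
    """Return a copy of the environment with WANDB_PROJECT set.

    `wandb_env` is an illustrative helper, not a SpeechMix function.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["WANDB_PROJECT"] = project
    return env


# The resulting mapping can be handed to subprocess.run(..., env=env)
# when starting train.py from another script.
env = wandb_env("amazing")
```

Copying the environment instead of mutating `os.environ` keeps the project name scoped to the one training run.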

base

python train.py --speech_model_config wav2vec2 \
--nlp_model_config facebook/bart-base \
--SpeechMixEED \
--dataset librispeech_asr \
--field clean \
--train_split train.100 \
--test_split validation \
--batch 3 \
--grad_accum 20 \
--epoch 30 \
--worker 15 \
--share_layer_ratio 0 \
--down_scale 2 \
--lr 4e-5 \
--warmup_steps 500 \
--wandb \
--notes base
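
The command above pairs a small per-step batch with a large accumulation count. Assuming `--batch` is the per-device batch size and `--grad_accum` is the number of gradient accumulation steps in the usual sense (the source does not spell this out), the number of examples per optimizer update on a single device can be worked out as follows:

```python
def effective_batch_size(batch, grad_accum, num_devices=1):
    """Examples contributing to one optimizer update.

    Assumes standard gradient accumulation; num_devices is a
    hypothetical parameter for multi-GPU runs, not a train.py flag.
    """
    return batch * grad_accum * num_devices


# Values taken from the base command: --batch 3 --grad_accum 20
print(effective_batch_size(3, 20))  # 60
```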

Download files

Source Distribution

speechmix-0.0.28.tar.gz (12.7 kB)

Built Distribution

speechmix-0.0.28-py3-none-any.whl (10.8 kB)

File details

Details for the file speechmix-0.0.28.tar.gz.

File metadata

  • Download URL: speechmix-0.0.28.tar.gz
  • Size: 12.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.9

File hashes

Hashes for speechmix-0.0.28.tar.gz

Algorithm     Hash digest
SHA256        276f7ecbda09ffbaa855d6926de09e3881969d2d6390378585fca96435fba00e
MD5           73ac95179dcb9ebf0a5fd0b140a8fbdb
BLAKE2b-256   c67cb653a435a2ec9b0181cd5376436d2fbcb9708f605e124f9fe1b846f004f2
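
A downloaded archive can be checked against the published SHA256 digest before it is installed. The following is a minimal sketch using only the standard library; the function names are illustrative, and the local path is whatever location the file was saved to.

```python
import hashlib

# SHA256 digest listed above for speechmix-0.0.28.tar.gz
EXPECTED_SHA256 = "276f7ecbda09ffbaa855d6926de09e3881969d2d6390378585fca96435fba00e"


def sha256_of_file(path, chunk_size=1 << 20):
    """Hash a file in chunks so large archives need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected=EXPECTED_SHA256):
    """Return True when the file's SHA256 equals the expected digest."""
    return sha256_of_file(path) == expected
```

For the wheel, pass its own SHA256 digest as `expected`.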

File details

Details for the file speechmix-0.0.28-py3-none-any.whl.

File metadata

  • Download URL: speechmix-0.0.28-py3-none-any.whl
  • Size: 10.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.9

File hashes

Hashes for speechmix-0.0.28-py3-none-any.whl

Algorithm     Hash digest
SHA256        c35cdebc67e016619d4f49bb3e48cbc008341708a92c03529f8ccfebf2ab3b86
MD5           86c7225ea90b3ddddc3c3252c478b898
BLAKE2b-256   e2d55bd17b4c2872e2596aab16ab86ccf92a1b41f4f3a71d09409b1c58bccbd5
