Simple wav2vec2 wrapper
Project description
pyw2v2
Work in progress!
This module is a wrapper for Wav2Vec2 models, intended to accelerate ARS research.
Install
This module can be easily be installed with pip
:
pip install pyw2v2
Examples
Different examples can be found here.
Fine-tuning CTC model example
This example will show how to load a pretrained model, load dataset, process dataset, and fine-tune CTC model.
Example configuration files can be found here.
from pyw2v2 import ModelCTC, DatasetPreprocessor
from pyw2v2.utils import load_config, load_custom_dataset_commonvoice_format
if __name__ == "__main__":
# Load pretrained model
model_config = load_config("../configs/ctc/default.yaml")
model = ModelCTC(model_config)
# Loading dataset in Common Voice format
train_set = load_custom_dataset_commonvoice_format('../datasets/example', 'train')
eval_set = load_custom_dataset_commonvoice_format('../datasets/example', 'test')
# Set up dataset preprocessor
dataproc_config = load_config("../configs/dataproc/default.yaml").data_proc
data_processor = DatasetPreprocessor(dataproc_config)
data_processor.processor = model.processor
# Process data
train_set = data_processor.process(train_set, dataproc_config.n_samples_train)
eval_set = data_processor.process(eval_set, dataproc_config.n_samples_test)
# Train/Fine-tune model
model.train(train_set, eval_set)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pyw2v2-0.0.1a8.tar.gz
(7.0 kB
view hashes)