Skip to main content

Simple wav2vec2 wrapper

Project description

pyw2v2

Work in progress!

This module is a wrapper for Wav2Vec2 models, intended to accelerate ARS research.

Install

This module can be easily be installed with pip:

pip install pyw2v2

Examples

Different examples can be found here.

Fine-tuning CTC model example

This example will show how to load a pretrained model, load dataset, process dataset, and fine-tune CTC model.

Example configuration files can be found here.

from pyw2v2 import ModelCTC, DatasetPreprocessor
from pyw2v2.utils import load_config, load_custom_dataset_commonvoice_format

if __name__ == "__main__":
    # Load pretrained model
    model_config = load_config("../configs/ctc/default.yaml")
    model = ModelCTC(model_config)
    
    # Loading dataset in Common Voice format
    train_set = load_custom_dataset_commonvoice_format('../datasets/example', 'train')
    eval_set = load_custom_dataset_commonvoice_format('../datasets/example', 'test')

    # Set up dataset preprocessor
    dataproc_config = load_config("../configs/dataproc/default.yaml").data_proc
    data_processor = DatasetPreprocessor(dataproc_config)
    data_processor.processor = model.processor

    # Process data
    train_set = data_processor.process(train_set, dataproc_config.n_samples_train)
    eval_set = data_processor.process(eval_set, dataproc_config.n_samples_test)
    
    # Train/Fine-tune model
    model.train(train_set, eval_set)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyw2v2-0.0.2.tar.gz (7.7 kB view hashes)

Uploaded Source

Built Distribution

pyw2v2-0.0.2-py3-none-any.whl (8.7 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page