Skip to main content

npvcc2016: Python loader of npVCC2016 speech corpus

Project description

npvcc2016 - Python loader of npVCC2016Corpus

PyPI version Python Versions

npvcc2016 is a Python package for loader of npVCC2016 non-parallel speech corpus.
For machine learning, corpus/dataset is indispensable - but troublesome - part.
We need portable & flexible loader for streamline development.
npvcc2016 is the one!!

Demo

Python/PyTorch

pip install npvcc2016
from npvcc2016.PyTorch.dataset.waveform import NpVCC2016_wave

dataset = NpVCC2016(train=True, download=True)

for datum in dataset:
    print("Yeah, data is acquired with only two line of code!!")
    print(datum) # (datum, label) tuple provided

npvcc2016 transparently downloads corpus, structures the data and provides standarized datasets.
What you have to do is only instantiating the class!

APIs

Current npvcc2016 support PyTorch.
As interface, PyTorch's Dataset and PyTorch-Lightning's DataModule are provided.
npVCC2016 corpus is speech corpus, so we provide waveform dataset and spectrogram dataset for both interfaces.

  • PyTorch
    • (pure PyTorch) dataset
      • waveform: NpVCC2016_wave
      • spectrogram: NpVCC2016_spec
    • PyTorch-Lightning
      • waveform: NpVCC2016_wave_DataModule
      • spectrogram: NpVCC2016_spec_DataModule

Dependency Notes

PyTorch version

PyTorch version: PyTorch v1.6 is working (We checked with v1.6.0).

For dependency resolution, we do NOT explicitly specify the compatible versions.
PyTorch have several distributions for various environment (e.g. compatible CUDA version.)
Unfortunately it make dependency version management complicated for dependency management system.
In our case, the system poetry cannot handle cuda variant string (e.g. torch>=1.6.0 cannot accept 1.6.0+cu101.)
In order to resolve this problem, we use torch==*, it is equal to no version specification.
Setup.py could resolve this problem (e.g. torchaudio's setup.py), but we will not bet our effort to this hacky method.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

npvcc2016-3.0.0.tar.gz (9.5 kB view details)

Uploaded Source

Built Distribution

npvcc2016-3.0.0-py3-none-any.whl (13.2 kB view details)

Uploaded Python 3

File details

Details for the file npvcc2016-3.0.0.tar.gz.

File metadata

  • Download URL: npvcc2016-3.0.0.tar.gz
  • Upload date:
  • Size: 9.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.4 CPython/3.6.12 Linux/4.19.104-microsoft-standard

File hashes

Hashes for npvcc2016-3.0.0.tar.gz
Algorithm Hash digest
SHA256 d0aaf55ae128441833b0797fce59345d2421cce0cbf9e37417e310f8918ff4fe
MD5 f416a33dc7b2106639d83d04975ff8d3
BLAKE2b-256 a1e12a532cd9ba7c494d432eaf30ad6ceba72c2fd5357abdf93b22b930dc3471

See more details on using hashes here.

File details

Details for the file npvcc2016-3.0.0-py3-none-any.whl.

File metadata

  • Download URL: npvcc2016-3.0.0-py3-none-any.whl
  • Upload date:
  • Size: 13.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.4 CPython/3.6.12 Linux/4.19.104-microsoft-standard

File hashes

Hashes for npvcc2016-3.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b252e75b44544be69f247b3620a797db0ffad631ba73bc564cf67e5aca998ffe
MD5 4fcd905977639041c0d0d79dc60cabe5
BLAKE2b-256 c180cded0936e4b5c86c62b034d32c55e44dfed1e00ad4098e42155bb1f52f44

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page