Unofficial PyTorch dataset for Slakh
Project description
Slakh PyTorch Dataset
Unofficial PyTorch dataset for Slakh.
This project is a work in progress, expect breaking changes!
Roadmap
Automatic music transcription (AMT) usecase with audio and labels
- Specify dataset split (
original
,splits_v2
,redux
) - Add new splits (
redux_no_pitch_bend
, ...) (Should also be filed upstream) (implemented byskip_pitch_bend_tracks
) - Load audio
mix.flac
(all the instruments comined) - Load individual audio mixes (need to combine audio in a streaming fashion)
- Specify
train
,validation
ortest
group - Choose sequence length
- Reproducable load sequences (usefull for validation group to get consistent results)
- Add more instruments (
eletric-bass
,piano
,guitar
, ...) - Choose between having audio in memory or stream from disk (solved by
max_files_in_memory
) - Add to pip
Audio source separation usecase with different audio mixes
- List to come
Usage
-
Download the Slakh dataset (see the official website). It's about 100GB compressed so expect using some time on this point.
-
Install the Python package with pip:
pip install slakh-dataset
-
Convert the audio to 16 kHz (see https://github.com/ethman/slakh-utils)
-
You can use the dataset (AMT usecase):
from torch.utils.data import DataLoader
from slakh_dataset import SlakhAmtDataset
dataset = SlakhAmtDataset(
path='path/to/slakh-16khz-folder'
split='redux', # 'splits_v2','redux-no-pitch-bend'
audio='mix.flac', # 'individual'
instrument='electric-bass', # or `midi_programs`
# midi_programs=[33, 34, 35, 36, 37],
groups=['train'],
skip_pitch_bend_tracks=True,
sequence_length=327680,
max_files_in_memory=200,
)
batch_size = 8
loader = DataLoader(dataset, batch_size, shuffle=True, drop_last=True)
# train model on dataset...
Acknowledgement
-
This code is based on the dataset in Onset and Frames by Jong Wook Kim which is MIT Lisenced.
-
Slakh http://www.slakh.com/
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file slakh-dataset-0.1.13.tar.gz
.
File metadata
- Download URL: slakh-dataset-0.1.13.tar.gz
- Upload date:
- Size: 47.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.5 CPython/3.7.2 Darwin/20.1.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 404fd0ab7f7c9b7af6250021e986c0195e75a05479ce6415d2ac0a087c2265f7 |
|
MD5 | 752d4684e6331d80f7fc17271cd34ed1 |
|
BLAKE2b-256 | e2aa8cfbeff93a0c3a79657daeee4caf247848cc15c103ed7ad3e7720fbc0765 |
File details
Details for the file slakh_dataset-0.1.13-py3-none-any.whl
.
File metadata
- Download URL: slakh_dataset-0.1.13-py3-none-any.whl
- Upload date:
- Size: 48.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.5 CPython/3.7.2 Darwin/20.1.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5ab37a547b89fe06cf2d4edf32094e90fc269ef82aefa900ea2953df3cd3c9c7 |
|
MD5 | a96dee87bf97510d5eeedecb178b0a57 |
|
BLAKE2b-256 | 5292526c74d977ae0ea9570ab5cd8646644de6a5399b136af887b30dbdf959e8 |