DNN-based singing voice synthesis library
Project description
NN-SVS
Neural network-based singing voice synthesis library for research.
Demo
Neural network-based singing voice synthesis demo using kiritan_singing database (Japanese)
Audio samples
- Kiritan samples: https://soundcloud.com/r9y9/sets/dnn-based-singing-voice
Installation
python setup.py develop
Repository structure
- Core library: nnsvs/
- Command line programs: nnsvs/bin/ and its configurations nnsvs/bin/conf/
- Recipes: egs/
Python docstring style
https://sphinxcontrib-napoleon.readthedocs.io/en/latest/example_google.html
Recipes
A recipe is a set of scripts and configuraitons that are used to reproduce experiments. All the steps used to conduct experiments are provided in a self-contained way. Please have a look at the egs directory if you want to build your singing voice systems.
Background
As of Feb. 2020, NEUTRINO, a DNN-based singing voice synthesis tool, has started gaining its popularity in Japan. Because of the powerful DNN-based approach, users can create expressive and natural singing voices even without manual tuning which is typically required to achieve satisfactory quality using the existing tools.
While NEUTRINO is a great tool for creative purposes, it is not open-source software. In fact, there are only a few open-source toolkits to the best of our knowledge. To advance the singing voice synthesis research, we aim to provide a modern DNN-based singing voice synthesis tool for researchers and developers.
That being said, I was just curious to see if I can make a better one than NEUTRINO. We’ll see :)
History
See HISTORY.md
Related projects
- English support for nnsvs: https://github.com/DynamiVox/nnsvs-english-support
- NNSVSのモデルをUTAUで使えるようにするツール (UTAU plugin software powered by NNSVS): https://github.com/oatsu-gh/ENUNU
- NNSVS向け教師データ作成支援ツール Vocal2lab: https://github.com/148nasuka/Vocal2lab
References
- Y. Hono et al, "Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System", Journal of IEEE/ACM TASLP https://arxiv.org/abs/2108.02776
- Y. Hono et al, "Recent Development of the DNN-based Singing Voice Synthesis System — Sinsy," Proc. of APSIPA, 2017. (PDF)
- A fork of sinsy: https://github.com/r9y9/sinsy
- Python wrapper for sinsy: https://github.com/r9y9/pysinsy
- NEUTRINO: https://n3utrino.work/
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file nnsvs-0.0.1.tar.gz.
File metadata
- Download URL: nnsvs-0.0.1.tar.gz
- Upload date:
- Size: 38.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.8.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c54fd8a48f932d7635c38c32a5f6d6eb3dfcdf81badbc64ca8fe38977bbd479b
|
|
| MD5 |
9d741a346d6daa76a2d6d90e7e12d9cd
|
|
| BLAKE2b-256 |
4aaeb3b7a774ec042d7ff87b728af7bc5bd89405480a84bcd74d1c85db9c4781
|