
vak

a neural network toolbox for animal vocalizations and bioacoustics

vak is a library for researchers studying animal vocalizations--such as birdsong, bat calls, and even human speech--although it may be useful to anyone working with bioacoustics data. While there are many important reasons to study bioacoustics, the scope of vak is limited to questions related to vocal learning, "the ability to modify acoustic and syntactic sounds, acquire new sounds via imitation, and produce vocalizations" (Wikipedia). Research questions related to vocal learning cut across a wide range of fields including neuroscience, physiology, molecular biology, genomics, ecology, and evolution (Wirthlin et al. 2019).

vak has two main goals:

  1. make it easier for researchers studying animal vocalizations to apply neural network algorithms to their data
  2. provide a common framework that will facilitate benchmarking neural network algorithms on tasks related to animal vocalizations

Currently the main use is automated annotation of vocalizations and other animal sounds, using artificial neural networks. By annotation, we mean something like the example of annotated birdsong shown below:
[Figure: spectrogram of birdsong with syllables annotated]

You give vak training data in the form of audio or spectrogram files with annotations, and then vak helps you train neural network models and use the trained models to predict annotations for new files.

We developed vak to benchmark a neural network model we call tweetynet. See the preprint here: https://www.biorxiv.org/content/10.1101/2020.08.28.272088v2.full.pdf
We would love to help you use vak to benchmark your own model. If you have questions, please feel free to raise an issue.

Installation

Short version:

$ pip install vak

For more detail, please see: https://vak.readthedocs.io/en/latest/get_started/installation.html

We currently test vak on Ubuntu and macOS. We have run vak on Windows and know of other users running it successfully on that operating system, but installation on Windows will probably require some troubleshooting. A good place to start is by searching the issues.

Usage

Training models to segment and label vocalizations

Currently the easiest way to work with vak is through the command line.

[Figure: terminal showing vak help command output]

You run vak with config.toml files, using one of a handful of commands.
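
For example, a typical workflow runs the prep, train, and predict commands in sequence, each reading options from a section of the same config file. This is just a sketch; the config file name here is a placeholder:

$ vak prep bird01_config.toml     # prepare datasets from audio and annotation files
$ vak train bird01_config.toml    # train a model on the prepared dataset
$ vak predict bird01_config.toml  # use the trained model to predict annotations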

For more details, please see the "autoannotate" tutorial here:
https://vak.readthedocs.io/en/latest/tutorial/autoannotate.html

Data and folder structures

To train models, you provide training data in the form of audio or spectrogram files, and annotations for those files.

Spectrograms and labels

The package can generate spectrograms from .wav files or .cbin audio files. It can also accept spectrograms in the form of MATLAB .mat or NumPy .npz files. The locations of these files are specified in the config.toml file.
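
As a sketch, a [PREP] section of the config.toml file might look like the following. The option names shown are assumptions based on the configuration docs, and all paths are placeholders; check the docs for the exact set of options:

[PREP]
data_dir = "~/Data/bird01/audio"   # directory with .wav or .cbin audio files
output_dir = "~/Data/bird01/prep"  # where prepared datasets are saved
audio_format = "wav"               # or "cbin"; use spect_format for .mat / .npz spectrograms
annot_format = "notmat"            # annotation format, parsed by crowsetta
labelset = "iabcdefghjk"           # the set of labels that appear in the annotations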

The annotations are parsed by a separate library, crowsetta, which aims to handle common formats such as Praat TextGrid files, and to enable researchers to work easily with formats they may have developed in their own labs. For more information please see:
https://crowsetta.readthedocs.io/en/latest/
https://github.com/NickleDave/crowsetta
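
As a rough sketch of what crowsetta does (the exact API may differ between versions, and the file name here is hypothetical; see the crowsetta docs linked above):

import crowsetta

# a Transcriber knows how to parse a given annotation format
scribe = crowsetta.Transcriber(format='textgrid')
annot = scribe.from_file('bird01.TextGrid')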

Preparing training files

It is possible to train on any manually annotated data, but there are some useful guidelines:

  • Use as many examples as possible - more training data gives better results. In particular, the model cannot correctly label syllable types it never encountered during training; it will most likely assign them the label of the nearest class it knows, or ignore them.
  • Include examples of noise - this makes the model much better at ignoring noise.
  • Examples of syllables overlapping noise are important - it is good practice to start with clean recordings, and the model will most likely fail if the audio is too corrupted or masked by noise. Still, training on examples of syllables sung over background noise, such as cage noise, will be beneficial.

Predicting annotations for audio

You can predict annotations for audio files by creating a config.toml file with a [PREDICT] section.
For more details, please see the "autoannotate" tutorial here: https://vak.readthedocs.io/en/latest/tutorial/autoannotate.html
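
As a sketch, a [PREDICT] section might contain entries like these. The option names are assumptions based on the tutorial, and all paths are placeholders:

[PREDICT]
checkpoint_path = "~/results/train/checkpoints/checkpoint.pt"  # trained model weights
labelmap_path = "~/results/train/labelmap.json"                # maps labels to network outputs
models = "TweetyNet"                                           # model to use for prediction
output_dir = "~/Data/bird01/predict"                           # where predicted annotations are saved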

Support / Contributing

Currently we are handling support through the issue tracker on GitHub:
https://github.com/NickleDave/vak/issues
Please raise an issue there if you run into trouble.
That would also be a great place to start if you are interested in contributing.

Citation

If you use vak for a publication, please cite its DOI.

License

The license is here.

Misc

"Why this name, vak?"

It has only three letters, so it is quick to type, and it wasn't taken on PyPI yet. Also, I guess it has something to do with speech. "vak" rhymes with "squawk" and "talk".

Does your library have any poems?

Yes.
