vocalpy

A core package for acoustic communication research in Python

These details have not been verified by PyPI

Project links

Docs
Home

Project description

A core package for acoustic communication research in Python

There are many great software tools for researchers studying acoustic communication in animals[^1]. But our research groups work with a wide range of different data formats: for audio, for array data, for annotations. This means we write a lot of low-level code to deal with those formats, and then our code for analyses is tightly coupled to those formats. In turn, this makes it hard for other groups to read our code, and it takes a real investment to understand our analyses, workflows and pipelines. It also means that it requires significant work to translate from a pipeline or analysis worked out by a scientist-coder in a Jupyter notebook into a generalized, robust service provided by an application.

In particular, acoustic communication researchers working with the Python programming language face these problems. How can our scripts and libraries talk to each other? Luckily, Python is a great glue language! Let's use it to solve these problems.

The goals of VocalPy are to:

make it easy to work with a wide array of data formats: audio, array (spectrograms, features), annotation
provide classes that represent commonly-used data types: audio, spectograms, features, annotations
provide classes that represent common processes and steps in pipelines: segmenting audio, computing spectrograms, extracting features
make it easier for scientist-coders to flexibly and iteratively build datasets, without needing to deal directly with a database if they don't want to
make it possible to re-use code you have already written for your own research group
and finally:
- make code easier to share and read across research groups, by providing these classes, and idiomiatic ways of coding with them; think of VocalPy as an interoperability layer and a common language
- facilitate collaboration between scientist-coders writing imperative analysis scripts and research software engineers developing libraries and applications

A paper introducing VocalPy and its design has been accepted at Forum Acusticum 2023 as part of the session "Open-source software and cutting-edge applications in bio-acoustics", and will be published in the proceedings.

[^1]: For a curated collection, see https://github.com/rhine3/bioacoustics-software.

Features

Data types for acoustic communication data: audio, spectrogram, annotations, features

The `vocalpy.Sound` data type

Works with a wide array of audio formats, thanks to soundfile.
Also works with the cbin audio format saved by the LabView app EvTAF used by many neuroscience labs studying birdsong, thanks to evfuncs.

>>> import vocalpy as voc
>>> data_dir = ('tests/data-for-tests/source/audio_wav_annot_birdsongrec/Bird0/Wave/')
>>> wav_paths = voc.paths.from_dir(data_dir, 'wav')
>>> sounds = [voc.Sound.read(wav_path) for wav_path in wav_paths]
>>> print(sounds[0])
vocalpy.Sound(data=array([3.0517...66210938e-04]), samplerate=32000, channels=1))

The `vocalpy.Spectrogram` data type

Save expensive-to-compute spectrograms to array files, so you don't regenerate them over and over again

>>> import vocalpy as voc
>>> data_dir = ('tests/data-for-tests/generated/spect_npz/')
>>> spect_paths = voc.paths.from_dir(data_dir, 'wav.npz')
>>> spects = [voc.Spectrogram.read(spect_path) for spect_path in spect_paths]
>>> print(spects[0])
vocalpy.Spectrogram(data=array([[3.463...7970774e-14]]), frequencies=array([    0....7.5, 16000. ]), times=array([0.008,...7.648, 7.65 ]))

The `vocalpy.Annotation` data type

Load many different annotation formats using the pyOpenSci package crowsetta

>>> import vocalpy as voc
>>> data_dir = ('tests/data-for-tests/source/audio_cbin_annot_notmat/gy6or6/032312/')
>>> notmat_paths = voc.paths.from_dir(data_dir, '.not.mat')
>>> annots = [voc.Annotation.read(notmat_path, format='notmat') for notmat_path in notmat_paths]
>>> print(annots[1])
Annotation(data=Annotation(annot_path=PosixPath('tests/data-for-tests/source/audio_cbin_annot_notmat/gy6or6/032312/gy6or6_baseline_230312_0809.141.cbin.not.mat'), 
notated_path=PosixPath('tests/data-for-tests/source/audio_cbin_annot_notmat/gy6or6/032312/gy6or6_baseline_230312_0809.141.cbin'), 
seq=<Sequence with 57 segments>), path=PosixPath('tests/data-for-tests/source/audio_cbin_annot_notmat/gy6or6/032312/gy6or6_baseline_230312_0809.141.cbin.not.mat'))

Classes for common steps in your pipelines and workflows

A `Segmenter` for segmentation into sequences of units

>>> import evfuncs
>>> import vocalpy as voc
>>> data_dir = ('tests/data-for-tests/source/audio_cbin_annot_notmat/gy6or6/032312/')
>>> cbin_paths = voc.paths.from_dir(data_dir, 'cbin')
>>> audios = [voc.Sound.read(cbin_path) for cbin_path in cbin_paths]
>>> segment_params = {'threshold': 1500, 'min_syl_dur': 0.01, 'min_silent_dur': 0.006}
>>> segmenter = voc.Segmenter(callback=evfuncs.segment_song, segment_params=segment_params)
>>> seqs = segmenter.segment(audios, parallelize=True)
[  ########################################] | 100% Completed | 122.91 ms
>>> print(seqs[1])
Sequence(units=[Unit(onset=2.19075, offset=2.20428125, label='-', audio=None, spectrogram=None),
                Unit(onset=2.35478125, offset=2.38815625, label='-', audio=None, spectrogram=None),
                Unit(onset=2.8410625, offset=2.86715625, label='-', audio=None, spectrogram=None),
                Unit(onset=3.48234375, offset=3.49371875, label='-', audio=None, spectrogram=None),
                Unit(onset=3.57021875, offset=3.60296875, label='-', audio=None, spectrogram=None),
                Unit(onset=3.64403125, offset=3.67721875, label='-', audio=None, spectrogram=None),
                Unit(onset=3.72228125, offset=3.74478125, label='-', audio=None, spectrogram=None),
                Unit(onset=3.8036875, offset=3.8158125, label='-', audio=None, spectrogram=None),
                Unit(onset=3.82328125, offset=3.83646875, label='-', audio=None, spectrogram=None),
                Unit(onset=4.13759375, offset=4.16346875, label='-', audio=None, spectrogram=None),
                Unit(onset=4.80278125, offset=4.814, label='-', audio=None, spectrogram=None),
                Unit(onset=4.908125, offset=4.922875, label='-', audio=None, spectrogram=None),
                Unit(onset=4.9643125, offset=4.992625, label='-', audio=None, spectrogram=None),
                Unit(onset=5.039625, offset=5.0506875, label='-', audio=None, spectrogram=None),
                Unit(onset=5.10165625, offset=5.1385, label='-', audio=None, spectrogram=None),
                Unit(onset=5.146875, offset=5.16203125, label='-', audio=None, spectrogram=None),
                Unit(onset=5.46390625, offset=5.49409375, label='-', audio=None, spectrogram=None),
                Unit(onset=6.14503125, offset=6.1565625, label='-', audio=None, spectrogram=None),
                Unit(onset=6.31003125, offset=6.346125, label='-', audio=None, spectrogram=None),
                Unit(onset=6.38996875, offset=6.4018125, label='-', audio=None, spectrogram=None),
                Unit(onset=6.46053125, offset=6.4796875, label='-', audio=None, spectrogram=None),
                Unit(onset=6.83525, offset=6.8643125, label='-', audio=None, spectrogram=None)], method='segment_song',
         segment_params={'threshold': 1500, 'min_syl_dur': 0.01, 'min_silent_dur': 0.006},
         audio=vocalpy.Sound(data=None, samplerate=None, channels=None), path=tests / data -
for -tests / source / audio_cbin_annot_notmat / gy6or6 / 032312 / gy6or6_baseline_230312_0809.141.cbin), spectrogram=None)

A `SpectrogramMaker` for computing spectrograms

>>> import vocalpy as voc
>>> wav_paths = voc.paths.from_dir('wav')
>>> audios = [voc.Sound(wav_path) for wav_path in wav_paths]
>>> spect_params = {'fft_size': 512, 'step_size': 64}
>>> spect_maker = voc.SpectrogramMaker(spect_params=spect_params)
>>> spects = spect_maker.make(audios, parallelize=True)

And more!

For a crash course in VocalPy, please see the quickstart in the documentation. And for walkthroughs on how to use VocalPy for common tasks, please see the How-Tos section of the user guide.

Installation

With `pip`

pip install vocalpy

With `conda`

conda install vocalpy -c conda-forge

For more detail see Getting Started - Installation

Support

To report a bug or request a feature, please use the issue tracker on GitHub:
https://github.com/vocalpy/vocalpy/issues

To ask a question about vocalpy, discuss its development, or share how you are using it, please start a new topic on the VocalPy forum with the vocalpy tag:
https://forum.vocalpy.org/

Contribute

Code of conduct

Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.

Contributing Guidelines

Below we provide some quick links, but you can learn more about how you can help and give feedback
by reading our Contributing Guide.

To ask a question about vocalpy, discuss its development, or share how you are using it, please start a new "Q&A" topic on the VocalPy forum with the vocalpy tag:
https://forum.vocalpy.org/

To report a bug, or to request a feature, please use the issue tracker on GitHub:
https://github.com/vocalpy/vocalpy/issues

CHANGELOG

You can see project history and work in progress in the CHANGELOG

License

The project is licensed under the BSD license.

Citation

If you use vocalpy, please cite the DOI:

Contributors ✨

Thanks goes to these wonderful people (emoji key):

_{Ralph Emilio Peterson}
🤔 📓 📖 🐛 💻

_{Tetsuo Koyama}
📖

_avanikop
🤔

_{Estelle in 't Zandt}
🐛 🔣

This project follows the all-contributors specification. Contributions of any kind welcome!

Project details

These details have not been verified by PyPI

Project links

Docs
Home

Release history Release notifications | RSS feed

This version

0.10.3.post1

Dec 16, 2025

0.10.3

Dec 16, 2025

0.10.2

Jul 17, 2025

0.10.1

Jul 14, 2025

0.10.0.post1

Nov 28, 2024

0.10.0

Nov 28, 2024

0.9.5

Sep 10, 2024

0.9.4

Jul 3, 2024

0.9.3

Jun 12, 2024

0.9.2

Jun 6, 2024

0.9.1

May 21, 2024

0.9.0

May 16, 2024

0.8.2

Feb 8, 2024

0.8.1

Feb 2, 2024

0.8.0

Jan 17, 2024

0.7.0

Jan 4, 2024

0.6.1

Nov 2, 2023

0.6.0

Oct 3, 2023

0.5.0

Sep 29, 2023

0.4.0

Sep 28, 2023

0.3.0

May 11, 2023

0.2.0.post1

May 7, 2023

0.2.0

May 7, 2023

0.1.0

Feb 12, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vocalpy-0.10.3.post1.tar.gz (5.2 MB view details)

Uploaded Dec 16, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vocalpy-0.10.3.post1-py3-none-any.whl (1.5 MB view details)

Uploaded Dec 16, 2025 Python 3

File details

Details for the file vocalpy-0.10.3.post1.tar.gz.

File metadata

Download URL: vocalpy-0.10.3.post1.tar.gz
Upload date: Dec 16, 2025
Size: 5.2 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: python-requests/2.32.5

File hashes

Hashes for vocalpy-0.10.3.post1.tar.gz
Algorithm	Hash digest
SHA256	`4bb3bd88cb94fe8a551ccac844b436615070cac2b96b40f3f963a7419e3db038`
MD5	`f84d0dc10b896bed18fbc00262b37ff6`
BLAKE2b-256	`6121670a49982589985e38bc98905439b839dcd423e50d6e350d5b5eeaac9b4b`

See more details on using hashes here.

File details

Details for the file vocalpy-0.10.3.post1-py3-none-any.whl.

File metadata

Download URL: vocalpy-0.10.3.post1-py3-none-any.whl
Upload date: Dec 16, 2025
Size: 1.5 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: python-requests/2.32.5

File hashes

Hashes for vocalpy-0.10.3.post1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`369961ec9b30ca2fad596a4d32fa346d47722dd1447f229ab34d2fd71b1aed51`
MD5	`d79c1e300bed0a3c5f16b7ec8414ce23`
BLAKE2b-256	`8e4d92597af5955ccbd0f2dc20ae3ec09e73a64895f3b3bfb2190af4aa2fe871`

See more details on using hashes here.

vocalpy 0.10.3.post1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

A core package for acoustic communication research in Python

Features

Data types for acoustic communication data: audio, spectrogram, annotations, features

The vocalpy.Sound data type

The vocalpy.Spectrogram data type

The vocalpy.Annotation data type

Classes for common steps in your pipelines and workflows

A Segmenter for segmentation into sequences of units

A SpectrogramMaker for computing spectrograms

And more!

Installation

With pip

With conda

Support

Contribute

Code of conduct

Contributing Guidelines

CHANGELOG

License

Citation

Contributors ✨

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

The `vocalpy.Sound` data type

The `vocalpy.Spectrogram` data type

The `vocalpy.Annotation` data type

A `Segmenter` for segmentation into sequences of units

A `SpectrogramMaker` for computing spectrograms

With `pip`

With `conda`