Platform independent interfacing of numpy arrays of floats with audio files and devices for scientific data analysis.

These details have not been verified by PyPI

Project links

Project description

downloads

AudioIO

Platform independent interfacing of numpy arrays of floats with audio files and devices for scientific data analysis.

Documentation | API Reference

Features

Audio data are always numpy arrays of floats with values ranging between -1 and 1 independent of how the data are stored in an audio file.
load_audio() function for loading data of a whole audio file at once.
Blockwise, random-access loading of large or sequential audio files (class AudioLoader based on class BufferedArray).
Read arbitrary metadata() as nested dictionaries of key-value pairs. Supported RIFF chunks are INFO lists, BEXT, iXML, and GUANO.
Read markers(), i.e. cue points with spans, labels, and descriptions.
write_audio() function for writing data, metadata, and markers to an audio file.
Platform independent, synchronous (blocking) and asynchronous (non blocking) playback of numpy arrays via play() with automatic resampling to match supported sampling rates.
Detailed and platform specific installation instructions (pip, conda, Debian and RPM based Linux packages, homebrew for MacOS) for all supported audio packages (see audiomodules).

The AudioIO modules try to use whatever audio packages are installed on your system to achieve their tasks. AudioIO, however, adds own code for handling metadata and marker lists.

Installation

AudioIO is available at PyPi. Simply run:

pip install audioio

Then you can use already installed audio packages for reading and writing audio files and for playing audio data. However, audio file formats supported by the python standard library are limited to basic wave files and playback capabilities are poor. If you need support for additional audio file formats or proper sound output, you need to install additional packages.

See installation for further instructions and recommendations on additional audio packages.

Usage

See API Reference for detailed information.

import audioio as aio

Loading audio data

Load an audio file into a numpy array using load_audio():

data, samplingrate = aio.load_audio('audio/file.wav')

The read in data are always numpy arrays of floats ranging between -1 and 1. The arrays are always 2-D arrays with first axis time and second axis channel, even for single channel data.

Plot the first channel:

import numpy as np
import matplotlib.pyplot as plt

time = np.arange(len(data))/samplingrate
plt.plot(time, data[:,0])
plt.show()

Get a nested dictionary with key-value pairs of the file's metadata and print it using metadata() and print_metadata():

md = aio.metadata('audio/file.wav')
aio.print_metadata(md)

See the audiometadata module for functions to read, write, and change metadata of various types.

Get and print marker positions, spans, labels and texts using markers() and print_markers():

locs, labels = aio.markers('audio/file.wav')
aio.print_markers(locs, labels)

You can also randomly access chunks of data of an audio file, without loading the entire file into memory, by means of the AudioLoader class. This is really handy for analysing very long sound recordings:

# open audio file with a buffer holding 60 seconds of data:
with aio.AudioLoader('audio/file.wav', 60.0) as data:
     block = 1000
     rate = data.samplerate
     for i in range(len(data)//block):
     	 x = data[i*block:(i+1)*block]
     	 # ... do something with x and rate

Instead of a single audio file it can also handle recordings that are split over many files. Just pass all these files as a list to the AudioLoader class.

Even simpler, iterate in blocks over the file with overlap using the blocks() generator:

from scipy.signal import spectrogram
nfft = 2048
with aio.AudioLoader('some/audio.wav') as data:
    for x in data.blocks(100*nfft, nfft//2):
        f, t, Sxx = spectrogram(x, nperseg=nfft, noverlap=nfft//2)

Metadata and markers can be accessed by the metadata() and markers() member functions of the AudioLoader object:

with aio.AudioLoader('audio/file.wav', 60.0) as data:
     md = data.metadata()
     locs, labels = data.markers()

See API documentation of the audioloader, audiometadata, and audiomarkers modules for details.

Writing audio data

Write a 1-D or 2-D numpy array into an audio file (data values between -1 and 1) using the write_audio() function:

aio.write_audio('audio/file.wav', data, samplerate)

Again, in 2-D arrays the first axis (rows) is time and the second axis the channel (columns).

Metadata in form of a nested dictionary with key-value pairs, marker positions and spans (locs) as well as associated labels and texts (labels) can also be passed on to the write_audio() function:

aio.write_audio('audio/file.wav', data, samplerate, md, locs, labels)

See API documentation of the audiowriter module for details.

Converting audio files

AudioIO provides a command line script for converting, downsampling, renaming and merging audio files:

> audioconverter -e float -o test.wav test.mp3

If possible, audioconverter tries to keep metadata and marker lists.

See documentation of the audioconverter module for details.

Display metadata and markers

AudioIO provides a command line script that prints metadata and markers of audio files to the console:

> audiometadata test.wav

See documentation of the audiometadata module for details.

Playing sounds

Fade in and out (fade()) and play (play()) a 1-D or 2-D numpy array as a sound (first axis is time and second axis the channel):

aio.fade(data, samplingrate, 0.2)
aio.play(data, samplingrate)

Just beep()

aio.beep()

Beep for half a second and 440 Hz:

aio.beep(0.5, 440.0)
aio.beep(0.5, 'a4')

Musical notes are translated into frequency with the note2freq() function.

See API documentation of the playaudio module for details.

Managing audio modules

Simply run in your terminal

> audiomodules

and you get something like

Status of audio packages on this machine:
-----------------------------------------

wave              is  installed (F)
ewave             not installed (F)
scipy.io.wavfile  is  installed (F)
soundfile         is  installed (F)
wavefile          not installed (F)
audioread         is  installed (F)
pydub             is  installed (F)
pyaudio           not installed (D)
sounddevice       NOT installed (D)
simpleaudio       not installed (D)
soundcard         not installed (D)
ossaudiodev       is  installed (D)
winsound          not installed (D)

F: file I/O, D: audio device

For better performance you should install the following modules:

sounddevice:
------------
The sounddevice package is a wrapper of the portaudio library (http://www.portaudio.com). 
For documentation see https://python-sounddevice.readthedocs.io

First, install the following packages:

sudo apt install libportaudio2 portaudio19-dev python3-cffi

Install the sounddevice module with pip:

sudo pip install sounddevice

Use this to see which audio modules you have already installed on your system, which ones are recommended to install, and how to install them.

See API documentation of the audiomodules module for details.

Used by

thunderlab: Load and preprocess time series data.
thunderfish: Algorithms and programs for analysing electric field recordings of weakly electric fish.
audian: Python-based GUI for viewing and analyzing recordings of animal vocalizations.

Alternatives

All the audio modules AudioIO is using.

Reading and writing audio files:

wave: simple wave file interface of the python standard library.
ewave: extended wave files.
scipy.io.wavfile: simple scipy wave file interface.
SoundFile: support of many open source audio file formats via libsndfile.
wavefile: support of many open source audio file formats via libsndfile.
audioread: mpeg file support.
Pydub: mpeg support for reading and writing, playback via simlpeaudio or pyaudio.
scikits.audiolab: seems to be no longer active.

Metadata:

GUANO: Grand Unified Acoustic Notation Ontology, an extensible, open format for embedding metadata within bat acoustic recordings.

Playing sounds:

sounddevice: wrapper for portaudio.
PyAudio: wrapper for portaudio.
simpleaudio: uses ALSA on Linux, runs well on windows.
SoundCard: playback via CFFI and the native audio libraries of Linux, Windows and macOS.
ossaudiodev: playback via the outdated OSS interface of the python standard library.
winsound: native windows audio playback of the python standard library, asynchronous playback only with wave files.

Not yet supported by audioio:

mutagen: handles audio metadata of many audio file formats.
playsound: pure Python, cross platform, single function module with no dependencies for playing sounds. Plays sounds from files only.
PreferredSoundPlayer: Platfrom independt playing of sound files.
AudioPlayer: cross platform Python 3 package for playing sounds (mp3, wav, ...).

Scientific audio software:

diapason: musical notes like playaudio.note2freq.
librosa: audio and music processing in python.
TimeView: GUI application to view and analyze time series signal data.
scikit-maad: quantitative analysis of environmental audio recordings
Soundscapy: analysing and visualising soundscape assessments.
BatDetect2: detecting and classifying bat echolocation calls in high frequency audio recordings.
Batogram: viewing bat call spectrograms with GUANO metadata, including the ability to click to open the location in Google Maps.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

2.5.0

Mar 16, 2025

2.4.0

Mar 9, 2025

2.3.0

Feb 16, 2025

2.2.0

Jun 21, 2024

2.1.0

Apr 19, 2024

2.0.0

Feb 11, 2024

1.2.0

Feb 4, 2024

1.1.0

Feb 3, 2024

1.0.0

Feb 1, 2024

0.11.0

Jan 5, 2024

0.9.5

Sep 21, 2020

0.9.4

Aug 31, 2020

0.9.3

Aug 21, 2020

0.9.2

Jul 17, 2020

0.9.1

Jul 17, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

audioio-2.5.0.tar.gz (133.1 kB view details)

Uploaded Mar 16, 2025 Source

Built Distribution

audioio-2.5.0-py3-none-any.whl (106.8 kB view details)

Uploaded Mar 16, 2025 Python 3

File details

Details for the file audioio-2.5.0.tar.gz.

File metadata

Download URL: audioio-2.5.0.tar.gz
Upload date: Mar 16, 2025
Size: 133.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.12.3

File hashes

Hashes for audioio-2.5.0.tar.gz
Algorithm	Hash digest
SHA256	`cabb85289fde91ab60b7b77eb4b71b5c90a0c6de7a9a181252da64f5e2949d7a`
MD5	`31a3dd59fa546e76373ee69d959f185f`
BLAKE2b-256	`78185238205e01130f16294eaa854be3e4ef81dcc3f4dcefd7cf8e56f59a99fe`

See more details on using hashes here.

File details

Details for the file audioio-2.5.0-py3-none-any.whl.

File metadata

Download URL: audioio-2.5.0-py3-none-any.whl
Upload date: Mar 16, 2025
Size: 106.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.12.3

File hashes

Hashes for audioio-2.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0a3a1afc5b979ec93714c9e22bb4009778ba92585c0366c549105a142a3bdab9`
MD5	`eb03e10afaaab3a41fe92419b968659d`
BLAKE2b-256	`ddbe2f7e5262188d62fc8a67692c8f342564a19901f9c74648fefc94e66bfe6a`

See more details on using hashes here.

audioio 2.5.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

AudioIO

Features

Installation

Usage

Loading audio data

Writing audio data

Converting audio files

Display metadata and markers

Playing sounds

Managing audio modules

Used by

Alternatives

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes