The package wavaugmentate makes audio signal augmentation conversions.

These details have not been verified by PyPI

Project links

Project description

Wavaugmentate 0.2.2

Multichannel Audio Signal Augmentation Module

alt text The module makes audio signal augmentation conversions. It provides the MultiChannelSignal, SignalAugmentation classes and wavaug-cli console utility.

MultiChannalSignal provides basic operations with multi-channel signals.
SignalAugmentation helps to perform augmentation of multi-channel signals for AI models learning purpose.

PyPi: https://pypi.org/project/wavaugmentate GitHub: https://github.com/chetverovod/wavaugmentate

Installation

pip install wavaugmentate

Input Data

WAV-file or NumPy array.

Array shape: (num_channels, num_samples).

Output Data

Same types as in section Input_data.

Augmentation Methods

Amplitude (volume change, inversion).
Time shift.
Echo.
Adding noise.
Time stretching. (not implemented)
Tempo change. (not implemented)
Pitch shift. (not implemented)
Adding silence.
Frequency masking. (not implemented)
Time masking. (not implemented)
Combinations of methods.

Additional Functionality

Generation multichannel tonal signals of desired frequency, amplitude, durance.
Generation multichannel speech-like signals of desired formants frequency, amplitude, durance.

Interfaces

Signal augmentation can be applied by two ways:

As python module Mcs, Aug classes methods.
As console application wavaugmentate with CLI interface options.

Python Module

Example 1 (procedural approach):

from wavaugmentate.mcs import MultiChannelSignal as Mcs
from wavaugmentate.aug import SignalAugmentation as Aug


# File name of original sound.
file_name = "./outputwav/sound.wav"

# Create Mcs-object.
mcs = Mcs()

# Read WAV-file to Mcs-object.
mcs.read(file_name)

# Change quantity of channels to 7.
mcs.split(7)

# Create augmentation object.
aug = Aug(mcs)

# Apply delays.
# Corresponds to channels quantity.
delay_list = [0, 150, 200, 250, 300, 350, 400]
aug.delay_ctrl(delay_list)

# Apply amplitude changes.
# Corresponds to channels quantity.
amplitude_list = [1, 0.17, 0.2, 0.23, 0.3, 0.37, 0.4]
aug.amplitude_ctrl(amplitude_list)

# Augmentation result saving by single file, containing 7 channels.
aug.get().write(sound_aug_file_path)

# Augmentation result saving to 7 files, each 1 by channel.
# ./outputwav/sound_augmented_1.wav
# ./outputwav/sound_augmented_2.wav and so on.
aug.get().write_by_channel(sound_aug_file_path)

Original signal is shown on picture: Initial signal

Output signal with augmented data (channel 1 contains original signal without changes): Augmented signal

The same code as chain of operations, Example 2:

from wavaugmentate.mcs import MultiChannel as Mcs
from wavaugmentate.aug import SignalAugmentation as Aug

# File name of original sound.
file_name = "./outputwav/sound.wav"

delay_list = [0, 150, 200, 250, 300, 350, 400]
amplitude_list = [1, 0.17, 0.2, 0.23, 0.3, 0.37, 0.4]

# Apply all transformations of Example 1 in chain.
ao_obj = Aug(Mcs().rd(file_name))
ao_obj.splt(7).dly(delay_list).amp(amplitude_list).get().wr(
"sound_augmented_by_chain.wav"
)

# Augmentation result saving to 7 files, each 1 by channel.
ao_obj.get().wrbc("sound_augmented_by_chain.wav")

CLI

use for help:

wavaug-cli -h

command line interface provides the same functionality.

Example 3 (procedural approach):

wavaug-cli -i ./test_sounds/test_sound_1.wav -o ./outputwav/out.wav -d "100, 200, 300, 400"
wavaug-cli -i ./outputwav/out.wav -o ./outputwav/out.wav -a "0.1, 0.2, 0.3, 0.4"

Example 4 (OOP approach):

wavaug-cli -c 'rd("./test_sounds/test_sound_1.wav").dly([100, 200, 300, 400]).amp([0.1, 0.2, 0.3, 0.4]).wr("./outputwav/sound_delayed.wav")'

How To

Single file to several augmented

Amplitudes and delays will be augmented by code shown in example 5.

Example 5 (single file augmentation):

from wavaugmentate.mcs import MultiChannel as Mcs
from wavaugmentate.aug import SignalAugmentation as Aug

file_name = "./outputwav/sound.wav"
mcs = Mcs()
mcs.rd(file_name)  # Read original file with single channel.
file_name_head = "sound_augmented"

# Suppose we need 15 augmented files.
aug_count = 15
for i in range(aug_count):
   signal = Aug(mcs.copy())
   # Apply random amplitude [0.3..1.7) and delay [70..130)
   # microseconds changes to each copy of original signal.
   signal.amp([1], [0.7]).dly([100], [30])
   name = file_name_head + f"_{i + 1}.wav"
   signal.get().write(name)

Unit Tests

Just run:

export  PYTHONPATH='./src/wavaugmentate'
python3 -m pytest

Test coverage:

---------- coverage: platform linux, python 3.11.4-final-0 -----------
Name                       Stmts   Miss  Cover
----------------------------------------------
common_test_functions.py      15      0   100%
test_mcs_class.py            385      0   100%
test_wavaugmentate.py        293      0   100%
wavaugmentate.py             507     38    93%
----------------------------------------------
TOTAL                       1200     38    97%

Reference

MCS - multi channel signal, it is NumPy array with shape (M_channels, N_samples).

#	Mcs class method	CLI option	Method alias	Description
1	read(path)	-c 'rd(path)'	rd	Read MCS from WAV-file.
2	write(path)	-c 'wr(path)'	wr	Save MCS to WAV-file.
3	file_info(path)	--info	info	Returns WAV-file info.
4	-	-i path	-	Input WAV-file path.
5	-	-o path	-	Output WAV-file path.
6	amplitude_ctrl([c1,c2..cm])	-a "c1,c2..Cm"	amp	Change amplitudes of channels.
7	delay_ctrl([t1,t2..tm])	-d "t1,t2..tm"	dly	Add delays to channels.
8	echo _ctrl([t1,t2..tm],[c1,c2..cm])	-d "t1,t2..tm / c1,c2..Cm"	echo	Add echo to channels.
9	noise_ctrl([c1,c2..cm])	-n "c1,c2..Cm"	ns	Add normal noise to channels.
10	copy	-	cpy	Makes copy of MCS.
11	generate([f1,f2,f3..fm],duration,fs)	-	gen	Creates MCS and generates sine signal for each channel.
12	merge()	-	mrg	Merges all channels to single and returns mono MCS.
13	pause_detect(relative_level)	-	pdt	Searches pauses by selected levels. Returns array-mask.
14	pause_set(pause_map,pause_sz)	-	-	Set pauses length to selected values. Returns updated MCS.
15	rms()	-	rms	Returns list of RMS calculated for object channels.
16	side_by_side(mcs)	-	sbs	Appends channels from mcs data as new channels.
17	split(m_channels)	-	splt	Splits single channel to m_channels copies.
18	sum(mcs2)	-	sum	Adds mcs2 data channels values to object channels data sample by sample.
19	write_by_channel(path)	-	wrbc	Save MCS object channels to separate WAV-files.

Documentation

Documentation on the Read the Docs

For local documentation make clone of repository and look html-version of documentation (docs/_build/html/index.html): html-documentation

Rebuild Documentation

cd docs
make html

Build Package

Install builder:

python3 -m pip install --upgrade build

build package:

python3 -m build

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.2

Oct 6, 2024

0.2.1

Oct 2, 2024

0.1.16

Sep 15, 2024

0.1.13

Sep 15, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wavaugmentate-0.2.2.tar.gz (18.1 kB view details)

Uploaded Oct 6, 2024 Source

Built Distribution

wavaugmentate-0.2.2-py3-none-any.whl (17.7 kB view details)

Uploaded Oct 6, 2024 Python 3

File details

Details for the file wavaugmentate-0.2.2.tar.gz.

File metadata

Download URL: wavaugmentate-0.2.2.tar.gz
Upload date: Oct 6, 2024
Size: 18.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.11.4

File hashes

Hashes for wavaugmentate-0.2.2.tar.gz
Algorithm	Hash digest
SHA256	`42cf0413d5f086194e384ac51aca50e9645c87f2d2a7e6663dbe926167d4daa9`
MD5	`8ff55bd4df89a07ba583937ff2b812aa`
BLAKE2b-256	`450156d77795ad2fc7bc2757c70ce1aa53884abb3f30121831be2cdd6e59f664`

See more details on using hashes here.

File details

Details for the file wavaugmentate-0.2.2-py3-none-any.whl.

File metadata

Download URL: wavaugmentate-0.2.2-py3-none-any.whl
Upload date: Oct 6, 2024
Size: 17.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.11.4

File hashes

Hashes for wavaugmentate-0.2.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e43eb7b178e980fb3ef2d1d5b848cc4d3ec5ecbca7550be01260711772befb27`
MD5	`4a478382eb7578f8f95bc3251fc7cd4c`
BLAKE2b-256	`91c224aa2c5bbd2e7ec82739304d4cb5c39ee314ae3d4dc2b55bc4dc8aed0877`