A library for audio and music analysis, feature extraction.

These details have not been verified by PyPI

Project links

Project description

audioFlux

GitHub Workflow Status (with branch) example branch parameter language GitHub

audioflux is a deep learning tool library for audio and music analysis, feature extraction. It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. It can be provided to deep learning networks for training, and is used to study various tasks in the audio field such as Classification, Separation, Music Information Retrieval(MIR) and ASR etc.

New Features

v0.1.8
- Add a variety of Pitch algorithms: YIN, CEP, PEF, NCF, HPS, LHS, STFT and FFP.
- Add PitchShift and TimeStretch algorithms.

Overview
Installation
- Python Package Install
- Other Build
Quickstart
Benchmark
Documentation
Contributing
Citing
License

Overview

audioFlux is based on data stream design. It decouples each algorithm module in structure, and can quickly and efficiently extract features of multiple dimensions. The following is the main feature architecture diagram.

You can use multiple dimensional feature combinations, select different deep learning networks training, study various tasks in the audio field such as Classification, Separation, MIR etc.

The main functions of audioFlux include transform, feature and mir modules.

1. Transform

In the time–frequency representation, main transform algorithm:

BFT - Based Fourier Transform, similar short-time Fourier transform.
NSGT - Non-Stationary Gabor Transform.
CWT - Continuous Wavelet Transform.
PWT - Pseudo Wavelet Transform.

The above transform supports all the following frequency scale types:

Linear - Short-time Fourier transform spectrogram.
Linspace - Linspace-scale spectrogram.
Mel - Mel-scale spectrogram.
Bark - Bark-scale spectrogram.
Erb - Erb-scale spectrogram.
Octave - Octave-scale spectrogram.
Log - Logarithmic-scale spectrogram.

The following transform are not supports multiple frequency scale types, only used as independent transform:

CQT - Constant-Q Transform.
VQT - Variable-Q Transform.
ST - S-Transform/Stockwell Transform.
FST - Fast S-Transform.
DWT - Discrete Wavelet Transform.
WPT - Wave Packet Transform.
SWT - Stationary Wavelet Transform.

Detailed transform function, description, and use view the documentation.

The synchrosqueezing or reassignment is a technique for sharpening a time-frequency representation, contains the following algorithms:

reassign - reassign transform for STFT.
synsq - reassign data use CWT data.
wsst - reassign transform for CWT.

2. Feature

The feature module contains the following algorithms:

spectral - Spectrum feature, supports all spectrum types.
xxcc - Cepstrum coefficients, supports all spectrum types.
deconv - Deconvolution for spectrum, supports all spectrum types.
chroma - Chroma feature, only supports CQT spectrum, Linear/Octave spectrum based on BFT.

3. MIR

The mir module contains the following algorithms:

pitch - YIN, STFT, etc algorithm.
onset - Spectrum flux, novelty, etc algorithm.
hpss - Median filtering, NMF algorithm.

Installation

language

The library is cross-platform and currently supports Linux, macOS, Windows, iOS and Android systems.

Python Package Install

To install the audioFlux package, Python >=3.6, using the released python package.

Using PyPI:

$ pip install audioflux

Using Anaconda:

$ conda install -c tanky25 -c conda-forge audioflux

Other Build

Quickstart

More example scripts are provided in the Documentation section.

Benchmark

server hardware:

- CPU: AMD Ryzen Threadripper 3970X 32-Core Processor

More detailed performance benchmark are provided in the Benchmark module.

Documentation

Documentation of the package can be found online:

https://audioflux.top

Contributing

We are more than happy to collaborate and receive your contributions to audioFlux. If you want to contribute, please fork the latest git repository and create a feature branch. Submitted requests should pass all continuous integration tests.

You are also more than welcome to suggest any improvements, including proposals for need help, find a bug, have a feature request, ask a general question, new algorithms. Open an issue

Citing

If you want to cite audioFlux in a scholarly work, please use the following ways:

If you are using the library for your work, for the sake of reproducibility, please cite the version you used as indexed at Zenodo:

License

audioFlux project is available MIT License.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.9

May 24, 2024

0.1.8

Feb 28, 2024

0.1.7

Dec 16, 2023

0.1.6

Apr 25, 2023

0.1.5

Apr 23, 2023

0.1.4

Mar 24, 2023

0.1.3

Mar 7, 2023

0.1.2

Feb 11, 2023

0.1.1

Jan 19, 2023

0.0.1

Jan 18, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

audioflux-0.1.9-py3-none-any.whl (70.8 MB view details)

Uploaded May 24, 2024 Python 3

File details

Details for the file audioflux-0.1.9-py3-none-any.whl.

File metadata

Download URL: audioflux-0.1.9-py3-none-any.whl
Upload date: May 24, 2024
Size: 70.8 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.0.0 CPython/3.9.16

File hashes

Hashes for audioflux-0.1.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c4b2d9d72dffb36a1d74e0894d3ea958c5195a7e05ebe779e972c97d593df3c2`
MD5	`c0dfc7326d9517dbba44a382004eb764`
BLAKE2b-256	`bea6f54fdbaaa8c5600afd8a47e4185178c688e8af55f5c455aa8f777aab8c36`

See more details on using hashes here.

audioflux 0.1.9

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

audioFlux

New Features

Table of Contents

Overview

1. Transform

2. Feature

3. MIR

Installation

Python Package Install

Other Build

Quickstart

Benchmark

Documentation

Contributing

Citing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes