Framework for building deep neural network models for sound, speech, and voice AI
SonusAI
Aaware Sound and Voice Machine Learning Framework.
SonusAI is a Python framework for building deep neural network models for sound, speech, and voice AI, with a performance-critical core implemented in Rust.
Key Features
- Performance Core: High-performance signal processing and data management in Rust.
- Data Augmentation: Advanced noise mixing, pitch shifting, and time-stretching.
- Model Support: Training and evaluation support for RCNN models and for binary and multi-label classification.
- Mixture Generation: Powerful engine for creating complex audio mixture databases.
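As an illustration of the noise-mixing idea behind the augmentation and mixture features above, here is a minimal NumPy sketch that scales a noise signal to a requested signal-to-noise ratio before adding it to speech. The function names `mix_at_snr` and `measured_snr_db` are hypothetical and are not part of the SonusAI API; the actual mixing in SonusAI is driven by its own configuration and may differ.

```python
import numpy as np


def mix_at_snr(speech: np.ndarray, noise: np.ndarray, snr_db: float) -> np.ndarray:
    """Add `noise` to `speech` so that the speech-to-noise power ratio is `snr_db`.

    Hypothetical helper for illustration only; not a SonusAI function.
    """
    # Loop the noise if it is shorter than the speech, then trim to length.
    if len(noise) < len(speech):
        repeats = int(np.ceil(len(speech) / len(noise)))
        noise = np.tile(noise, repeats)
    noise = noise[: len(speech)]

    speech_power = np.mean(speech**2)
    noise_power = np.mean(noise**2)
    if noise_power == 0:
        raise ValueError("noise signal is silent; cannot scale to a target SNR")

    # Solve speech_power / (gain**2 * noise_power) = 10**(snr_db / 10) for gain.
    gain = np.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return speech + gain * noise


def measured_snr_db(speech: np.ndarray, mixture: np.ndarray) -> float:
    """Return the SNR in dB of `mixture`, treating `mixture - speech` as noise."""
    residual = mixture - speech
    return float(10 * np.log10(np.mean(speech**2) / np.mean(residual**2)))


# Synthetic stand-ins for real audio: a 440 Hz tone and white noise at 16 kHz.
rng = np.random.default_rng(0)
t = np.arange(16000) / 16000
speech = 0.5 * np.sin(2 * np.pi * 440 * t)
noise = rng.normal(size=8000)  # shorter than speech, so it gets looped

mixture = mix_at_snr(speech, noise, snr_db=10.0)
print(f"measured SNR: {measured_snr_db(speech, mixture):.2f} dB")
```

The gain is derived from average power over the whole clip, which is the simplest definition of SNR; a production pipeline might instead measure power only over active speech.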
Getting Started
Refer to the Development Guide for installation and setup instructions.
Prerequisites
- Python 3.13+ (Python 3.13 and 3.14 supported)
- Rust Toolchain
- uv
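Before running the build, it can help to confirm that the prerequisites above are present. The following is a small sketch, not part of SonusAI: `missing_prerequisites` is a hypothetical helper, and it assumes the Rust toolchain can be detected by looking for the `cargo` executable on `PATH`.

```python
import shutil
import sys

MIN_PYTHON = (3, 13)
# Assumption: `cargo` on PATH indicates a usable Rust toolchain.
REQUIRED_TOOLS = ("cargo", "uv")


def missing_prerequisites(version=None, which=shutil.which) -> list[str]:
    """Return a list of human-readable descriptions of missing prerequisites.

    `version` and `which` are injectable so the check can be exercised
    without depending on the machine it runs on.
    """
    version = sys.version_info[:2] if version is None else tuple(version)
    missing = []
    if version < MIN_PYTHON:
        missing.append(
            f"Python {MIN_PYTHON[0]}.{MIN_PYTHON[1]}+ (found {version[0]}.{version[1]})"
        )
    for tool in REQUIRED_TOOLS:
        if which(tool) is None:
            missing.append(tool)
    return missing


if __name__ == "__main__":
    problems = missing_prerequisites()
    if problems:
        print("Missing prerequisites: " + ", ".join(problems))
    else:
        print("All prerequisites found.")
```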
Quick Build
./build_local.sh
Documentation
- CLI Reference: Detailed overview of all sonusai commands.
- Development Guide: Setup, build, and test instructions.
- Data Formats & Dimensions: Technical specifications for data processing.
- Architecture: Details on the Rust/Python integration.
- Plugins: Summary of the plugin and extension system.
Current Performance Investigation Focus
- Active performance work is focused on genmixdb (mixture database generation) to establish bottlenecks before runtime refactoring.
- Real-world benchmark scenarios are currently centered on configurations under examples/, including mcgsp-valid, vctk28n2paper, voicebank, and voicebank8kof.
- Results are intended to guide both SonusAI core changes and the downstream data-loading experience in sonusai_torchl projects.
Command Overview
calc_metric_spenh Run speech enhancement and analysis
doc Documentation
evaluate Evaluate model performance
genft Generate feature and truth data
genmix Generate mixture and truth data
genmixdb Generate a mixture database
gentcst Generate target configuration from a subdirectory tree
keras_onnx Convert a trained Keras model to ONNX
keras_predict Run Keras predict on a trained model
keras_train Train a model using Keras
lsdb List information about a mixture database
mkmanifest Make ASR manifest JSON file
mkwav Make WAV files from a mixture database
onnx_predict Run ONNX predict on a trained model
plot Plot mixture data
post_spenh_targetf Run post-processing for speech enhancement targetf data
torchl_onnx Convert a trained PyTorch Lightning model to ONNX
torchl_predict Run Lightning predict on a trained model
torchl_train Train a model using Lightning
tplot Plot truth data
vars List custom SonusAI variables
Use sonusai help <command> for detailed information on any command.