Skip to main content

Framework for building deep neural network models for sound, speech, and voice AI

Project description

SonusAI

Aaware Sound and Voice Machine Learning Framework.

SonusAI is a Python framework for building deep neural network models for sound, speech, and voice AI, with a performance-critical core implemented in Rust.

Key Features

  • Performance Core: High-performance signal processing and data management in Rust.
  • Data Augmentation: Advanced noise mixing, pitch shifting, and time-stretching.
  • Model Support: Training and evaluation support for RCNNs, binary, and multi-label classification.
  • Mixture Generation: Powerful engine for creating complex audio mixture databases.

Getting Started

Refer to the Development Guide for installation and setup instructions.

Prerequisites

  • Python 3.13+ (Python 3.13 and 3.14 supported)
  • Rust Toolchain
  • uv

Quick Build

./build_local.sh

Documentation

Current Performance Investigation Focus

  • Active performance work is focused on genmixdb (mixture database generation) to establish bottlenecks before runtime refactoring.
  • Real-world benchmark scenarios are currently centered on configurations under examples/ including mcgsp-valid, vctk28n2paper, voicebank, and voicebank8kof.
  • Results are intended to guide both SonusAI core changes and downstream data-loading experience in sonusai_torchl projects.

Command Overview

   calc_metric_spenh            Run speech enhancement and analysis
   doc                          Documentation
   evaluate                     Evaluate model performance
   genft                        Generate feature and truth data
   genmix                       Generate mixture and truth data
   genmixdb                     Generate a mixture database
   gentcst                      Generate target configuration from a subdirectory tree
   keras_onnx                   Convert a trained Keras model to ONNX
   keras_predict                Run Keras predict on a trained model
   keras_train                  Train a model using Keras
   lsdb                         List information about a mixture database
   mkmanifest                   Make ASR manifest JSON file
   mkwav                        Make WAV files from a mixture database
   onnx_predict                 Run ONNX predict on a trained model
   plot                         Plot mixture data
   post_spenh_targetf           Run post-processing for speech enhancement targetf data
   torchl_onnx                  Convert a trained Pytorch Lightning model to ONNX
   torchl_predict               Run Lightning predict on a trained model
   torchl_train                 Train a model using Lightning
   tplot                        Plot truth data
   vars                         List custom SonusAI variables

Use sonusai help <command> for detailed information on any command.

Project details


Release history Release notifications | RSS feed

This version

1.3.2

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

sonusai-1.3.2-cp313-abi3-manylinux_2_28_x86_64.whl (7.2 MB view details)

Uploaded CPython 3.13+manylinux: glibc 2.28+ x86-64

sonusai-1.3.2-cp313-abi3-manylinux_2_28_aarch64.whl (6.9 MB view details)

Uploaded CPython 3.13+manylinux: glibc 2.28+ ARM64

sonusai-1.3.2-cp313-abi3-macosx_11_0_arm64.whl (6.7 MB view details)

Uploaded CPython 3.13+macOS 11.0+ ARM64

File details

Details for the file sonusai-1.3.2-cp313-abi3-manylinux_2_28_x86_64.whl.

File metadata

  • Download URL: sonusai-1.3.2-cp313-abi3-manylinux_2_28_x86_64.whl
  • Upload date:
  • Size: 7.2 MB
  • Tags: CPython 3.13+, manylinux: glibc 2.28+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.11 {"installer":{"name":"uv","version":"0.11.11","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for sonusai-1.3.2-cp313-abi3-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 67d1450ed7eabf15dabfe59834a7c0c90ffee0380209c1f05638802d03593c34
MD5 2bd07fdbbd7e282f97c4a5223b86b8c8
BLAKE2b-256 a312ecab312e3d181d1d4b3d1e09652e4ac6ede6c847fb32938b958250850292

See more details on using hashes here.

File details

Details for the file sonusai-1.3.2-cp313-abi3-manylinux_2_28_aarch64.whl.

File metadata

  • Download URL: sonusai-1.3.2-cp313-abi3-manylinux_2_28_aarch64.whl
  • Upload date:
  • Size: 6.9 MB
  • Tags: CPython 3.13+, manylinux: glibc 2.28+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.11 {"installer":{"name":"uv","version":"0.11.11","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for sonusai-1.3.2-cp313-abi3-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 a3c99e8fdab2d89276c227f640b4f198beccf99141456213027afb3f3e431a39
MD5 ad54e472cb04b7145fc01684e957108e
BLAKE2b-256 85fb3a237e955bca0c243cfb2ebdbeb9191e73df1636fb540f81f6401dc8a6fc

See more details on using hashes here.

File details

Details for the file sonusai-1.3.2-cp313-abi3-macosx_11_0_arm64.whl.

File metadata

  • Download URL: sonusai-1.3.2-cp313-abi3-macosx_11_0_arm64.whl
  • Upload date:
  • Size: 6.7 MB
  • Tags: CPython 3.13+, macOS 11.0+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.11 {"installer":{"name":"uv","version":"0.11.11","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for sonusai-1.3.2-cp313-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 e159534cfd36236b460c6d11d5dc50c399c51ebdfd6353052b881739db8e8f36
MD5 e446a4b12c6e31240300c67d82455b80
BLAKE2b-256 549684bceedfcf5e254f2100b0a017b802c125b2d8a698f941e1fe20592afcb0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page