Skip to main content

Framework for building deep neural network models for sound, speech, and voice AI

Project description

SonusAI

Aaware Sound and Voice Machine Learning Framework.

SonusAI is a Python framework for building deep neural network models for sound, speech, and voice AI, with a performance-critical core implemented in Rust.

Key Features

  • Performance Core: High-performance signal processing and data management in Rust.
  • Data Augmentation: Advanced noise mixing, pitch shifting, and time-stretching.
  • Model Support: Training and evaluation support for RCNNs, binary, and multi-label classification.
  • Mixture Generation: Powerful engine for creating complex audio mixture databases.

Getting Started

Refer to the Development Guide for installation and setup instructions.

Prerequisites

  • Python 3.13+ (Python 3.13 and 3.14 supported)
  • Rust Toolchain
  • uv

Quick Build

./build_local.sh

Documentation

Current Performance Investigation Focus

  • Active performance work is focused on genmixdb (mixture database generation) to establish bottlenecks before runtime refactoring.
  • Real-world benchmark scenarios are currently centered on configurations under examples/ including mcgsp-valid, vctk28n2paper, voicebank, and voicebank8kof.
  • Results are intended to guide both SonusAI core changes and downstream data-loading experience in sonusai_torchl projects.

Command Overview

   calc_metric_spenh            Run speech enhancement and analysis
   doc                          Documentation
   evaluate                     Evaluate model performance
   genft                        Generate feature and truth data
   genmix                       Generate mixture and truth data
   genmixdb                     Generate a mixture database
   gentcst                      Generate target configuration from a subdirectory tree
   keras_onnx                   Convert a trained Keras model to ONNX
   keras_predict                Run Keras predict on a trained model
   keras_train                  Train a model using Keras
   lsdb                         List information about a mixture database
   mkmanifest                   Make ASR manifest JSON file
   mkwav                        Make WAV files from a mixture database
   onnx_predict                 Run ONNX predict on a trained model
   plot                         Plot mixture data
   post_spenh_targetf           Run post-processing for speech enhancement targetf data
   torchl_onnx                  Convert a trained Pytorch Lightning model to ONNX
   torchl_predict               Run Lightning predict on a trained model
   torchl_train                 Train a model using Lightning
   tplot                        Plot truth data
   vars                         List custom SonusAI variables

Use sonusai help <command> for detailed information on any command.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

sonusai-1.3.3-cp313-abi3-manylinux_2_28_x86_64.whl (7.2 MB view details)

Uploaded CPython 3.13+manylinux: glibc 2.28+ x86-64

sonusai-1.3.3-cp313-abi3-manylinux_2_28_aarch64.whl (6.9 MB view details)

Uploaded CPython 3.13+manylinux: glibc 2.28+ ARM64

sonusai-1.3.3-cp313-abi3-macosx_11_0_arm64.whl (6.8 MB view details)

Uploaded CPython 3.13+macOS 11.0+ ARM64

File details

Details for the file sonusai-1.3.3-cp313-abi3-manylinux_2_28_x86_64.whl.

File metadata

  • Download URL: sonusai-1.3.3-cp313-abi3-manylinux_2_28_x86_64.whl
  • Upload date:
  • Size: 7.2 MB
  • Tags: CPython 3.13+, manylinux: glibc 2.28+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.14 {"installer":{"name":"uv","version":"0.11.14","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for sonusai-1.3.3-cp313-abi3-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 6c8cf1566024a73111ec42ad6482c3bd1df0e16663f4167e75d5eef911ed36aa
MD5 c8fa512419f7d1239da9337912f5b44a
BLAKE2b-256 e80d7296af4ba888ba7848ac812d932a83409b4f185862a36e8e1f99a6fa9d8a

See more details on using hashes here.

File details

Details for the file sonusai-1.3.3-cp313-abi3-manylinux_2_28_aarch64.whl.

File metadata

  • Download URL: sonusai-1.3.3-cp313-abi3-manylinux_2_28_aarch64.whl
  • Upload date:
  • Size: 6.9 MB
  • Tags: CPython 3.13+, manylinux: glibc 2.28+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.14 {"installer":{"name":"uv","version":"0.11.14","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for sonusai-1.3.3-cp313-abi3-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 075aa04edbb1dc350dd7b0bc818db274340b94038c339cfa6d25426abdd5c7e9
MD5 9f850c0766a420bb5442453d039a1c95
BLAKE2b-256 54c348c30df5c43b91cc832c7f070750240511851b6398872fc0879b0a17de4a

See more details on using hashes here.

File details

Details for the file sonusai-1.3.3-cp313-abi3-macosx_11_0_arm64.whl.

File metadata

  • Download URL: sonusai-1.3.3-cp313-abi3-macosx_11_0_arm64.whl
  • Upload date:
  • Size: 6.8 MB
  • Tags: CPython 3.13+, macOS 11.0+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.14 {"installer":{"name":"uv","version":"0.11.14","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for sonusai-1.3.3-cp313-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 d3c2e128935ebd58a8a7307753f764b6141bc52cebabe2b56705c394c03cf2ca
MD5 746c666375e6a05fdbb45ad8d60a0a31
BLAKE2b-256 1de85abb2805d40b2e02818d818b907dac470a7cd941450ee14adb42574aa050

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page