Skip to main content

eXtensive Audio Representation and Evaluation Suite

Project description

xares

X-ARES: eXtensive Audio Representation and Evaluation Suite

Introduction

X-ARES is a toolkit for training, evaluating, and exporting audio encoders for various audio tasks. It is heavily inspired by the HEAR benchmark.

Supported tasks

Speech

  • Speech Commands V2
  • LibriCount
  • VoxLingua107
  • VoxCeleb1
  • LibriSpeech-Male-Female
  • Fluent Speech Commands
  • VocalSound
  • CREMA-D
  • RAVDESS
  • LibriSpeech-Phoneme
  • speechocean762
  • ASV2015

Environment

  • ESC-50
  • FSD50k
  • UrbanSound 8k
  • DESED
  • FSD18-Kaggle
  • Clotho

Music

  • MAESTRO
  • GTZAN Genre
  • NSynth
  • FMA

Installation

X-ARES is available on PyPI. You can install it via pip.

pip install xares

For development, you can clone the repository and install the package in editable mode.

git clone <this-repo>
cd xares
pip install -e .[examples]

Run with the baseline pretrained audio encoder (Dasheng)

python -m xares.run --max-jobs 8 example/dasheng/dasheng_encoder.py "src/tasks/*.py"

Run with your own pretrained audio encoder

An example of audio encoder wrapper could be found at example/dasheng/dasheng_encoder.py and example/wav2vec2/wav2vec2.py.

We provide a check function to verify if the encoder is correctly implemented:

>>> from xares.audio_encoder_checker import check_audio_encoder

>>> encoder = YourEncoder()
>>> check_audio_encoder(encoder)
True

And then you can run the benchmark with your own encoder:

python -m xares.run --max-jobs 8 your_encoder.py "src/tasks/*.py"

Add your own task

To add a new task, refer to the existing task implementations for guidance. You need to create a TaskConfig tailored to your chosen dataset.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xares-0.0.3.tar.gz (35.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

xares-0.0.3-py3-none-any.whl (39.5 kB view details)

Uploaded Python 3

File details

Details for the file xares-0.0.3.tar.gz.

File metadata

  • Download URL: xares-0.0.3.tar.gz
  • Upload date:
  • Size: 35.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.2

File hashes

Hashes for xares-0.0.3.tar.gz
Algorithm Hash digest
SHA256 85591a1ed07cb9fed82cb612f9d0c2353722b8a67ab385e534f16747bb045723
MD5 da6c9d9d51a5e8bd035fa0de403864df
BLAKE2b-256 c1bc55a2dccb243318f9ed57ba94ede238850ae76c0e87a8c657568056e58e55

See more details on using hashes here.

File details

Details for the file xares-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: xares-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 39.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.2

File hashes

Hashes for xares-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 2b034e9cd4fcdebdce234068b52a8826df167f9ccfc654f20df1513afbd17573
MD5 1661636e8ab04d24fae242846dcf81c6
BLAKE2b-256 130881bb4f38624db103ca075310383826b4f8980e55af35d5616b075e3aa967

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page