X-ARES: eXtensive Audio Representation and Evaluation Suite
Introduction
X-ARES is a toolkit for training, evaluating, and exporting audio encoders for various audio tasks. It is heavily inspired by the HEAR benchmark.
Supported tasks
Speech
- Speech Commands V2
- LibriCount
- VoxLingua107
- VoxCeleb1
- LibriSpeech-Male-Female
- Fluent Speech Commands
- VocalSound
- CREMA-D
- RAVDESS
- LibriSpeech-Phoneme
- speechocean762
- ASV2015
Environment
- ESC-50
- FSD50k
- UrbanSound 8k
- DESED
- FSD18-Kaggle
- Clotho
Music
- MAESTRO
- GTZAN Genre
- NSynth
- FMA
Installation
X-ARES is available on PyPI. You can install it via pip.
pip install xares
For development, you can clone the repository and install the package in editable mode.
git clone <this-repo>
cd xares
pip install -e ".[examples]"
Run with the baseline pretrained audio encoder (Dasheng)
python -m xares.run --max-jobs 8 example/dasheng/dasheng_encoder.py "src/tasks/*.py"
Run with your own pretrained audio encoder
Examples of audio encoder wrappers can be found at example/dasheng/dasheng_encoder.py and example/wav2vec2/wav2vec2.py.
We provide a check function to verify if the encoder is correctly implemented:
>>> from xares.audio_encoder_checker import check_audio_encoder
>>> encoder = YourEncoder()
>>> check_audio_encoder(encoder)
True
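For orientation, a wrapper might look roughly like the sketch below. It is not taken from the X-ARES sources: the class name ToyEncoder, the attribute names sampling_rate, output_dim, and hop_size_in_ms, and the [batch, frames, output_dim] output layout are all assumptions about what the checker expects. Compare against example/dasheng/dasheng_encoder.py before relying on any of them.

```python
import torch


class ToyEncoder(torch.nn.Module):
    """Hypothetical encoder: cuts the waveform into frames and projects each one."""

    def __init__(self, output_dim: int = 64, hop_size_in_ms: int = 40):
        super().__init__()
        # Assumed attribute names; verify against the bundled example wrappers.
        self.sampling_rate = 16000
        self.output_dim = output_dim
        self.hop_size_in_ms = hop_size_in_ms
        # 40 ms at 16 kHz corresponds to 640 samples per frame.
        self.frame_len = self.sampling_rate * hop_size_in_ms // 1000
        self.proj = torch.nn.Linear(self.frame_len, output_dim)

    def forward(self, audio: torch.Tensor) -> torch.Tensor:
        # audio: [batch, samples] mono waveform at self.sampling_rate
        batch, samples = audio.shape
        # Zero-pad on the right so the length is a multiple of frame_len.
        pad = (-samples) % self.frame_len
        audio = torch.nn.functional.pad(audio, (0, pad))
        frames = audio.reshape(batch, -1, self.frame_len)
        # Returns [batch, frames, output_dim]
        return self.proj(frames)


encoder = ToyEncoder()
features = encoder(torch.randn(3, 50000))
print(features.shape)
```

A real wrapper would replace the linear projection with a pretrained model and keep the same attributes and output layout, assuming those are indeed what check_audio_encoder inspects.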
Then run the benchmark with your own encoder:
python -m xares.run --max-jobs 8 your_encoder.py "src/tasks/*.py"
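If you would rather launch the benchmark from a Python script, for example to loop over several encoders, the same command can be assembled as an argument list. This is a minimal sketch: build_xares_command and run_benchmark are hypothetical helper names, and only the module name, the --max-jobs flag, and the two positional arguments come from the command above.

```python
import subprocess
import sys


def build_xares_command(
    encoder_path: str,
    task_glob: str = "src/tasks/*.py",
    max_jobs: int = 8,
) -> list[str]:
    """Assemble the documented `python -m xares.run` invocation as an argument list."""
    return [
        sys.executable, "-m", "xares.run",
        "--max-jobs", str(max_jobs),
        encoder_path,
        # Passed as a literal pattern, just like the quoted glob in the shell command.
        task_glob,
    ]


def run_benchmark(encoder_path: str) -> None:
    """Hypothetical helper: run the benchmark and raise if it exits with an error."""
    subprocess.run(build_xares_command(encoder_path), check=True)


command = build_xares_command("your_encoder.py")
print(" ".join(command[1:]))
```

Because the list is handed to subprocess.run without a shell, the task pattern reaches xares.run unexpanded, which matches the quoted form used on the command line.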
Add your own task
To add a new task, refer to the existing task implementations for guidance. You need to create a TaskConfig tailored to your chosen dataset.