Speech Emotion Recognition models and training using PyTorch

Vistec-AIS Speech Emotion Recognition

Speech Emotion Recognition models and inference using PyTorch

Installation

From PyPI

pip install vistec-ser

From source

git clone https://github.com/tann9949/vistec-ser.git
cd vistec-ser
python setup.py install
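
After either install route, one quick way to confirm that the package is visible to the current interpreter is to query its installed version. The sketch below uses only the standard library (Python 3.8+); the helper name `installed_version` is ours, not part of vistec-ser, and the distribution name is taken from the `pip install` command above.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


# Prints a version string if the install succeeded, otherwise None.
print(installed_version("vistec-ser"))
```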

Usage

Training with THAI SER Dataset

We provide a Google Colaboratory example for training on the THAI SER dataset using this repository.

Training using provided scripts

Note that this workflow currently supports only pre-loaded features, so it may consume roughly 2 GB of additional RAM.

The dataset contains 80 studio recordings and 20 Zoom recordings. We split it into 10 folds of 10 studios each and evaluate with k-fold cross-validation. We provide two k-fold experiments, one including and one excluding the Zoom recordings; choose between them in the config file (see examples/aisser.yaml).

To run the experiment, run the following command:

python examples/train_fold_aisser.py --config-path <path-to-config> --n-iter <number-of-iterations>  
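
To make the fold-based evaluation concrete, here is a minimal sketch of splitting recording sessions into 10 folds and holding out one fold at a time. It is an illustration only: the session IDs, the `include_zoom` flag, and the contiguous assignment of sessions to folds are assumptions, and the provided training script may assign sessions differently.

```python
from typing import Iterator, List, Tuple


def make_folds(sessions: List[str], n_folds: int) -> List[List[str]]:
    """Split session IDs into n_folds contiguous groups of near-equal size."""
    base, extra = divmod(len(sessions), n_folds)
    folds, start = [], 0
    for i in range(n_folds):
        size = base + (1 if i < extra else 0)
        folds.append(sessions[start:start + size])
        start += size
    return folds


def k_fold(sessions: List[str], n_folds: int) -> Iterator[Tuple[List[str], List[str]]]:
    """Yield (train, test) session lists, holding out one fold at a time."""
    folds = make_folds(sessions, n_folds)
    for i, test in enumerate(folds):
        train = [s for j, fold in enumerate(folds) if j != i for s in fold]
        yield train, test


# Hypothetical session IDs; the real dataset naming may differ.
studio = [f"studio{i:03d}" for i in range(1, 81)]
zoom = [f"zoom{i:03d}" for i in range(1, 21)]

include_zoom = True  # assumed flag mirroring the including/excluding Zoom choice
sessions = studio + zoom if include_zoom else studio
splits = list(k_fold(sessions, n_folds=10))
```

Each element of `splits` is a `(train, test)` pair with no session shared between the two lists, so every session is used for testing exactly once across the folds.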

Inferencing

We also implement a FastAPI backend server as an example of deploying an SER model. To start the server, run:

cd examples
uvicorn server:app --reload

You can customize the server by modifying the inference field in examples/thaiser.yaml.

Once the server is running, you can send an HTTP POST request in form-data format. The response is JSON in the following format:

[
  {
    "name": <request-file-name>,
    "prob": {
      "neutral": <p(neu)>,
      "anger": <p(ang)>,
      "happiness": <p(hap)>,
      "sadness": <p(sad)>
    }
  }, ...
]
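
As a rough illustration of consuming this response on the client side, the sketch below parses a JSON body of the shape shown above and picks the most likely emotion for each file. The helper name `top_emotions`, the file names, and the probabilities are made up for the example and are not real model output.

```python
import json
from typing import Dict, List, Tuple


def top_emotions(payload: str) -> List[Tuple[str, str, float]]:
    """Return (file name, most likely emotion, probability) for each entry."""
    results = []
    for item in json.loads(payload):
        probs: Dict[str, float] = item["prob"]
        label = max(probs, key=probs.get)  # emotion with the highest probability
        results.append((item["name"], label, probs[label]))
    return results


# Made-up response body for illustration only; not real model output.
sample = json.dumps([
    {"name": "clip_a.wav",
     "prob": {"neutral": 0.1, "anger": 0.7, "happiness": 0.1, "sadness": 0.1}},
    {"name": "clip_b.wav",
     "prob": {"neutral": 0.6, "anger": 0.1, "happiness": 0.2, "sadness": 0.1}},
])

for name, label, p in top_emotions(sample):
    print(f"{name}: {label} ({p:.2f})")
```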


Author & Sponsor

Sponsors: AIResearch, AIS

Chompakorn Chaksangchaichot

Email: chompakornc_pro@vistec.ac.th
