Skip to main content

This is a kit for simple speech feature extraction.

Project description

Speech Feature Kit

A Python wrapper for convenient speech feature extraction

Installation

pip install speech-features-kit

Functions

  1. MFCC feature analysis
  2. Volume analysis
  3. Emotion analysis

Example of emotion analysis

from speech_features_kit.Emotion.speech_toolkit import SpeechEmotionToolkit

# set the path of pre-trained model for speech emotion model
# the used model here is optimized for Chinese speech; however, it is possible you can train your own model. 
speech_kit = SpeechEmotionToolkit()

# load the model
speech_kit.load()

# obtain emotion list with timestamp given an audio file
list_emo, list_timestamp = speech_kit.get_emotion_list_by_blocks(audio_file="../data/english.wav",
                                                                     num_sec_each_file=1)

# print the list of emotion over timestamp
print("Time interval\tEmotion")
for idx, e in enumerate(list_emo):
    print(list_timestamp[idx], "\t", e)

Note

Other functions please see the examples folder!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

speech-features-kit-1.0.0.dev6.tar.gz (4.3 MB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page