sequentia

A machine learning interface for isolated sequence classification algorithms in Python.

These details have not been verified by PyPI

Project links

Development Status
- 4 - Beta
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Natural Language
- English
Operating System
- MacOS
- Unix
Programming Language
Topic
- Scientific/Engineering
- Software Development

Project description

A machine learning interface for isolated sequence classification algorithms in Python.

Introduction

Sequential data is often observed in many different forms such as audio signals, stock prices, and even brain & heart signals. Such data is of particular interest in machine learning, as changing patterns over time naturally provide many interesting opportunities and challenges for classification.

Sequentia is a Python package that implements various classification algorithms for sequential data.

Some examples of how Sequentia can be used in isolated sequence classification include:

determining a spoken word based on its audio signal or some other representation such as MFCCs,
identifying potential heart conditions such as arrhythmia from ECG signals,
predicting motion intent for gesture control from electrical muscle activity,
classifying hand-written characters according to their pen-tip trajectories,
classifying hand or head gestures from rotation or movement signals,
classifying the sentiment of a phrase or sentence in natural language from word embeddings.

Build status

`master`	`dev`

Features

Sequentia provides the following algorithms, all supporting multivariate sequences with different durations.

Classification algorithms

Hidden Markov Models (via hmmlearn)
Learning with the Baum-Welch algorithm [1]
- Gaussian Mixture Model emissions
- Linear, left-right and ergodic topologies
- Multi-processed predictions
Dynamic Time Warping k-Nearest Neighbors (via dtaidistance)
- Sakoe–Chiba band global warping constraint
- Dependent and independent feature warping (DTWD & DTWI)
- Custom distance-weighted predictions
- Multi-processed predictions
DeepGRU: Deep Gesture Recognition Utility [2]
- Deep recurrent neural network with multiple GRU layers
- Faster training than LSTM-based networks
- Attention module for learning sub-sequence importance

Example of a classification algorithm (HMM sequence classifier)

Preprocessing methods

Centering, standardization and min-max scaling
Decimation and mean downsampling
Mean and median filtering

Installation

You can install Sequentia using pip.

pip install sequentia

Note: All tools under the sequentia.classifiers.rnn module (i.e. DeepGRU and collate_fn) require a working installation of torch (>= 1.8.0), and Sequentia assumes that you already have this installed.

Since there are many different possible configurations when installing PyTorch (e.g. CPU or GPU, CUDA version), we leave this up to the user instead of specifying particular binaries to install alongside TorchFSDD.

You can use the following if you really wish to install a CPU-only version of torch together with Sequentia.
pip install sequentia[torch]

Click here for installation instructions for contributing to Sequentia or running the notebooks.

If you intend to help contribute to Sequentia, you will need some additional dependencies for running tests, notebooks and generating documentation.

Depending on what you intend to do, you can specify the following extras.

For running tests in the /lib/test directory:
```
pip install sequentia[test]
```
For generating Sphinx documentation in the /docs directory:
```
pip install sequentia[docs]
```
For running notebooks in the /notebooks directory:
```
pip install sequentia[notebooks]
```
A full development suite which installs all of the above extras:
```
pip install sequentia[dev]
```

Documentation

Documentation for the package is available on Read The Docs.

Tutorials and examples

For detailed tutorials and examples on the usage of Sequentia, see the notebooks here.

Below are some basic examples of how univariate and multivariate sequences can be used in Sequentia.

Univariate sequences

import numpy as np, sequentia as seq

# Generate training observation sequences and labels
X, y = [
  np.array([1, 0, 5, 3, 7, 2, 2, 4, 9, 8, 7]),
  np.array([2, 1, 4, 6, 5, 8]),
  np.array([5, 8, 0, 3, 1, 0, 2, 7, 9])
], ['good', 'good', 'bad']

# Create and fit the classifier
clf = seq.KNNClassifier(k=1, classes=('good', 'bad'))
clf.fit(X, y)

# Make a prediction for a new observation sequence
x_new = np.array([0, 3, 2, 7, 9, 1, 1])
y_new = clf.predict(x_new)

Multivariate sequences

import numpy as np, sequentia as seq

# Generate training observation sequences and labels
X, y = [
  np.array([[1, 0, 5, 3, 7, 2, 2, 4, 9, 8, 7],
            [3, 8, 4, 0, 7, 1, 1, 3, 4, 2, 9]]).T,
  np.array([[2, 1, 4, 6, 5, 8],
            [5, 3, 9, 0, 8, 2]]).T,
  np.array([[5, 8, 0, 3, 1, 0, 2, 7, 9],
            [0, 2, 7, 1, 2, 9, 5, 8, 1]]).T
], ['good', 'good', 'bad']

# Create and fit the classifier
clf = seq.KNNClassifier(k=1, classes=('good', 'bad'))
clf.fit(X, y)

# Make a prediction for a new observation sequence
x_new = np.array([[0, 3, 2, 7, 9, 1, 1],
                  [2, 5, 7, 4, 2, 0, 8]]).T
y_new = clf.predict(x_new)

Acknowledgments

In earlier versions of the package (<0.10.0), an approximate dynamic time warping algorithm implementation (fastdtw) was used in hopes of speeding up k-NN predictions, as the authors of the original FastDTW paper [3] claim that approximated DTW alignments can be computed in linear memory and time - compared to the O(N^2) runtime complexity of the usual exact DTW implementation.

However, I was recently contacted by Prof. Eamonn Keogh (at University of California, Riverside), whose recent work [4] makes the surprising revelation that FastDTW is generally slower than the exact DTW algorithm that it approximates. Upon switching from the fastdtw package to dtaidistance (a very solid implementation of exact DTW with fast pure C compiled functions), DTW k-NN prediction times were indeed reduced drastically.

I would like to thank Prof. Eamonn Keogh for directly reaching out to me regarding this finding!

References

[1]	Lawrence R. Rabiner. "A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition" Proceedings of the IEEE 77 (1989), no. 2, 257-86.
[2]	Mehran Maghoumi & Joseph J. LaViola Jr. "DeepGRU: Deep Gesture Recognition Utility" Advances in Visual Computing, 14th International Symposium on Visual Computing, ISVC 2019, Proceedings, Part I, 16-31.
[3]	Stan Salvador & Philip Chan. "FastDTW: Toward accurate dynamic time warping in linear time and space." Intelligent Data Analysis 11.5 (2007), 561-580.
[4]	Renjie Wu & Eamonn J. Keogh. "FastDTW is approximate and Generally Slower than the Algorithm it Approximates" IEEE Transactions on Knowledge and Data Engineering (2020), 1–1.

Contributors

All contributions to this repository are greatly appreciated. Contribution guidelines can be found here.

_{Edwin Onuonga} ✉️ 🌍	_Prhmma

Project details

These details have not been verified by PyPI

Project links

Development Status
- 4 - Beta
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Natural Language
- English
Operating System
- MacOS
- Unix
Programming Language
Topic
- Scientific/Engineering
- Software Development

Release history Release notifications | RSS feed

2.0.2

Apr 13, 2024

2.0.1

Apr 2, 2024

2.0.0

Apr 1, 2024

1.1.1

Feb 1, 2023

1.1.0

Jan 18, 2023

1.0.0

Dec 27, 2022

1.0.0a2 pre-release

Dec 6, 2022

1.0.0a1 pre-release

Dec 6, 2022

0.13.1

Jun 26, 2022

0.13.0

Jun 26, 2022

This version

0.12.1

Jan 22, 2022

0.12.0

May 11, 2021

0.11.1

Mar 13, 2021

0.11.0

Jan 26, 2021

0.10.3

Jan 16, 2021

0.10.2

Jan 7, 2021

0.10.1

Dec 31, 2020

0.10.0

Dec 30, 2020

0.7.2

Aug 24, 2020

0.7.1

May 21, 2020

0.7.0

May 19, 2020

0.7.0a1 pre-release

Feb 7, 2020

0.6.1

Jan 16, 2020

0.6.0

Jan 5, 2020

0.5.0

Jan 1, 2020

0.4.0

Dec 29, 2019

0.3.0

Dec 29, 2019

0.2.0

Dec 24, 2019

0.1.0

Dec 22, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sequentia-0.12.1.tar.gz (47.8 kB view details)

Uploaded Jan 22, 2022 Source

File details

Details for the file sequentia-0.12.1.tar.gz.

File metadata

Download URL: sequentia-0.12.1.tar.gz
Upload date: Jan 22, 2022
Size: 47.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.7.1 importlib_metadata/4.10.1 pkginfo/1.8.2 requests/2.23.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.4

File hashes

Hashes for sequentia-0.12.1.tar.gz
Algorithm	Hash digest
SHA256	`8c73e488bad956c18b2d03c41883731946a4d83681de57b1fd0b3abd48c98da6`
MD5	`6129e51ca520f56ef6d2eecb26d5e000`
BLAKE2b-256	`b7d12b8a607e502148ed62e1e8daf1d299cdeea3f13cbb80d313046e31b59ce9`