Skip to main content

Package for Hidden Markov and Hidden Semi-Markov Models

Project description

Chad Hidden Markov Models (ChadHMM)

ChadHMM Logo

chadhmm License: MIT PRs Welcome Downloads Development Status CI Release

Table of Contents

About

This repository was created as an attempt to learn and recreate the parameter estimation for Hidden Markov Models using PyTorch library. Included are models with Categorical and Gaussian emissions for both Hidden Markov Models (HMM) and Hidden Semi-Markov Models(HSMM). As en extension I am trying to include models where the parameter estimation depends on certain set of external variables, these models are referred to as Contextual HMM or Parametric/Conditional HMM where the emission probabilities/distribution parameters are influenced by the context either time dependent or independent.

The documentation on the parameter estimation and model description is captured in - now empty - docs folder. Furthermore, there are examples of the usage, especially on the financial time series, focusing on the sequence prediction but also on the possible interpretation of the model parameters.

Getting Started

This is an example of how you may give instructions on setting up your project locally. To get a local copy up and running follow these simple example steps.

Installation

Option 1: Install from PyPI (recommended)

# Using pip
$ pip install chadhmm

# Using uv
$ uv add chadhmm

Option 2: Development Installation

  1. Clone the repo

    $ git clone https://github.com/GarroshIcecream/ChadHMM.git
    $ cd ChadHMM
    
  2. Install dependencies and the package in development mode

    $ uv sync --dev
    

Usage

Please refer to the docs for more detailed guide on how to create, train and predict sequences using Hidden Markov Models. There is also a section dedicated to visualizing the model parameters as well as its sequence predictions.

See below example of training and inference using MultinomialHMM:

from chadhmm import MultinomialHMM
from chadhmm.schemas import Transitions, InformCriteria
import torch

# Initialize Multinomial HMM with 6 states and 4 emissions
hmm = MultinomialHMM(
  n_states=6,
  n_features=4,
  n_trials=2,
  transitions=Transitions.ERGODIC
)

# Mock the example data and one hot encode
train_seq = torch.randint(0,hmm.n_features,(1000,))
one_hot = hmm.n_trials * torch.nn.functional.one_hot(train_seq,4)

# fit the model using EM algorithm assuming two sequences of lenghts 400 and 600
# also fit the model only once (n_init=1)
hmm.fit(X=one_hot,max_iter=5,lengths=[400,600],n_init=1,verbose=False)

# Compute log likelihood of generated sequence (set by_sample=False for joint log likelihood)
log_likes = hmm.score(
  one_hot,
  lengths=[400,500,100], 
  by_sample=True
)
print(log_likes)

# Compute Akeike Information criteria for each sequence (AIC, BIC or HQC)
ics = hmm.ic(
  one_hot,
  lengths=[400,500,100],
  criterion=InformCriteria.AIC
)
print(ics)

# Get the most likely sequence using Viterbi algorithm (MAP also available)
viterbi_path = hmm.predict(
  one_hot,
  lengths=[400,500,100],
  algorithm='viterbi'
)
print(viterbi_path)

Roadmap

  • Hidden Semi Markov Models
    • Fix computation of posteriors
    • Implementation of Viterbi algorithm for HSMM
  • Integration of contextual models
    • Time dependent context
    • Contextual Variables for covariances using GEM (Generalized Expectation Maximization algorithm)
    • Contextual variables for Multinomial emissions
  • Use Wrapped Distributions instead of Tensors with parameters - instead of Tensor with logits use Categorical distribution
    • Implement different types of covariance matrices
    • Connect that with degrees of freedom
  • Improve the docs with examples
    • Application on financial time series prediction
  • Support for CUDA training
  • Support different types of Transition Matrices - semi, left-to-right and ergodic
  • Support for wider range of emissions distributions
  • K-Means for Gaussian means initialization
  • Code base refactor, abstractions might be confusing

See the open issues for a full list of proposed features (and known issues).

Unit Tests

If you want to run the unit tests, execute the following command:

$ uv run pytest

References

Implementations are based on:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chadhmm-0.4.0.tar.gz (599.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

chadhmm-0.4.0-py3-none-any.whl (34.5 kB view details)

Uploaded Python 3

File details

Details for the file chadhmm-0.4.0.tar.gz.

File metadata

  • Download URL: chadhmm-0.4.0.tar.gz
  • Upload date:
  • Size: 599.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for chadhmm-0.4.0.tar.gz
Algorithm Hash digest
SHA256 32dd482ed8d87c79a94b9a1c2e575b59602faf425aa4d896937d8c5fe11f4ae5
MD5 485158711005b3c17fa11b3cdf9a566a
BLAKE2b-256 cdd86ae2fb4726c10dcfe7cbe529263234c0de24b07fe0588f66b1a52876c409

See more details on using hashes here.

Provenance

The following attestation bundles were made for chadhmm-0.4.0.tar.gz:

Publisher: release.yml on GarroshIcecream/ChadHMM

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file chadhmm-0.4.0-py3-none-any.whl.

File metadata

  • Download URL: chadhmm-0.4.0-py3-none-any.whl
  • Upload date:
  • Size: 34.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for chadhmm-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 913225de1655b32772bf09021a0a21f859457b87f2d9969d7314802d49360ac7
MD5 15e7ec176b018a9b4925cbcc99225008
BLAKE2b-256 1dc58567bc99969194e7e71d6a49627846b82e7c471d4c58b68e4684d453f019

See more details on using hashes here.

Provenance

The following attestation bundles were made for chadhmm-0.4.0-py3-none-any.whl:

Publisher: release.yml on GarroshIcecream/ChadHMM

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page