Skip to main content

Mixture modeling algorithms using the Student's t-distribution

Project description

studenttmixture

Mixtures of multivariate Student's t distributions are widely used for clustering data that may contain outliers, but scipy and scikit-learn do not at present offer classes for fitting Student's t mixture models. This package provides classes for:

  1. Modeling / clustering a dataset using a finite mixture of multivariate Student's t distributions fit via the EM algorithm. You can select the number of components using either prior knowledge or the information criteria calculated by the model (e.g. AIC, BIC).
  2. Modeling / clustering a dataset using a mixture of multivariate Student's t distributions fit via the variational mean-field approximation. Depending on the hyperparameters you select, the fitting process will automatically "choose" an appropriate number of clusters, so the number of components in this case acts as an upper bound.
  3. Modeling / clustering an infinite mixture of Student's t-distributions (i.e. a Dirichlet process). In practice, this model is fitted using some small modifications to the mean-field recipe and has some of the same advantages and limitations.

(1) and (2) are currently available; (3) will be available in version 0.0.3.

Unittests for the package are in the tests folder.

Installation

pip install studenttmixture

Usage

Background

Upcoming in future versions

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

studenttmixture-0.0.2.tar.gz (26.4 kB view details)

Uploaded Source

Built Distribution

studenttmixture-0.0.2-py3-none-any.whl (29.0 kB view details)

Uploaded Python 3

File details

Details for the file studenttmixture-0.0.2.tar.gz.

File metadata

  • Download URL: studenttmixture-0.0.2.tar.gz
  • Upload date:
  • Size: 26.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.5

File hashes

Hashes for studenttmixture-0.0.2.tar.gz
Algorithm Hash digest
SHA256 69559eeafac4e946ccf0b25f3edbb9d542bc08906d86e14a14f6f8368b7816d1
MD5 735aabc07329dd56c2190076a4f959eb
BLAKE2b-256 62515235ad6b26280ba178fc290c51cfc1cb4653130c37b49c56f135d43a02de

See more details on using hashes here.

File details

Details for the file studenttmixture-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: studenttmixture-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 29.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.5

File hashes

Hashes for studenttmixture-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 358a64b7aa3258bddde32d956305ab782596476168f5b05885fa42997e100218
MD5 7244f8855fce864e61aa68026044dfab
BLAKE2b-256 c8f780e314d66babf7d561b7918f178ea799374a21ad2d998e63930212f83eb8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page