Package implementing PLN models

PLNmodels: Poisson lognormal models

The Poisson lognormal model and its variants can be used to analyse multivariate count data. This package implements efficient algorithms for extracting meaningful information from complex, hard-to-interpret multivariate count data. It is built to scale to large datasets, although it remains limited by available memory. Possible fields of application include:

  • Genomics (number of times a gene is expressed in a cell)
  • Ecology (species abundances)

One of the main functionalities is to normalize the count data to obtain more usable data. The package also analyses the significance of each variable and the correlations between variables, as well as the weight of covariates (if available).
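
For readers unfamiliar with the model, a standard formulation of the Poisson lognormal model, following the reference cited below, is sketched here. The symbols are notation chosen for this sketch and are not identifiers from the package: $Y_{ij}$ is the count of variable $j$ in sample $i$, $Z_i$ is a latent Gaussian vector, $x_i$ holds the covariates, $o_{ij}$ is a known offset, $B$ is the matrix of regression coefficients and $\Sigma$ is the latent covariance matrix.

```latex
\begin{aligned}
Z_i &\sim \mathcal{N}\left(B^\top x_i,\ \Sigma\right), \\
Y_{ij} \mid Z_{ij} &\sim \mathcal{P}\left(\exp\left(o_{ij} + Z_{ij}\right)\right).
\end{aligned}
```

In this formulation the dependence between count variables is carried entirely by $\Sigma$, and the effect of covariates by $B$.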

Getting started

The getting-started guide can be found here. For a quick overview of the package, read on.

Installation

pyPLNmodels is available on PyPI. The development version is available on GitHub.

Package installation

pip install pyPLNmodels
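
After installing, it can be useful to confirm which version ended up in the current environment. The snippet below is a minimal sketch that uses only the Python standard library (`importlib.metadata`, Python 3.8+); the helper name `installed_version` is hypothetical and not part of pyPLNmodels.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of `package`, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Prints the installed version, or None if the package is not installed.
print(installed_version("pyPLNmodels"))
```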

Usage and main fitting functions

The package comes with an ecological data set to demonstrate its functionality:

import pyPLNmodels
from pyPLNmodels.models import PlnPCAcollection, Pln
from pyPLNmodels.oaks import load_oaks
oaks = load_oaks()

Unpenalized Poisson lognormal model (aka PLN)

pln = Pln.from_formula("counts ~ 1 + tree + dist2ground + orientation", data=oaks, take_log_offsets=True)
pln.fit()
print(pln)
transformed_data = pln.transform()
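
To make concrete what kind of data this model describes, the sketch below simulates counts from a Poisson lognormal distribution: a latent Gaussian vector is drawn for each sample, and each count is then Poisson with rate equal to the exponential of the corresponding latent value. It uses only the standard library and does not call pyPLNmodels; the names `sample_poisson` and `simulate_pln` are hypothetical, and covariates and offsets are left out for brevity.

```python
import math
import random


def sample_poisson(rate, rng):
    """Draw one Poisson variate with Knuth's method (suitable for small rates only)."""
    threshold = math.exp(-rate)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def simulate_pln(n_samples, mean, chol, seed=0):
    """Simulate counts Y with Z ~ N(mean, chol @ chol.T) and Y_j | Z_j ~ Poisson(exp(Z_j)).

    `chol` is a lower-triangular Cholesky factor given as a list of rows.
    """
    rng = random.Random(seed)
    dim = len(mean)
    counts = []
    for _ in range(n_samples):
        noise = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        # Correlate the standard normal noise through the Cholesky factor.
        latent = [
            mean[j] + sum(chol[j][k] * noise[k] for k in range(j + 1))
            for j in range(dim)
        ]
        counts.append([sample_poisson(math.exp(z), rng) for z in latent])
    return counts


# Two correlated count variables, five samples.
for row in simulate_pln(5, mean=[0.0, 1.0], chol=[[1.0, 0.0], [0.5, 0.5]]):
    print(row)
```

Fitting a PLN model goes in the opposite direction: from observed counts, it estimates the latent mean and covariance.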

Rank Constrained Poisson lognormal for Poisson Principal Component Analysis (aka PLNPCA)

pca = PlnPCAcollection.from_formula("counts ~ 1 + tree + dist2ground + orientation", data=oaks, take_log_offsets=True, ranks=[3, 4, 5])
pca.fit()
print(pca)
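# Transform with the model selected from the collection (assumes best_model() is available in the installed version)
transformed_data = pca.best_model().transform()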
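
The rank constraint can be written down explicitly. In the formulation of the reference cited below, the latent Gaussian vector is generated from a lower-dimensional one, so the latent covariance has rank at most $q$. The symbols are notation chosen for this sketch, not package identifiers: $W_i$ is a $q$-dimensional latent vector, $C$ is a $p \times q$ loading matrix, $x_i$ holds the covariates, $o_{ij}$ is an offset and $B$ is the matrix of regression coefficients.

```latex
\begin{aligned}
W_i &\sim \mathcal{N}\left(0,\ I_q\right), \\
Z_i &= B^\top x_i + C\, W_i, \qquad \text{so that } \operatorname{Cov}(Z_i) = C C^\top, \\
Y_{ij} \mid Z_{ij} &\sim \mathcal{P}\left(\exp\left(o_{ij} + Z_{ij}\right)\right).
\end{aligned}
```

Under this reading, each value passed in `ranks` corresponds to one candidate $q$, and the collection fits one model per value.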

References

Please cite our work using the following references:

  • J. Chiquet, M. Mariadassou and S. Robin: Variational inference for probabilistic Poisson PCA, The Annals of Applied Statistics, 12: 2674–2698, 2018.

Download files

Source Distribution

pyPLNmodels-0.0.65.tar.gz (3.4 MB view details)

Uploaded Source

Built Distribution

pyPLNmodels-0.0.65-py3-none-any.whl (140.1 kB view details)

Uploaded Python 3

File details

Details for the file pyPLNmodels-0.0.65.tar.gz.

File metadata

  • Download URL: pyPLNmodels-0.0.65.tar.gz
  • Upload date:
  • Size: 3.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for pyPLNmodels-0.0.65.tar.gz
Algorithm    Hash digest
SHA256       47d4642980fb6c3937aa92c40efec21f2cfb3b9e7f751434c0a01bf38cfe533c
MD5          85be7113cff97ea07b779b4225720ce3
BLAKE2b-256  4a2d4066196a516a5cd11197eee4ef687c5bcd91e75ddb9fc51e4e5f017f6404

File details

Details for the file pyPLNmodels-0.0.65-py3-none-any.whl.

File metadata

  • Download URL: pyPLNmodels-0.0.65-py3-none-any.whl
  • Upload date:
  • Size: 140.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for pyPLNmodels-0.0.65-py3-none-any.whl
Algorithm    Hash digest
SHA256       64ca828e19fd0b9bf0bb2401c711fad37f7e337d5b9c3cb1d46b31a309e2e51a
MD5          f7b0140ab7afde39f13b13b0d642344c
BLAKE2b-256  0ab0bf03d41c27b96c8a7b79bf29e300a2bdc4c5d5e1beb681f326ee13d1716f
