Skip to main content

Package implementing PLN models

Project description

PLNmodels: Poisson lognormal models

The Poisson lognormal model and variants can be used for analysis of mutivariate count data. This package implements efficient algorithms extracting meaningful data from difficult to interpret and complex multivariate count data. It has been built to scale on large datasets even though it has memory limitations. Possible fields of applications include

  • Genomics (number of times a gene is expressed in a cell)
  • Ecology (species abundances) One main functionality is to normalize the count data to obtain more valuable data. It also analyse the significance of each variable and their correlation as well as the weight of covariates (if available).

Getting started

The getting started can be found here. If you need just a quick view of the package, see next.

Installation

pyPLNmodels is available on pypi. The development version is available on GitHub.

Package installation

pip install pyPLNmodels

Usage and main fitting functions

The package comes with an ecological data set to present the functionality

import pyPLNmodels
from pyPLNmodels.models import PlnPCAcollection, Pln
from pyPLNmodels.oaks import load_oaks
oaks = load_oaks()

Unpenalized Poisson lognormal model (aka PLN)

pln = Pln.from_formula("counts ~ 1  + tree + dist2ground + orientation ", data = oaks, take_log_offsets = True)
pln.fit()
print(pln)
transformed_data = pln.transform()

Rank Constrained Poisson lognormal for Poisson Principal Component Analysis (aka PLNPCA)

pca =  PlnPCAcollection.from_formula("counts ~ 1  + tree + dist2ground + orientation ", data = oaks, take_log_offsets = True, ranks = [3,4,5])
pca.fit()
print(pca)
transformed_data = pln.transform()

References

Please cite our work using the following references:

  • J. Chiquet, M. Mariadassou and S. Robin: Variational inference for probabilistic Poisson PCA, the Annals of Applied Statistics, 12: 2674–2698, 2018. link

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyPLNmodels-0.0.67.tar.gz (3.4 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pyPLNmodels-0.0.67-py3-none-any.whl (140.1 kB view details)

Uploaded Python 3

File details

Details for the file pyPLNmodels-0.0.67.tar.gz.

File metadata

  • Download URL: pyPLNmodels-0.0.67.tar.gz
  • Upload date:
  • Size: 3.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for pyPLNmodels-0.0.67.tar.gz
Algorithm Hash digest
SHA256 15a61f5ea01a06bbbe09b623159780f7efd195544b97f88a17895fe254ee8c98
MD5 b7a7742c0a945104724af07c32c612f6
BLAKE2b-256 af4d5cbe07ee4555d51944fef73a986e1ef2586ce3b07cd63fca50245da064a3

See more details on using hashes here.

File details

Details for the file pyPLNmodels-0.0.67-py3-none-any.whl.

File metadata

  • Download URL: pyPLNmodels-0.0.67-py3-none-any.whl
  • Upload date:
  • Size: 140.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for pyPLNmodels-0.0.67-py3-none-any.whl
Algorithm Hash digest
SHA256 0dba49bddb5f8ab61d7ca0b6335fd8d35442b5699ca8acdee648cd937b3b0a80
MD5 d1b77dbbc9bc690d1b6c2b6711ed7612
BLAKE2b-256 629e202b70193f35ece379c996e7e269f3ba25fc5e2d67f8d70f085265222a40

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page