Skip to main content

InMoose: the Integrated Multi Omic Open Source Environment

Project description

Documentation Status

InMoose

InMoose is the Integrated Multi Omic Open Source Environment. It is a collection of tools for the analysis of omic data.

Installation

You can install InMoose directly with:

pip install inmoose

Documentation

Documentation is hosted on readthedocs.org.

Batch Effect Correction

InMoose provides features to correct technical biases, also called batch effects, in transcriptomic data:

  • for microarray data, InMoose supersedes pyCombat [1], a Python3 implementation of ComBat [2], one of the most widely used tool for batch effect correction on microarray data.
  • for RNASeq data, InMoose features a port to Python3 of ComBat-Seq [3], one of the most widely used tool for batch effect correction on RNASeq data.

To use these functions, simply import them and call them with default parameters:

from inmoose.pycombat import pycombat_norm, pycombat_seq

microarray_corrected = pycombat_norm(microarray_data, microarray_batches)
rnaseq_corrected = pycombat_seq(rnaseq_data, rnaseq_batches)
  • microarray_data, rnaseq_data: the expression matrices, containing the information about the gene expression (rows) for each sample (columns).
  • microarray_batches, rnaseq_batches: list of batch indices, describing the batch for each sample. The list of batches should contain as many elements as the number of samples in the expression matrix.

Differential Expression Analysis

InMoose provides features to analyse diffentially expressed genes in bulk transcriptomic data:

  • for microarray data, InMoose features a port of limma [4], the de facto standard tool for differential expression analysis on microarray data.
  • for RNASeq data, InMoose features a ports to Python3 of edgeR [5] and DESeq2 [6], two of the most widely used tools for differential expression analysis on RNASeq data.

See the dedicated sections of the documentation.

Consensus clustering

InMoose provides features to compute consensus clustering, a resampling based algorithm compatible with any clustering algorithms which class implementation is instantiated with parameter n_clusters, and possess a fit_predict method, which is invoked on data. Consensus clustering helps determining the best number of clusters to use and output confidence metrics and plots.

To use these functions, import the consensusClustering class and a clustering algorithm class:

from inmoose.consensus_clustering.consensus_clustering import consensusClustering
from sklearn.cluster import AgglomerativeClustering

CC = consensusClustering(AgglomerativeClustering)
CC.compute_consensus_clustering(numpy_ndarray)

How to contribute

Please refer to CONTRIBUTING.md to learn more about the contribution guidelines.

References

[1] Behdenna A, Colange M, Haziza J, Gema A, Appé G, Azencot CA and Nordor A. (2023) pyComBat, a Python tool for batch effects correction in high-throughput molecular data using empirical Bayes methods. BMC Bioinformatics 7;24(1):459. https://doi.org/10.1186/s12859-023-05578-5.

[2] Johnson W E, et al. (2007) Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics, 8, 118–12. https://doi.org/10.1093/biostatistics/kxj037

[3] Zhang Y, et al. (2020) ComBat-Seq: batch effect adjustment for RNASeq count data. NAR Genomics and Bioinformatics, 2(3). https://doi.org/10.1093/nargab/lqaa078

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

inmoose-0.5.1.tar.gz (7.3 MB view details)

Uploaded Source

File details

Details for the file inmoose-0.5.1.tar.gz.

File metadata

  • Download URL: inmoose-0.5.1.tar.gz
  • Upload date:
  • Size: 7.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for inmoose-0.5.1.tar.gz
Algorithm Hash digest
SHA256 5707e159d7b23ad7d1f66d7a6350d517100157a8fd141afc0fd3f0f1e74b351b
MD5 c61afbe7dd20d5bd91b8d6fbb5dc2fe4
BLAKE2b-256 64a554603025df2301946d51f5a10e76f6590b666790f3b3c0c60b5c0dcda85c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page