Skip to main content

Comberons from single cell transcriptomics in endothelial cells

Project description

DECNEO

This repository contains DECNEO, a Python package that provides bioinformatics utilities for analyzing single cell transcriptomics datasets. DECNEO implements in silico detection of transcriptional regulation genes. The documentation is available at Read the Docs: https://decneo.readthedocs.io/

logo

Getting Started

These are the instructions on how to get a copy of this project up and use it for data analysis.

Installation

The software runs in Python >= 3.8

To install DECNEO as a package:

$ pip install decneo

Alternatively, clone a local copy of this project to install the package from the cloned directory:

git clone https://github.com/sdomanskyi/decneo
python setup.py install

Dependencies

DECNEO is dependent on the following packages, that are installed/updated with installation of DECNEO:

  • Matplotlib - plotting from Python
  • NetworkX - used in network enrichment analysis
  • Pandas and tables - for data storage and analysis
  • NumPy - for processing data
  • sklearn - we use clustering algorithms and metrics
  • adjustText - optimization of text labels locations in plots

Functionality

Overview

The main implementation of DECNEO includes workflow for fast and efficient calculation of single cell gene expression distance (e.g. correlation) followed by the bootstrap technique to account for variation and noise in the input data. The results are summarized in a form of a optimized dendrogram, heatmap and information panels. Analysis of combination of measurements panels allows to identify main and secondary groups of genes that are coexpressed in the cell type of interest.

Input Data Format

Expression data for two different species for comparison is required. For each of these species provide the input gene expression data is expected in one of the following formats:

  1. Spreadsheet of comma-separated values csv where rows are genes, columns are cells with gene expression counts, this should be accompanied by another dataframe with two columns with one specifying batches and the other specifying corresponding cells. Alternatively, the first row of the dataframe should be 'batch' and the second 'cell'.

  2. Pandas DataFrame where axis 0 is genes and axis 1 are cells. If the are batched in the data then the index of axis 1 should have two levels, e.g. ('batch', 'cell'), with the first level indicating patient, batch or expreriment where that cell was sequenced, and the second level containing cell barcodes for identification.

For examples refer to documentation.

Usage Example

We have made an example execution file demo.py that shows how to use decneo.

Download file VoightChoroid4567RemappedData.h5 (456.7 Mb) from https://doi.org/10.5281/zenodo.4419880

This file contains normalized gene expression of 27504 genes of 7996 endothelial cells from 8 batches, and 5704 non-endothelial cells from 8 batches. Genes that are not expressed in endothelial cells are removed from non-endothelial cells dataset

Save the downloaded data file to demo/, or otherwise modify path in demoData of demo.py:

See details of the script demo.py at:

Example walkthrough of demo.py script

To execute the complete script demo.py run:

python demo.py

If reading demo data gives error "unsupported pickle protocol: 5" make sure that python 3.8 is used and latest version of pandas and tables is installed.

Output

Outputs all resulting directories, files, and figures to directory specified as the workingDir when creating an instance of class Analysis. It will also output an analysis report detailing all results and figures.

For a detailed list, refer to the documentation.

Funding

This research project is a part of R01GM122085 grant, funded by NIH/NIGMS.

Licensing

DECNEO is released under an MIT License. Please also consult the folder LICENSES distributed with DECNEO regarding Licensing information for use of external associated content.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

decneo-1.0.7.tar.gz (2.1 MB view details)

Uploaded Source

Built Distribution

decneo-1.0.7-py3-none-any.whl (2.1 MB view details)

Uploaded Python 3

File details

Details for the file decneo-1.0.7.tar.gz.

File metadata

  • Download URL: decneo-1.0.7.tar.gz
  • Upload date:
  • Size: 2.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.7.0 requests/2.25.1 setuptools/52.0.0.post20210125 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.8

File hashes

Hashes for decneo-1.0.7.tar.gz
Algorithm Hash digest
SHA256 326e2005abc36427c46a7f4bbf02684d6520d64c5176d633d91b0b6abbf4601d
MD5 13e84383738d5321068b1f61eddfda13
BLAKE2b-256 c9883e02fc3856e4d50777dd2a72d3f8f84fff321a4aed1181fb704b7b170a00

See more details on using hashes here.

File details

Details for the file decneo-1.0.7-py3-none-any.whl.

File metadata

  • Download URL: decneo-1.0.7-py3-none-any.whl
  • Upload date:
  • Size: 2.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.7.0 requests/2.25.1 setuptools/52.0.0.post20210125 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.8

File hashes

Hashes for decneo-1.0.7-py3-none-any.whl
Algorithm Hash digest
SHA256 e5739f94a806843e4cc9f76975d8cacd6f66992ac69c6cc4c6da510b2f7e8c11
MD5 f95b430635c10ea895d1bfe4837d58e3
BLAKE2b-256 4c1a768b2ead1bf32ea9f75753d2318b0ead415cfbc2b4eba0e191283a10ec4d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page