Skip to main content

A deep learning method for joint batch correction, denoting, and clustering of single-cell rna-seq data.

Project description

CarDEC

CarDEC (Count adapted regularized Deep Embedded Clustering) is a joint deep learning computational tool that is useful for analyses of single-cell RNA-seq data. CarDEC can be used to:

  1. Correct for batch effect in the full gene expression space, allowing the investigator to remove batch effect from downstream analyses like psuedotime analysis and coexpression analysis. Batch correction is also possible in a low-dimensional embedding space.
  2. Denoise gene expression.
  3. Cluster cells.

Reproducibility

We described and introduced CarDEC in our methodological paper. To find code to reproduce the results we generated in that paper, please visit this separate github repository, which provides all code (including that for other methods) necessary to reproduce our results.

Installation

Recomended installation procedure is as follows.

  1. Install Anaconda if you do not already have it.
  2. Create a conda environment, and then activate it as follows in terminal.
$ conda create -n cardecenv
$ conda activate cardecenv
  1. Install an appropriate version of python.
$ conda install python==3.7
  1. Install nb_conda_kernels so that you can change python kernels in jupyter notebook.
$ conda install nb_conda_kernels
  1. Finally, install CarDEC.
$ pip install CarDEC

Now, to use CarDEC, always make sure you activate the environment in terminal first ("conda activate cardecenv"). And then run jupyter notebook. When you create a notebook to run CarDEC, make sure the active kernel is switched to "cardecenv"

Usage

A tutorial jupyter notebook, together with a dataset, is publicly downloadable.

Software Requirements

  • Python >= 3.7
  • TensorFlow >= 2.0.1, <= 2.3.1
  • scikit-learn == 0.22.2.post1
  • scanpy == 1.5.1
  • louvain == 0.6.1
  • pandas == 1.0.1
  • scipy == 1.4.1

Trouble shooting

Installation on MacOS should be smooth. If installing on Windows Subsystem for Linux (WSL), the user must properly configure their g++ compiler to ensure that the louvain package can be built during installation. If the compiler is not properly configured, the user may encounter a following deprecation error similar to the following.

"DEPRECATION: Could not build wheels for louvain which do not use PEP 517. pip will fall back to legacy 'setup.py install' for these. pip 21.0 will remove support for this functionality. A possible replacement is to fix the wheel build issue reported above."

To fix this error, try to install the libxml2-dev package.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cardec-1.0.3.tar.gz (21.3 kB view details)

Uploaded Source

Built Distribution

cardec-1.0.3-py3-none-any.whl (28.5 kB view details)

Uploaded Python 3

File details

Details for the file cardec-1.0.3.tar.gz.

File metadata

  • Download URL: cardec-1.0.3.tar.gz
  • Upload date:
  • Size: 21.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.5.0.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.7.0

File hashes

Hashes for cardec-1.0.3.tar.gz
Algorithm Hash digest
SHA256 12c7c74e5e932dfb1e57320d0c5aac5b978511f31054b6049cb12b7a2293bef9
MD5 da95fd66fc53f9b287f1732a1f901dc3
BLAKE2b-256 e35d4cfc22c3b81dfc0720c98e496c8a3e528c7bffbcf277e7ec778de7d183cd

See more details on using hashes here.

File details

Details for the file cardec-1.0.3-py3-none-any.whl.

File metadata

  • Download URL: cardec-1.0.3-py3-none-any.whl
  • Upload date:
  • Size: 28.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.5.0.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.7.0

File hashes

Hashes for cardec-1.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 47584cee3d723d0ecd6e695f9a45cec4ec385574474afa663133755b5fae7d4c
MD5 13048d98d7408d6cc39f6f1d43c9eb00
BLAKE2b-256 79a42e6471357686262ca9f17e83cb6ca39092e56506c7b9ad8d828b0a25f6b1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page