Skip to main content

A deep learning method for joint batch correction, denoting, and clustering of cite-seq data.

Project description

CarDEC_CITE

CarDEC (Count adapted regularized Deep Embedded Clustering) is a joint deep learning computational tool that is useful for analyses of single-cell RNA-seq data. CarDEC can be used to:

  1. Correct for batch effect in the full gene expression space, allowing the investigator to remove batch effect from downstream analyses like psuedotime analysis and coexpression analysis. Batch correction is also possible in a low-dimensional embedding space.
  2. Denoise gene expression.
  3. Cluster cells.

I then developed CarDEC_CITE based on CarDEC to further analyze CITE-seq data with addtional protein expression compared to single-cell RNA-seq data.

To use CarDEC_CITE, You will need to first read and QC your data to remove those bad cells. I used “pp.filter_cells“ and “pp.filter_genes” in the Scanpy package, and here is a tutorial published for CarDEC usage.

Please note this current version of CarDEC_CITE:

  • Does not support model_counts() function, so no denoised counts will be generated. But you can get denoised z scores with make_inference() function.
  • It only supports two cases: 1. LVG = LVP = True, 2. LVG = LVP = False.

Reproducibility

Justin Lakkis described and introduced CarDEC in his methodological paper. To find code to reproduce the results he generated in that paper, please visit this separate github repository, which provides all code (including that for other methods) necessary to reproduce his results.

Installation

Recomended installation procedure is as follows.

  1. Install Anaconda if you do not already have it.
  2. Create a conda environment using cardec_cite.yml, and then activate it as follows in terminal.
$ conda env create -f cardec_cite.yml
$ conda activate cardec_cite

Now, to use CarDEC_CITE, always make sure you activate the environment in terminal first ("conda activate cardec_cite"). And then run jupyter notebook. When you create a notebook to run CarDEC_CITE, make sure the active kernel is switched to "cardec_cite".

Software Requirements

  • Python >= 3.7
  • TensorFlow >= 2.0.1, <= 2.3.1
  • scikit-learn == 0.22.2.post1
  • scanpy == 1.5.1
  • louvain == 0.6.1
  • pandas == 1.0.1
  • scipy == 1.4.1

Trouble shooting

Installation on MacOS should be smooth. If installing on Windows Subsystem for Linux (WSL), the user must properly configure their g++ compiler to ensure that the louvain package can be built during installation. If the compiler is not properly configured, the user may encounter a following deprecation error similar to the following.

"DEPRECATION: Could not build wheels for louvain which do not use PEP 517. pip will fall back to legacy 'setup.py install' for these. pip 21.0 will remove support for this functionality. A possible replacement is to fix the wheel build issue reported above."

To fix this error, try to install the libxml2-dev package.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cardec_cite-1.1.0.tar.gz (23.2 kB view details)

Uploaded Source

Built Distribution

cardec_cite-1.1.0-py3-none-any.whl (30.3 kB view details)

Uploaded Python 3

File details

Details for the file cardec_cite-1.1.0.tar.gz.

File metadata

  • Download URL: cardec_cite-1.1.0.tar.gz
  • Upload date:
  • Size: 23.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for cardec_cite-1.1.0.tar.gz
Algorithm Hash digest
SHA256 044df575f4e2fa9e0f9b283018a2eb3b57339a38500a352a48eb013379888690
MD5 f4710013e89ab353422ed7b050d981c7
BLAKE2b-256 7e0694ffe491048b9c797dffd82d5da7d152c9c95647cab14de0aa37ed1f585a

See more details on using hashes here.

File details

Details for the file cardec_cite-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: cardec_cite-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 30.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for cardec_cite-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 1be8d201fe7e6b4da4308cd73e7af00510f6dd4b787b5fc6ce3bd30d0ae0abae
MD5 adf9bcb5a1a885c30b0caef3a3d528ba
BLAKE2b-256 2ac0c4f4a02a171f06eb83f31c35f03127f1e1ee632edbaa547498130d56ba32

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page