Skip to main content

scprep

Project description

scprep logo Latest PyPi version Latest Conda version Travis CI Build Read the Docs Coverage Status Twitter GitHub stars Code style: black

scprep provides an all-in-one framework for loading, preprocessing, and plotting matrices in Python, with a focus on single-cell genomics.

The philosophy of scprep:

  • Data shouldn’t be hidden in a complex and bespoke class object. scprep works with numpy arrays, pandas data frames, and scipy sparse matrices, all of which are popular data formats in Python and accepted as input to most common algorithms.

  • Your analysis pipeline shouldn’t have to change based on data format. Changing from a numpy array to a pandas data frame introduces endless technical differences (e.g. in indexing matrices). scprep provides data-agnostic methods that work the same way on all formats.

  • Simple analysis should mean simple code. scprep takes care of annoying edge cases and sets nice defaults so you don’t have to.

  • Using a framework shouldn’t be limiting. Because nothing is hidden from you, you have access to the power of numpy, scipy, pandas and matplotlib just as you would if you used them directly.

Installation

preprocessing is available on pip. Install by running the following in a terminal:

pip install --user scprep

Alternatively, scprep can be installed using Conda (most easily obtained via the Miniconda Python distribution):

conda install -c bioconda scprep

Quick Start

You can use scprep with your single cell data as follows:

import scprep
# Load data
data_path = "~/mydata/my_10X_data"
data = scprep.io.load_10X(data_path)
# Remove empty columns and rows
data = scprep.filter.remove_empty_cells(data)
data = scprep.filter.remove_empty_genes(data)
# Filter by library size to remove background
scprep.plot.plot_library_size(data, cutoff=500)
data = scprep.filter.filter_library_size(data, cutoff=500)
# Filter by mitochondrial expression to remove dead cells
mt_genes = scprep.select.get_gene_set(data, starts_with="MT")
scprep.plot.plot_gene_set_expression(data, genes=mt_genes, percentile=90)
data = scprep.filter.filter_gene_set_expression(data, genes=mt_genes,
                                                percentile=90)
# Library size normalize
data = scprep.normalize.library_size_normalize(data)
# Square root transform
data = scprep.transform.sqrt(data)

Examples

Help

If you have any questions or require assistance using scprep, please read the documentation at https://scprep.readthedocs.io/ or contact us at https://krishnaswamylab.org/get-help

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scprep-1.0.5.post2.tar.gz (8.3 MB view details)

Uploaded Source

Built Distribution

scprep-1.0.5.post2-py3-none-any.whl (98.1 kB view details)

Uploaded Python 3

File details

Details for the file scprep-1.0.5.post2.tar.gz.

File metadata

  • Download URL: scprep-1.0.5.post2.tar.gz
  • Upload date:
  • Size: 8.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.8.1

File hashes

Hashes for scprep-1.0.5.post2.tar.gz
Algorithm Hash digest
SHA256 ddff5cd50afd031f7c65483266c039be388a270ff0bad38b4943b660472848e9
MD5 fa67dbc98175323f7c0556c22f10cf42
BLAKE2b-256 bfdbf43eb5138a66faeae78908c243201e502560551796ed00eb697d0a6dbd35

See more details on using hashes here.

File details

Details for the file scprep-1.0.5.post2-py3-none-any.whl.

File metadata

  • Download URL: scprep-1.0.5.post2-py3-none-any.whl
  • Upload date:
  • Size: 98.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.8.1

File hashes

Hashes for scprep-1.0.5.post2-py3-none-any.whl
Algorithm Hash digest
SHA256 1a1b689359f343683be29a797d237c1d4e82ce7cf32e7547fc2f94a265e02f9f
MD5 cb524bcd363f9d67a0870192b4e77668
BLAKE2b-256 73380a4fe0cd17dd0b7e859718a769b1e799467098709b80b96175f43b098ca6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page