pyscenic

Python implementation of the SCENIC pipeline for transcription factor inference from single-cell transcriptomics experiments.

These details have not been verified by PyPI

Project links

Homepage

Project description

pySCENIC is a lightning-fast python implementation of the SCENIC pipeline (Single-Cell rEgulatory Network Inference and Clustering) which enables biologists to infer transcription factors, gene regulatory networks and cell types from single-cell RNA-seq data.

The pioneering work was done in R and results were published in Nature Methods [1]. A new and comprehensive description of this Python implementation of the SCENIC pipeline is available in Nature Protocols [4].

pySCENIC can be run on a single desktop machine but easily scales to multi-core clusters to analyze thousands of cells in no time. The latter is achieved via the dask framework for distributed computing [2].

Full documentation for pySCENIC is available on Read the Docs

pySCENIC is part of the SCENIC Suite of tools! See the main SCENIC website for additional information and a full list of tools available.

News and releases

0.12.0 | 2022-08-16

Only databases in Feather v2 format are supported now (ctxcore >= 0.2), which allow uses recent versions of pyarrow (>=8.0.0) instead of very old ones (<0.17). Databases in the new format can be downloaded from https://resources.aertslab.org/cistarget/databases/ and end with *.genes_vs_motifs.rankings.feather or *.genes_vs_tracks.rankings.feather.
Support clustered motif databases.
Use custom multiprocessing instead of dask, by default.
Docker image uses python 3.10 and contains only needed pySCENIC dependencies for CLI usage.
Remove unneeded scripts and notebooks for unused/deprecated database formats.

0.11.2 | 2021-05-07

Split some core cisTarget functions out into a separate repository, ctxcore. This is now a required package for pySCENIC.

0.11.1 | 2021-02-11

Fix bug in motif url construction (#275)
Fix for export2loom with sparse dataframe (#278)
Fix sklearn t-SNE import (#285)
Updates to Docker image (expose port 8787 for Dask dashboard)

0.11.0 | 2021-02-10

Major features:

Updated arboreto release (GRN inference step) includes:
- Support for sparse matrices (using the --sparse flag in pyscenic grn, or passing a sparse matrix to grnboost2/genie3).
- Fixes to avoid dask metadata mismatch error
Updated cisTarget:
- Fix for metadata mismatch in ctx prune2df step
- Support for databases Apache Parquet format
- Faster loading from feather databases
- Bugfix: loading genes from a database (previously missing the last gene name in the database)
Support for Anndata input and output
Package updates:
- Upgrade to newer pandas version
- Upgrade to newer numba version
- Upgrade to newer versions of dask, distributed
Input checks and more descriptive error messages.
- Check that regulons loaded are not empty.
Bugfixes:
- In the regulons output from the cisTarget step, the gene weights were incorrectly assigned to their respective target genes (PR #254).
- Motif url construction fixed when running ctx without pruning
- Compression of intermediate files in the CLI steps
- Handle loom files with non-standard gene/cell attribute names
- Reformat the genesig gmt input/output
- Fix AUCell output to loom with non-standard loom attributes

0.10.4 | 2020-11-24

Included new CLI option to add correlation information to the GRN adjacencies file. This can be called with pyscenic add_cor.

Overview

The pipeline has three steps:

First transcription factors (TFs) and their target genes, together defining a regulon, are derived using gene inference methods which solely rely on correlations between expression of genes across cells. The arboreto package is used for this step.
These regulons are refined by pruning targets that do not have an enrichment for a corresponding motif of the TF effectively separating direct from indirect targets based on the presence of cis-regulatory footprints.
Finally, the original cells are differentiated and clustered on the activity of these discovered regulons.

The most impactful speed improvement is introduced by the arboreto package in step 1. This package provides an alternative to GENIE3 [3] called GRNBoost2. This package can be controlled from within pySCENIC.

All the functionality of the original R implementation is available and in addition:

You can leverage multi-core and multi-node clusters using dask and its distributed scheduler.
We implemented a version of the recovery of input genes that takes into account weights associated with these genes.
Regulons, i.e. the regulatory network that connects a TF with its target genes, with targets that are repressed are now also derived and used for cell enrichment analysis.

Additional resources

For more information, please visit LCB, the main SCENIC website, or SCENIC (R version). There is a tutorial to create new cisTarget databases. The CLI to pySCENIC has also been streamlined into a pipeline that can be run with a single command, using the Nextflow workflow manager. There are two Nextflow implementations available:

SCENICprotocol: A Nextflow DSL1 implementation of pySCENIC alongside a basic “best practices” expression analysis. Includes details on pySCENIC installation, usage, and downstream analysis, along with detailed tutorials.
VSNPipelines: A Nextflow DSL2 implementation of pySCENIC with a comprehensive and customizable pipeline for expression analysis. Includes additional pySCENIC features (multi-runs, integrated motif- and track-based regulon pruning, loom file generation).

Acknowledgments

We are grateful to all providers of TF-annotated position weight matrices, in particular Martha Bulyk (UNIPROBE), Wyeth Wasserman and Albin Sandelin (JASPAR), BioBase (TRANSFAC), Scot Wolfe and Michael Brodsky (FlyFactorSurvey) and Timothy Hughes (cisBP).

References

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.12.1

Nov 21, 2022

0.12.0

Aug 16, 2022

0.11.2

May 7, 2021

0.11.1

Apr 16, 2021

0.11.0

Feb 10, 2021

0.10.4

Nov 24, 2020

0.10.3

Jul 17, 2020

0.10.2

Jun 5, 2020

0.10.1

May 17, 2020

0.10.0

Feb 27, 2020

0.9.19

Oct 9, 2019

0.9.18

Sep 25, 2019

0.9.17

Sep 19, 2019

0.9.16

Aug 21, 2019

0.9.15

Jul 28, 2019

0.9.14

Jul 12, 2019

0.9.13

Jul 7, 2019

0.9.12

Jul 7, 2019

0.9.11

Jun 23, 2019

0.9.10

Jun 14, 2019

0.9.9

May 10, 2019

0.9.8

Apr 29, 2019

0.9.7

Mar 21, 2019

0.9.6

Mar 10, 2019

0.9.5

Feb 12, 2019

0.9.4

Jan 24, 2019

0.9.3

Jan 16, 2019

0.9.2

Jan 14, 2019

0.9.1

Dec 20, 2018

0.9.0

Dec 18, 2018

0.8.16

Dec 4, 2018

0.8.15

Dec 4, 2018

0.8.14

Nov 29, 2018

0.8.13

Nov 28, 2018

0.8.12

Nov 26, 2018

0.8.11

Nov 5, 2018

0.8.10

Nov 5, 2018

0.8.9

Aug 22, 2018

0.8.8

Aug 2, 2018

0.8.7

Jul 12, 2018

0.8.6

Jun 27, 2018

0.8.5

Jun 14, 2018

0.8.4

May 3, 2018

0.8.3

May 2, 2018

0.8.2

May 1, 2018

0.8.1

Apr 28, 2018

0.8.0

Apr 27, 2018

0.7.2

Apr 23, 2018

0.7.1

Apr 18, 2018

0.7.0

Apr 17, 2018

0.6.14

Apr 5, 2018

0.6.12

Mar 27, 2018

0.6.11

Mar 26, 2018

0.6.10

Mar 23, 2018

0.6.9

Mar 22, 2018

0.6.8

Mar 22, 2018

0.6.7

Mar 20, 2018

0.6.6

Mar 20, 2018

0.6.5

Mar 19, 2018

0.6.4

Mar 17, 2018

0.6.3

Mar 16, 2018

0.6.2

Mar 16, 2018

0.6.1

Mar 16, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyscenic-0.12.1.tar.gz (7.0 MB view details)

Uploaded Nov 21, 2022 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pyscenic-0.12.1-py3-none-any.whl (7.1 MB view details)

Uploaded Nov 21, 2022 Python 3

File details

Details for the file pyscenic-0.12.1.tar.gz.

File metadata

Download URL: pyscenic-0.12.1.tar.gz
Upload date: Nov 21, 2022
Size: 7.0 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.8.8

File hashes

Hashes for pyscenic-0.12.1.tar.gz
Algorithm	Hash digest
SHA256	`ae8fafa707d2578ffe08f9eed85f14a4cd9e1b53d57217420e2e956f0a8ddba2`
MD5	`ccf2decf031071b8fcdb3d285f784522`
BLAKE2b-256	`d2ea109aa69d72b54ab78eb353ddc8b6ff7e78208b5d85b7d77f5d146b78681d`

See more details on using hashes here.

File details

Details for the file pyscenic-0.12.1-py3-none-any.whl.

File metadata

Download URL: pyscenic-0.12.1-py3-none-any.whl
Upload date: Nov 21, 2022
Size: 7.1 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.8.8

File hashes

Hashes for pyscenic-0.12.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a250d682e073e67dc80505843764d9cade68dada45a40a622e1aefbae78756e9`
MD5	`ced17e15ac05fd9991a55099efd5e7ba`
BLAKE2b-256	`5ce36a0eaf46a897da829c896f0a034fce82133fce72f95d314bea81287c4279`

See more details on using hashes here.

pyscenic 0.12.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

News and releases

0.12.0 | 2022-08-16

0.11.2 | 2021-05-07

0.11.1 | 2021-02-11

0.11.0 | 2021-02-10

0.10.4 | 2020-11-24

Overview

Additional resources

Acknowledgments

References

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes