Skip to main content

Circe: Package for building co-accessibility networks from ATAC-seq data.

Project description

Circe logo


CIRCE: Cis-regulatory interactions between chromatin regions

Unit_Tests codecov PyPI version Downloads

Description

This repo contains a python package for inferring co-accessibility networks from single-cell ATAC-seq data, using skggm for the graphical lasso and scanpy for data processing.

It is based on the pipeline and hypotheses presented in the manuscript "Cicero Predicts cis-Regulatory DNA Interactions from Single-Cell Chromatin Accessibility Data" by Pliner et al. (2018). This R package Cicero is available here.

Results may slitghly vary between both packages, notably due to the different implementations of graphical lasso.
Currently, scores are very close when applied to the same metacells, computed from Cicero's methodology. (cf comparison plots below). It should run significantly faster than Cicero (e.g.: running time of 5 sec instead of 17 min for the dataset 2).

If you have any suggestion, don't hesitate ! This package is still a work in progress :)

Installation

The package can be installed using pip:

pip install circe-py

and from github

pip install "git+https://github.com/cantinilab/circe.git"

Minimal example

import anndata as ad
import circe as ci

atac = ad.read_h5ad('atac_data.h5ad')
atac = ci.add_region_infos(atac)
ci.compute_atac_network(atac)
df_network = ci.extract_atac_links(atac)

Comparison to Cicero R package


On the same metacells obtained from Cicero code.

All tests can be found in the circe benchmark repo

Toy dataset 1 (fake data):

  • Pearson correlation coefficient: 0.999126
  • Spearman correlation coefficient: 0.99838

Real dataset 2 (subsample of neurips PBMC)

  • Pearson correlation coefficient: 0.999958
  • Spearman correlation coefficient: 0.999911

Performance on real dataset 2:

  • Runtime: ~100x faster
  • Memory usage: ~5x less

Coming:

  • Calculate metacells !
  • Add stats on similarity on large datasets.
  • Add stats on runtime, memory usage.
  • This package can be run on multiple cores. Currently working to speed up the mutlithreding use.
  • Fix seed for reproducibility.

Usage

It is currently developped to work with AnnData objects. Check Example1.ipynb for a simple usage example.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

circe_py-0.2.0.tar.gz (237.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

circe_py-0.2.0-cp310-cp310-manylinux_2_35_x86_64.whl (237.9 kB view details)

Uploaded CPython 3.10manylinux: glibc 2.35+ x86-64

File details

Details for the file circe_py-0.2.0.tar.gz.

File metadata

  • Download URL: circe_py-0.2.0.tar.gz
  • Upload date:
  • Size: 237.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.2 CPython/3.10.9 Linux/6.5.0-41-generic

File hashes

Hashes for circe_py-0.2.0.tar.gz
Algorithm Hash digest
SHA256 f33b55934fb7a8277e3f80bc36dac3510cc4f4b7a52982ccd479eab77e3b1a64
MD5 a8f849499b98de70ad3966bf3b4e218b
BLAKE2b-256 d68a07de7d8d7d808d9460dde49148871ff9f47e9ec2a4afff17349e97a39b03

See more details on using hashes here.

File details

Details for the file circe_py-0.2.0-cp310-cp310-manylinux_2_35_x86_64.whl.

File metadata

  • Download URL: circe_py-0.2.0-cp310-cp310-manylinux_2_35_x86_64.whl
  • Upload date:
  • Size: 237.9 kB
  • Tags: CPython 3.10, manylinux: glibc 2.35+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.2 CPython/3.10.9 Linux/6.5.0-41-generic

File hashes

Hashes for circe_py-0.2.0-cp310-cp310-manylinux_2_35_x86_64.whl
Algorithm Hash digest
SHA256 b9c30083827d8c6406a936d51d9164948c14d405db96b00eeff6f85a6ecd9efb
MD5 128d74aefcfa5b8fdaab5fcb6a9ec852
BLAKE2b-256 343cbd77f18b47772c5396bf46240fbed2c750b197377daa17917e68f12587bc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page