Skip to main content

REMO v1 regulatory element modules for GRCh38

Project description

remopy

Python implementation of REMO.v1.GRCh38, the R data package from the Stuart Lab.

REMO (Regulatory Element MOdules) provides pre-defined, cell-type annotated regulatory element groupings for single-cell chromatin accessibility analysis.

Installation

# Core data package (just polars)
pip install remopy

# With fragment quantification support
pip install remopy[quantify]

Quick Start

Data Access

import remopy as remo

# Load module coordinates (1.5M CRE intervals → 340k modules)
modules = remo.modules()
print(modules.head())

# Load module metadata
metadata = remo.metadata()
print(metadata.columns)  # ['REMO', 'CREs', 'Bases', 'Chromosome', 'GC_mean', 'CL']

# Get modules associated with a cell type
terms = remo.terms()
t_cell_modules = terms.get('T cell', [])

# Get cell types present in a tissue
tissues = remo.tissues()
brain_cell_types = tissues.get('Brain', [])

Fragment Quantification (scATAC-seq)

Skip peak calling entirely — quantify fragments into REMO:

import scanpy as sc
import remopy as remo

# Quantify fragments into modules (requires polars-bio)
adata = remo.quantify('fragments.tsv.gz', min_fragments=1000)

# Standard scanpy workflow
sc.pp.normalize_total(adata, target_sum=1e4)
sc.pp.log1p(adata)
sc.tl.pca(adata)
sc.pp.neighbors(adata)
sc.tl.umap(adata)
sc.tl.leiden(adata)

Data Contents

Data Description
modules() 1,507,327 CRE intervals grouped into 340,069 modules
metadata() Module-level stats: CRE count, bases, GC content, cell ontology
terms() Cell type name → module ID mappings (144 cell types)
ontology() Cell Ontology ID → module ID mappings
tissues() Tissue → cell type mappings (25 tissues)

Why REMO?

  • No peak calling needed: Use pre-defined, validated features
  • Reproducible: Same features across all datasets
  • Cell-type annotated: Modules linked to Cell Ontology terms
  • Fast: Direct fragment → module quantification

Citation

Lim C, et al. Regulatory element modules as universal features for single-cell chromatin analysis. (2025)

Preprint on bioRxiv

License

Artistic License 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

remopy-1.0.2.tar.gz (29.3 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

remopy-1.0.2-py3-none-any.whl (30.0 MB view details)

Uploaded Python 3

File details

Details for the file remopy-1.0.2.tar.gz.

File metadata

  • Download URL: remopy-1.0.2.tar.gz
  • Upload date:
  • Size: 29.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for remopy-1.0.2.tar.gz
Algorithm Hash digest
SHA256 99cf72fff08e1ddbc9bc459d6f2f864840f29e3ad5153035e0739ad5ee875869
MD5 73830c7d7514d3d45757da01376425bd
BLAKE2b-256 3df3922430858b23b6592f939452865ec02ab01513b1b2f484cf54238aa83b93

See more details on using hashes here.

File details

Details for the file remopy-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: remopy-1.0.2-py3-none-any.whl
  • Upload date:
  • Size: 30.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for remopy-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 026d11b1b17cf8953e84d04351197965cd9f9299c89c081974545f7320bafeec
MD5 a0fb8e36ff31b73cbf2201b9944bb3cb
BLAKE2b-256 f8d755b292ca7ab3663c4c72719cbf430e8a74a3c1946899bab4ff6084809e7c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page