Skip to main content

REMO v1 regulatory element modules for GRCh38

Project description

remopy

Python implementation of REMO.v1.GRCh38, the R data package from the Stuart Lab.

REMO (Regulatory Element MOdules) provides pre-defined, cell-type annotated regulatory element groupings for single-cell chromatin accessibility analysis.

Installation

# Core data package (just polars)
pip install remopy

# With fragment quantification support
pip install remopy[quantify]

Quick Start

Data Access

import remopy as remo

# Load module coordinates (1.5M CRE intervals → 340k modules)
modules = remo.modules()
print(modules.head())

# Load module metadata
metadata = remo.metadata()
print(metadata.columns)  # ['REMO', 'CREs', 'Bases', 'Chromosome', 'GC_mean', 'CL']

# Get modules associated with a cell type
terms = remo.terms()
t_cell_modules = terms.get('T cell', [])

# Get cell types present in a tissue
tissues = remo.tissues()
brain_cell_types = tissues.get('Brain', [])

Fragment Quantification (scATAC-seq)

Skip peak calling entirely — quantify fragments into REMO:

import scanpy as sc
import remopy as remo

# Quantify fragments into modules (requires polars-bio)
adata = remo.quantify('fragments.tsv.gz', min_fragments=1000)

# Standard scanpy workflow
sc.pp.normalize_total(adata, target_sum=1e4)
sc.pp.log1p(adata)
sc.tl.pca(adata)
sc.pp.neighbors(adata)
sc.tl.umap(adata)
sc.tl.leiden(adata)

Data Contents

Data Description
modules() 1,507,327 CRE intervals grouped into 340,069 modules
metadata() Module-level stats: CRE count, bases, GC content, cell ontology
terms() Cell type name → module ID mappings (144 cell types)
ontology() Cell Ontology ID → module ID mappings
tissues() Tissue → cell type mappings (25 tissues)

Why REMO?

  • No peak calling needed: Use pre-defined, validated features
  • Reproducible: Same features across all datasets
  • Cell-type annotated: Modules linked to Cell Ontology terms
  • Fast: Direct fragment → module quantification

Citation

Lim C, et al. Regulatory element modules as universal features for single-cell chromatin analysis. (2025)

Preprint on bioRxiv

License

Artistic License 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

remopy-1.0.0.tar.gz (29.3 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

remopy-1.0.0-py3-none-any.whl (30.0 MB view details)

Uploaded Python 3

File details

Details for the file remopy-1.0.0.tar.gz.

File metadata

  • Download URL: remopy-1.0.0.tar.gz
  • Upload date:
  • Size: 29.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for remopy-1.0.0.tar.gz
Algorithm Hash digest
SHA256 9511aabcf4afa4c894b03f9ad5f7a9be9174f3c6a09773ac1ababf3466a0bdcb
MD5 6933b00d5331b84e9189a9f0b6bcb32e
BLAKE2b-256 e7ba1a4eb83f33ef141636f676ac6252a5135b7600902a9190fcac6604c5eb3e

See more details on using hashes here.

File details

Details for the file remopy-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: remopy-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 30.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for remopy-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a1660563e79c4c43d20b05b97a71b52af23044d91808fd71f4abe8ed75dbd6bf
MD5 72273e9c68b69f415b4205d8c6511e54
BLAKE2b-256 0904b2e56c7362f34d58c14b3c2b550ec4252296c70fba76026cd4b1af56eb48

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page