REMO v1 regulatory element modules for GRCh38

Project description

remopy

Python implementation of REMO.v1.GRCh38, the R data package from the Stuart Lab.

REMO (Regulatory Element MOdules) provides pre-defined, cell-type annotated regulatory element groupings for single-cell chromatin accessibility analysis.

Installation

# Core data package (just polars)
pip install remopy

# With fragment quantification support
# (the quotes stop shells such as zsh from treating the brackets as a glob)
pip install "remopy[quantify]"

Quick Start

Data Access

import remopy as remo

# Load module coordinates (1.5M CRE intervals → 340k modules)
modules = remo.modules()
print(modules.head())

# Load module metadata
metadata = remo.metadata()
print(metadata.columns)  # ['REMO', 'CREs', 'Bases', 'Chromosome', 'GC_mean', 'CL']

# Get modules associated with a cell type
terms = remo.terms()
t_cell_modules = terms.get('T cell', [])

# Get cell types present in a tissue
tissues = remo.tissues()
brain_cell_types = tissues.get('Brain', [])
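
The two lookups above can be chained to collect every module linked to a tissue. The following is a minimal sketch, not part of remopy: `modules_for_tissue` is a hypothetical helper, the toy dictionaries stand in for the return values of `remo.tissues()` and `remo.terms()`, and the module IDs such as `REMO.1` are made up for illustration. It assumes that both functions return dict-like mappings to lists (as the `.get(..., [])` calls suggest) and that cell type names in the tissue mapping match the keys of the term mapping.

```python
def modules_for_tissue(tissue, tissues, terms):
    """Return the sorted union of module IDs over all cell types in a tissue.

    tissues: mapping of tissue name -> list of cell type names
    terms:   mapping of cell type name -> list of module IDs
    """
    module_ids = set()
    for cell_type in tissues.get(tissue, []):
        # Cell types with no entry in `terms` contribute nothing.
        module_ids.update(terms.get(cell_type, []))
    return sorted(module_ids)


# Toy stand-ins (hypothetical values); with the real package these would be
# tissues = remo.tissues() and terms = remo.terms().
toy_tissues = {"Brain": ["neuron", "astrocyte"], "Blood": ["T cell"]}
toy_terms = {
    "neuron": ["REMO.1", "REMO.2"],
    "astrocyte": ["REMO.2", "REMO.3"],
    "T cell": ["REMO.4"],
}

brain_modules = modules_for_tissue("Brain", toy_tissues, toy_terms)
print(brain_modules)
```

Using a set means a module shared by several cell types in the same tissue is reported once.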

Fragment Quantification (scATAC-seq)

Skip peak calling entirely and quantify fragments directly into REMO modules:

import scanpy as sc
import remopy as remo

# Quantify fragments into modules (requires polars-bio)
adata = remo.quantify('fragments.tsv.gz', min_fragments=1000)

# Standard scanpy workflow
sc.pp.normalize_total(adata, target_sum=1e4)
sc.pp.log1p(adata)
sc.tl.pca(adata)
sc.pp.neighbors(adata)
sc.tl.umap(adata)
sc.tl.leiden(adata)
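
To make the idea of fragment-to-module quantification concrete, here is a conceptual sketch in plain Python. It is not remopy's implementation (which relies on polars-bio) and will be far slower on real data. The function `count_fragments`, the tuple layouts, the half-open coordinate convention, and the rule that a fragment counts once per module even when it overlaps several CREs of that module are all assumptions made for illustration. `min_fragments` is likewise assumed here to be a per-barcode minimum fragment count; the source does not define it.

```python
from collections import Counter


def count_fragments(fragments, cres, min_fragments=0):
    """Count fragments per (barcode, module) pair.

    fragments: iterable of (chrom, start, end, barcode), half-open intervals
    cres:      iterable of (chrom, start, end, module_id), half-open intervals
    Barcodes with fewer than `min_fragments` total fragments are skipped.
    """
    fragments = list(fragments)
    cres = list(cres)
    per_barcode = Counter(barcode for _, _, _, barcode in fragments)

    counts = Counter()
    for chrom, f_start, f_end, barcode in fragments:
        if per_barcode[barcode] < min_fragments:
            continue
        # Collect modules hit by this fragment; a set avoids double counting
        # when the fragment spans two CREs of the same module.
        hit_modules = {
            module_id
            for c_chrom, c_start, c_end, module_id in cres
            if c_chrom == chrom and f_start < c_end and c_start < f_end
        }
        for module_id in hit_modules:
            counts[(barcode, module_id)] += 1
    return counts


# Toy data (hypothetical coordinates, barcodes, and module IDs).
toy_cres = [
    ("chr1", 100, 200, "M1"),
    ("chr1", 500, 600, "M1"),
    ("chr1", 300, 400, "M2"),
]
toy_fragments = [
    ("chr1", 150, 350, "AAAC"),
    ("chr1", 550, 650, "AAAC"),
    ("chr1", 390, 510, "TTTG"),
    ("chr2", 100, 200, "TTTG"),
]

counts = count_fragments(toy_fragments, toy_cres)
for (barcode, module_id), n in sorted(counts.items()):
    print(barcode, module_id, n)
```

The nested loop is quadratic and only suitable for toy inputs; a real implementation would use an interval index or a sorted-merge overlap join.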

Data Contents

Data         Description
modules()    1,507,327 CRE intervals grouped into 340,069 modules
metadata()   Module-level stats: CRE count, bases, GC content, cell ontology
terms()      Cell type name → module ID mappings (144 cell types)
ontology()   Cell Ontology ID → module ID mappings
tissues()    Tissue → cell type mappings (25 tissues)
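
As a quick sense of scale, the counts listed above imply an average module size of a little over four CREs. The arithmetic below uses only the two totals stated in this section; the variable names are illustrative.

```python
# Totals as stated in the Data Contents section.
n_cre_intervals = 1_507_327
n_modules = 340_069

# Average number of CRE intervals per module.
mean_cres_per_module = n_cre_intervals / n_modules
print(f"{mean_cres_per_module:.2f} CREs per module on average")
```

This is only a mean; the per-module CRE count is available from the CREs column of `metadata()`.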

Why REMO?

  • No peak calling needed: Use pre-defined, validated features
  • Reproducible: Same features across all datasets
  • Cell-type annotated: Modules linked to Cell Ontology terms
  • Fast: Direct fragment → module quantification

Citation

Lim C, et al. Regulatory element modules as universal features for single-cell chromatin analysis. (2025)

Preprint on bioRxiv

License

Artistic License 2.0
