Skip to main content

command line tool for the analysis of single-cell bisulfite-sequencing data

Project description

PyPI PyPIDownloads Stars

MethSCAn: a command line tool for Single-Cell Analysis of Methylation data

formerly known as scbs.

Installation

This software requires a working installation of Python 3 (≥3.8) and requires the use of a shell terminal. It was extensively tested on Linux (Ubuntu 18, 20 and 22) and MacOS, and briefly tested on Windows 10.

You can install methscan from the Python package index as follows:

python3 -m pip install --upgrade pip  # you need a recent pip version
python3 -m pip install methscan

Installation of methscan should take no longer than a few seconds. All required dependencies are automatically installed, this may take a few minutes. Afterwards, restart your terminal. The installation is now finished and the command line interface should now be available when typing the command methscan in your terminal. If this is not the case, check the "troubleshooting" section below.

Updating to the latest version

Just use --upgrade when installing the package, otherwise it's the same process as installing:

python3 -m pip install --upgrade methscan

Afterwards, make sure that the latest version is correctly installed:

methscan --version

Tutorial of a typical methscan run

A tutorial / demo can be found here. This gives instructions on how to use methscan on a small example data set which we provide.

Also make sure to read the help by typing methscan --help or by checking this page.

What can this package do?

methscan takes as input a number of single-cell methylation files and allows you to quickly and easily obtain a cell × region matrix for downstream analysis (e.g. PCA, UMAP or clustering). It also facilitates quality control, allows you to discover variably methylated regions (VMRs), accurately quantifies methylation in genomic intervals, and stores your sc-methylomes in an efficient manner. Lastly, you can also select two cell populations and identify differentially methylated regions (DMRs) between them.

schematic showing the capabilities of methscan.

You can find a list of the available methscan commands here.

bioRxiv preprint

For a detailed explanation of the methods implemented in methscan, please check our bioRxiv preprint:

Analyzing single-cell bisulfite sequencing data with scbs
Lukas PM Kremer, Leonie Kuechenhoff, Santiago Cerrizuela, Ana Martin-Villalba, Simon Anders
bioRxiv 2022.06.15.496318; doi: https://doi.org/10.1101/2022.06.15.496318

Hardware requirements

For intermediate data sets consisting of 1000 to 5000 cells, we recommend to use a computer with at least 16 gigabytes of RAM. Very large data sets (~100k cells) require at least 128 GB. Multiple CPU cores are not strictly required but will greatly speed up some commands such as methscan scan or methscan diff when using the --threads argument.

Troubleshooting

Installation issues

Carefully check the output log of PIP. Look for a message like WARNING: The script methscan is installed in '/home/ubuntu/.local/bin' which is not on PATH., which would indicate that you need to add /home/ubuntu/.local/bin to your path. Alternatively, you can copy /home/ubuntu/.local/bin/methscan to e.g. /usr/local/bin.

If you encounter other problems during installation, make sure you have Python3.8 or higher, and make sure you have the latest PIP version. If the problem persists, consider installing methscan in a clean Python environment (for example using venv).

Too many open files

If you encounter a "too many open files" error during methscan prepare (OSError: [Errno 24] Too many open files), you need to increase the maximum number of files that can be opened. In Unix systems, try ulimit -n 9999.

Contributors

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

methscan-1.0.0.tar.gz (42.0 kB view hashes)

Uploaded Source

Built Distribution

methscan-1.0.0-py3-none-any.whl (45.9 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page