Skip to main content

NANOME (Nanopore methylation) pipeline developed by Li Lab at The Jackson Laboratory

Project description

NANOME pipeline (Nanopore long-read sequencing data consensus DNA methylation detection)

demo_gif.gif

Highlights of NANOME pipeline

Several first highlights for NANOME

Figure_pipe_comp

  • Enables users to process terabasescale Oxford Nanopore sequencing datasets.
  • Provide a one command line/web-based UI for end-to-end analyzing Nanopore sequencing methylation-callings.
  • Support various platform executions: local, HPC and CloudOS, without needs for tools' installation (NANOME support docker and singularity).
  • First standardized whole genome-wide evaluation framework, considering per-read and per-site performance for singletons/non-singletons, genic and intergenic regions, CpG islands/shores/shelves, different CG densities regions and repetitive regions.
  • The first Nextflow based DNA methylation-calling pipeline for ONT data. Please check more articles about Nextflow based workflow technology from Nature Biotechnology: https://doi.org/10.1038/s41587-020-0439-x and https://doi.org/10.1038/nbt.3820.
  • Allow add new modules/tools in simple config txt file, without need to touch the main pipeline codes, supporting rapid development and evaluation.
  • Consensus of top performers by XGBoost model, allow NA values.
  • Multi-modifications for 5mC and 5hmC.
  • Haplotype-awared phasing and allele-specific methylation detection.

Methodology of NANOME pipeline

Figure1A

Fig. 1A. Survey of methylation calling tools . Timeline of publication and technological developments of Oxford Nanopore Technologies (ONT) methylation calling tools to detect DNA cytosine modifications.

Figure1B Fig. 1B. Workflow for 5-methylcytosine (5mC) detection for nanopore sequencing.

CI/CD features

We use CI Automation Tools to enable the automated testing on every commit and on PRs to make sure that updates are not introducing bugs. Please check the automatic testing results on Github.

System Requirements

Hardware requirements

NANOME pipeline can be easily configured with different RAM, CPU/GPU resources schema to parallelly run methylation-calling tools. For optimal usage, we recommend running NANOME pipeline on HPC or cloud computing platform, e.g., google cloud platform (GCP). The basic hardware requirements are below:

  • GPU or CPU with 2+ cores.
  • RAM: 7+ GB per cpu.
  • Storage using HDD or SSD. Please ensure the storage before running the pipeline.

Software requirements

NANOME pipeline uses Nextflow technology. Users only need to install Nextflow (check the installation guide from https://nf-co.re/usage/installation), and have one of below commonly used environment tool:

We provide conda, docker and singularity environments that depend on below well-known open-source packages for basecalling/methylation-calling/phasing on nanopore sequencing data:

nanopolish >=0.13.2
megalodon >=2.2.9
deepsignal >=0.1.8
ont-tombo >=1.5.1
deepmod >=0.1.3
METEORE >=1.0.0
ont-pyguppy-client-lib >=4.2.2
fast5mod >=1.0.5
Clair3 >=v0.1-r11
Whatshap >=1.0
NanomethPhase bam2bis >= 1.0
GNU Parallel >=20170422

Guppy software >= 4.2.2 from ONT (Oxford Nanopore Technologies) website

Installation

Users only need to install Nextflow (https://nf-co.re/usage/installation). NANOME execution environment will be automatically configured with the support of conda, docker or singularity containers. Below is steps for installing Nextflow:

# Install nextflow
conda install -c conda-forge -c bioconda nextflow
nextflow -v

NANOME pipeline support running with various ways in different platforms:

  • Docker
  • Singularity
  • Conda
  • Local execution: running directly on default platform
  • HPC clusters with SLURM support
  • Cloud computing platform, e.g., Google Cloud Platform(GCP) with google-lifesciences support

Simple usage

Please refer to Usage and Specific Usage and NANOME options for how to use NANOME pipeline. For running on CloudOS platform (e.g., google cloud), please check Usage on CloudOS. We provide a tutorial video for running NANOME pipeline:

IMAGE ALT TEXT HERE

When you have Nextflow software, NANOME pipeline can be directly executed without any other additional installation steps:

# Run NANOME via docker
nextflow run LabShengLi/nanome\
    -profile test,docker

# Run NANOME via singularity
nextflow run LabShengLi/nanome\
    -profile test,singularity

# Run NANOME for human data
nextflow run LabShengLi/nanome\
    -profile test_human,[docker/singularity]

Please note that above commands are integrated in our CI/CD test cases. Our GitHub will automatically test and report results on every commit and PRs (https://github.com/LabShengLi/nanome/actions).

We firstly proposed the standardized whole genome-wide evaluation packages, check standardized evaluation tool usage for more detail. We do not suggest evaluating on a portion of CpGs for performance comparisons.

Pipeline reports for NANOME

Benchmarking reports on our HPC using Nextflow

We constructed a set of benchmarking datasets that contain reads from 800 to about 7,200 reads for NA19240, and monitored job running timeline and resource usage on our HPC, reports generated by Nextflow workflows are: Trace file, Report and Timeline.

Our HPC hardware specifications are as follows:

  • CPU: Intel(R) Xeon(R) Gold 6136 CPU @ 3.00GHz
  • GPU: Tesla V100-SXM2-32GB
  • RAM: 300 GB
  • Slurm manager version: 19.05.5

Timeline figure for benchmarking experiments are below: Bench-timeline

Pipeline DAG

NanomeDag

NANOME report

Please check NANOME report for the sample report by NANOME pipeline.

NanomeReportHtml

Haplotype-aware consensus methylations

Please check phasing usage. PhasingDemo

Lifebit CloudOS report

We now support running NANOME on cloud computing platform. Lifebit is a web-based cloud computing platform, and below is the running reports:

Revision History

For release history, please visit here. For details, please go here.

Contact

If you have any questions/issues/bugs, please post them on GitHub. We will continuously update the GitHub to support famous methylation-calling tools for Oxford Nanopore sequencing.

Reference

DNA methylation-calling tools for Oxford Nanopore sequencing: a survey and human epigenome-wide evaluation. Genome Biology 22, 295 (2021). https://doi.org/10.1186/s13059-021-02510-z

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nanome-jax-2.0.11.tar.gz (2.8 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nanome_jax-2.0.11-py3-none-any.whl (3.0 MB view details)

Uploaded Python 3

File details

Details for the file nanome-jax-2.0.11.tar.gz.

File metadata

  • Download URL: nanome-jax-2.0.11.tar.gz
  • Upload date:
  • Size: 2.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.8.2 pkginfo/1.8.1 requests/2.28.1 requests-toolbelt/0.9.1 tqdm/4.64.0 CPython/3.9.7

File hashes

Hashes for nanome-jax-2.0.11.tar.gz
Algorithm Hash digest
SHA256 2d176f7b648e85e77d8b5054e6016925ef5d5e8a63786e7333077580a803d8f7
MD5 42e32f097ff9f69f10b7ac2867e0bc87
BLAKE2b-256 32a76ea7fa19724d670be02a8b72be6310d1d98a9f394e825bbd8395e94dbb87

See more details on using hashes here.

File details

Details for the file nanome_jax-2.0.11-py3-none-any.whl.

File metadata

  • Download URL: nanome_jax-2.0.11-py3-none-any.whl
  • Upload date:
  • Size: 3.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.8.2 pkginfo/1.8.1 requests/2.28.1 requests-toolbelt/0.9.1 tqdm/4.64.0 CPython/3.9.7

File hashes

Hashes for nanome_jax-2.0.11-py3-none-any.whl
Algorithm Hash digest
SHA256 42d1fc008865505a30ddc73143f7570821e149fbdca38ac58312c67b435fcc26
MD5 7f37f718a90a0e6a3280eb4f180974b0
BLAKE2b-256 787e0c4e3a37ab01ec80dd85aa9124df078a9b2b5e3e09d4315ea974cb98a9e8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page