Analysis of nanopore sequencing data.

Summary

This package provides tools for the analysis of raw nanopore sequencing data, including correction of basecalls and visualization.

Full Documentation

Full documentation is available at Read the Docs: https://nanoraw.readthedocs.io

Installation

Install nanoraw without plotting dependencies (base genome_resquiggle algorithm and text outputs: wig and fasta)

pip install nanoraw

Install nanoraw with plotting dependencies (requires separate installation of R packages ggplot2 and cowplot)

pip install nanoraw[plot]

Install the bleeding-edge version from GitHub

pip install git+https://github.com/marcus1487/nanoraw.git

Usage

nanoraw -h
nanoraw [command] [options]

Main Command (must be run before any other command):

  • genome_resquiggle: Re-annotate raw signal with genomic alignment of existing basecalls.

Genome Anchored Plotting Commands:

  • plot_max_coverage: Plot signal in regions with the maximum coverage.

  • plot_genome_location: Plot signal at defined genomic locations.

  • plot_motif_centered: Plot signal at regions centered on a specific motif.

  • plot_max_difference: Plot signal where signal differs the most between two groups.

  • plot_most_significant: Plot signal where signal differs the most significantly between two groups.

  • plot_motif_with_stats: Plot signal from several regions and test statistics centered on a motif of interest.

Sequencing Time Anchored Plotting Commands:

  • plot_correction: Plot segmentation before and after correction.

  • plot_multi_correction: Plot multiple raw signals anchored by genomic location.

Other Plotting Commands:

  • plot_kmer: Plot signal quantiles across kmers.

  • cluster_most_significant: Cluster traces at bases with significant differences.

Auxiliary Commands:

  • write_most_significant_fasta: Write sequence where signal differs the most significantly between two groups.

  • write_wiggles: Write wiggle files for nanopore signal values, coverage, and statistics.

Get additional help for subcommands with nanoraw [command] -h
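
The help pattern above can also be scripted. The following is a minimal sketch, not part of nanoraw itself: the helper names help_command and show_help are hypothetical, and the subcommand names are copied from the lists above. It relies only on the documented nanoraw [command] -h form and does nothing if nanoraw is not on the PATH.

```python
import shutil
import subprocess

# Subcommand names copied from the command lists in this README.
SUBCOMMANDS = [
    "genome_resquiggle",
    "plot_max_coverage",
    "plot_genome_location",
    "plot_motif_centered",
    "plot_max_difference",
    "plot_most_significant",
    "plot_motif_with_stats",
    "plot_correction",
    "plot_multi_correction",
    "plot_kmer",
    "cluster_most_significant",
    "write_most_significant_fasta",
    "write_wiggles",
]


def help_command(subcommand):
    """Build the argument list for `nanoraw [command] -h` (hypothetical helper)."""
    if subcommand not in SUBCOMMANDS:
        raise ValueError("unknown nanoraw subcommand: %s" % subcommand)
    return ["nanoraw", subcommand, "-h"]


def show_help(subcommand):
    """Run the help command and return its exit code, or None if nanoraw is not installed."""
    if shutil.which("nanoraw") is None:
        return None
    return subprocess.run(help_command(subcommand)).returncode
```

For example, show_help("genome_resquiggle") prints the help text for the main command when nanoraw is installed.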

Requirements

Python requirements:

  • numpy

  • scipy

  • h5py

Optional plotting packages (install R packages with install.packages([package_name]) from an R prompt):

  • rpy2 (python package; with R installation)

  • ggplot2 (required for all plotting subcommands)

  • cowplot (required for plot_motif_with_stats subcommand)
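
To see which of the Python packages listed above are importable in the current environment, something like the following sketch may help. The function name missing_modules and the REQUIRED/OPTIONAL groupings are illustrative assumptions based on the lists above. It does not check the R packages ggplot2 and cowplot, which must be verified from an R prompt.

```python
import importlib.util

# Python packages listed under Requirements in this README.
REQUIRED = ["numpy", "scipy", "h5py"]
# Only needed for the plotting subcommands.
OPTIONAL = ["rpy2"]


def missing_modules(names):
    """Return the top-level module names that cannot be found, without importing them."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    print("Missing required packages:", missing_modules(REQUIRED))
    print("Missing optional packages:", missing_modules(OPTIONAL))
```

An empty list for the required packages suggests the base install is usable; a missing rpy2 only affects plotting.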

Citation

Stoiber, M.H. et al. De novo Identification of DNA Modifications Enabled by Genome-Guided Nanopore Signal Processing. bioRxiv (2016).

http://biorxiv.org/content/early/2017/04/10/094672
