Skip to main content

Analysis of raw nanopore sequencing data.

Project description

/ONT_logo.png

Tombo

Tombo is a suite of tools primarily for the identification of modified nucleotides from nanopore sequencing data. Tombo also provides tools for the analysis and visualization of raw nanopore signal.

Detailed documentation for all Tombo commands and algorithms can be found on the tombo documentation page.

Features

  • Modified Base Detection

    • Supports both DNA and direct RNA

    • Three detection algorithms support broad range of applications

      • Alternative model (preferred)

      • Sample comparison

      • De novo

  • Reference-anchored raw signal vizualization

  • Raw signal analysis python API

  • User-friendly model estimation methods with tutorial


Getting Started

bioconda_badge pypi_badge

Conda installation (preferred method)

# install via bioconda environment (https://bioconda.github.io/#set-up-channels)
conda install -c bioconda ont-tombo

The first step in any Tombo analysis is to re-squiggle (raw signal to reference sequence alignment) raw nanopore reads. This creates an index and stores the raw signal alignments necessary to perform downstream analyses.

In this example, an E. coli sample is tested for dam and dcm methylation (CpG model also available for human analysis). Using these results, raw signal is plotted at the most significantly modified dcm positions and the dam modified base predictions are output to a wiggle file for use in downstream processing or visualization in a genome browser.

tombo resquiggle path/to/fast5s/ genome.fasta --processes 4 --num-most-common-errors 5
tombo detect_modifications alternative_model --fast5-basedirs path/to/fast5s/ \
    --statistics-file-basename native.e_coli_sample \
    --alternate-bases dam dcm --processes 4

# plot raw signal at most significant dcm locations
tombo plot most_significant --fast5-basedirs path/to/fast5s/ \
    --statistics-filename native.e_coli_sample.dcm.tombo.stats \
    --plot-standard-model --plot-alternate-model dcm \
    --pdf-filename sample.most_significant_dcm_sites.pdf

# produces wig file with estimated fraction of modified reads at each valid reference site
tombo text_output browser_files --statistics-filename native.e_coli_sample.dam.tombo.stats \
    --file-types dampened_fraction --browser-file-basename native.e_coli_sample.dam
# also produce successfully processed reads coverage file for reference
tombo text_output browser_files --fast5-basedirs path/to/fast5s/ \
    --file-types coverage --browser-file-basename native.e_coli_sample

While motif models (CpG, dcm and dam; most accurate) and all-context specific alternate base models (5mC and 6mA; more accurate) are preferred, Tombo also allows users to investigate other or even unknown base modifications.

Here are two example commands running the de_novo method (detect deviations from expected cannonical signal levels) and the level_sample_compare method (detect deviation in signal levels between two samples of interest; works best with high coverage).

tombo detect_modifications de_novo --fast5-basedirs path/to/fast5s/ \
    --statistics-file-basename sample.de_novo_detect --processes 4
tombo text_output browser_files --statistics-filename sample.de_novo_detect.tombo.stats \
    --browser-file-basename sample.de_novo_detect --file-types dampened_fraction

tombo detect_modifications level_sample_compare --fast5-basedirs path/to/fast5s/ \
    --control-fast5-basedirs path/to/control/fast5s/ --minimum-test-reads 50 \
    --processes 4 --statistics-file-basename sample.level_samp_comp_detect
tombo text_output browser_files --statistics-filename sample.level_samp_comp_detect.tombo.stats \
    --browser-file-basename sample.level_samp_comp_detect --file-types statistic

See more complete tutorials on the documentation page.

Alternative Installation Methods

Tombo is available for installation via pip, but requires an R installation as well as R package dependencies (ggplot2 and gridextra) for all visualization functions.

# install pip package (numpy install required before tombo for cython optimization)
pip install numpy
pip install ont-tombo[full]

Tombo can also be installed directly from source (mostly for development) by running the following commands:

git clone https://github.com/nanoporetech/tombo
cd tombo
pip install -e .

Known Issues

Tombo does not support multi-read FAST5 format read data files. Please use the multi_to_single_fast5 command from the ont_fast5_api package in order to convert to single-read FAST5 format before processing with Tombo.

Help

References and Supporting Information

Stoiber, M.H. et al. De novo Identification of DNA Modifications Enabled by Genome-Guided Nanopore Signal Processing. bioRxiv (2016).

http://biorxiv.org/content/early/2017/04/10/094672

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ont-tombo-1.5.1.tar.gz (377.7 kB view details)

Uploaded Source

Built Distribution

ont_tombo-1.5.1-cp37-cp37m-macosx_10_14_x86_64.whl (528.3 kB view details)

Uploaded CPython 3.7m macOS 10.14+ x86-64

File details

Details for the file ont-tombo-1.5.1.tar.gz.

File metadata

  • Download URL: ont-tombo-1.5.1.tar.gz
  • Upload date:
  • Size: 377.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.7.2

File hashes

Hashes for ont-tombo-1.5.1.tar.gz
Algorithm Hash digest
SHA256 f5ae8f4c5fc1ab212306ff14ae93fbd1c1034421148ff2e728f169f69a824380
MD5 ac5cf28caaea9ff6b6b7b6f5395b3a46
BLAKE2b-256 346df3426707cf097d381dee752b73027920353d4a7744d8bdab696012f1bc8f

See more details on using hashes here.

File details

Details for the file ont_tombo-1.5.1-cp37-cp37m-macosx_10_14_x86_64.whl.

File metadata

  • Download URL: ont_tombo-1.5.1-cp37-cp37m-macosx_10_14_x86_64.whl
  • Upload date:
  • Size: 528.3 kB
  • Tags: CPython 3.7m, macOS 10.14+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.7.2

File hashes

Hashes for ont_tombo-1.5.1-cp37-cp37m-macosx_10_14_x86_64.whl
Algorithm Hash digest
SHA256 65232db4da447b3c8b1a7aa817baf6d12a06e3508fa460c4d274f79af32824a1
MD5 a75ba6606196c81cf807e3ba718059e8
BLAKE2b-256 97ae369eafa6f61fe9c10c3dd4dc4043df4a2d3026f10e2f3666a1041a17f1ed

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page