Skip to main content

Mycobacterium tuberculosis genomic analysis from Nanopore sequencing data

Project description

TBpore

Mycobacterium tuberculosis genomic analysis from Nanopore sequencing data

Python CI codecov PyPI PyPI - Python Version License: MIT Code style: black

Table of Contents

Synopsis

tbpore is a tool with two main goals. First is to process Nanopore Mycobacterium tuberculosis sequencing data to describe variants with respect to the canonical TB strain H37Rv and predict antibiotic resistance (command tbpore process). Variant description is done by decontaminating reads, calling variants with bcftools and filtering variants. Antibiotic resistance is predicted with mykrobe. Second, tbpore can be used to cluster TB samples based on their genotyping and a given distance threshold (command tbpore cluster).

Citation

TBpore is a slimmed-down version of the full pipeline used in our paper 👇

Hall, M. B. et al. Evaluation of Nanopore sequencing for Mycobacterium tuberculosis drug susceptibility testing and outbreak investigation: a genomic analysis. The Lancet Microbe 0, (2022) doi: 10.1016/S2666-5247(22)00301-9.

Installation

conda

Conda (channel only) bioconda version Conda

Prerequisite: conda (and bioconda channel correctly set up)

$ conda install tbpore

pip

PyPI PyPI - Python Version

The python components of tbpore are availble to install through PyPI.

pip install tbpore

However, you will need to install the following dependencies, which cannot be installed through PyPI.

Dependencies

We make no guarentees about the performance of tbpore with versions other than those specified above. In particular, the bcftools version is very important. The latest versions of the other dependencies can likely be used.

Container

Docker images are provided through biocontainers.

singularity

Prerequisite: singularity

$ URI="docker://quay.io/biocontainers/tbpore:<tag>"
$ singularity exec "$URI" tbpore --help

see here for valid values for <tag>.

docker

Docker Repository on Quay

Prerequisite: Docker

$ docker pull quay.io/biocontainers/tbpore:<tag>
$ docker run quay.io/biocontainers/tbpore:<tag> tbpore --help

see here for valid values for <tag>.

Configuring the decontamination database index

After installing TBpore, you will need to download the decontamination database index.

$ tbpore download

By default, this will download the index to ${HOME}/.tbpore/decontamination_db/remove_contam.map-ont.mmi, as this is the default location tbpore process will search for.

If you prefer to download the index to another location, this can be done with

$ tbpore download -o other/location/db.mmi

Keep in mind, if you specify a non-default location, you will need to use the --db option when running tbpore process.

Performance

tbpore process

Benchmarked on 151 TB ONT samples with 1 thread:

  • Runtime: 2103s avg, 4048s max (s = seconds);
  • RAM: 12.4GB avg, 13.1GB max (GB = Gigabytes);

tbpore cluster

Clustering 151 TB ONT samples:

  • Runtime: 286s;
  • RAM: <1GB;

Usage

General usage

Usage: tbpore [OPTIONS] COMMAND [ARGS]...

Options:
  -h, --help     Show this message and exit.
  -V, --version  Show the version and exit.
  -v, --verbose  Turns on debug-level logger. Option is mutually exclusive
                 with quiet.
  -q, --quiet    Turns off all logging except errors. Option is mutually
                 exclusive with verbose.

Commands:
  cluster   Cluster consensus sequences
  download  Download and validate the decontamination database
  process   Single-sample TB genomic analysis from Nanopore sequencing data

process

Usage: tbpore process [OPTIONS] [INPUTS]...

  Single-sample TB genomic analysis from Nanopore sequencing data

  INPUTS: Fastq file(s) and/or a directory containing fastq files. All files
  will be joined into a single fastq file, so ensure they're all part of the
  same sample/isolate.

Options:
  -h, --help                      Show this message and exit.
  -r, --recursive                 Recursively search INPUTS for fastq files
  -S, --name TEXT                 Name of the sample. By default, will use the
                                  first INPUT file with fastq extensions
                                  removed
  -A, --report_all_mykrobe_calls  Report all mykrobe calls (turn on flag -A,
                                  --report_all_calls when calling mykrobe)
  --db PATH                       Path to the decontaminaton database
                                  [default: /home/mihall/.tbpore/decontaminati
                                  on_db/remove_contam.map-ont.mmi]
  -m, --metadata PATH             Path to the decontaminaton database metadata
                                  file  [default: /data/scratch/projects/punim
                                  1703/tmp/outliers/tbpore/data/decontaminatio
                                  n_db/remove_contam.tsv.gz]
  -c, --coverage INTEGER          Depth of coverage to subsample to. Use 0 to
                                  disable
  -o, --outdir DIRECTORY          Directory to place output files  [default:
                                  .]
  --tmp DIRECTORY                 Specify where to write all (tbpore)
                                  temporary files. [default: <outdir>/.tbpore]
  -t, --threads INTEGER           Number of threads to use in multithreaded
                                  tools  [default: 1]
  -d, --cleanup / -D, --no-cleanup
                                  Remove all temporary files on *successful*
                                  completion  [default: no-cleanup]
  --cache DIRECTORY               Path to use for the cache  [default:
                                  /home/mihall/.cache]

cluster

Usage: tbpore cluster [OPTIONS] [INPUTS]...

  Cluster consensus sequences

  Preferably input consensus sequences previously generated with tbpore
  process.

  INPUTS: Two or more consensus fasta sequences. Use glob patterns to input
  several easily (e.g. output/sample_*/*.consensus.fa).

Options:
  -h, --help                      Show this message and exit.
  -T, --threshold INTEGER         Clustering threshold  [default: 6]
  -o, --outdir DIRECTORY          Directory to place output files  [default:
                                  .]
  --tmp DIRECTORY                 Specify where to write all (tbpore)
                                  temporary files. [default: <outdir>/.tbpore]
  -t, --threads INTEGER           Number of threads to use in multithreaded
                                  tools  [default: 1]
  -d, --cleanup / -D, --no-cleanup
                                  Remove all temporary files on *successful*
                                  completion  [default: no-cleanup]
  --cache DIRECTORY               Path to use for the cache  [default:
                                  /Users/michaelhall/.cache]

download

Usage: tbpore download [OPTIONS]

  Download and validate the decontamination database

Options:
  -h, --help         Show this message and exit.
  -o, --output PATH  Download database to a specified filepath  [default: ${HOME}/
                     .tbpore/decontamination_db/remove_contam.map-ont.mmi]
  -f, --force        Force overwrite if the database already exists

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tbpore-0.7.1.tar.gz (2.2 MB view details)

Uploaded Source

Built Distribution

tbpore-0.7.1-py3-none-any.whl (2.2 MB view details)

Uploaded Python 3

File details

Details for the file tbpore-0.7.1.tar.gz.

File metadata

  • Download URL: tbpore-0.7.1.tar.gz
  • Upload date:
  • Size: 2.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.10.12 Darwin/23.5.0

File hashes

Hashes for tbpore-0.7.1.tar.gz
Algorithm Hash digest
SHA256 c48b1aa71a4f9e5c74d12b652e08f098b6fbccfef098be0923b41a788b817c02
MD5 c3f762a27d489e6cef41938c7a6edadc
BLAKE2b-256 c420ee9b891e38d66cdad24909c2b2355c9c1f729fd558121cf3b9d8b3751e97

See more details on using hashes here.

File details

Details for the file tbpore-0.7.1-py3-none-any.whl.

File metadata

  • Download URL: tbpore-0.7.1-py3-none-any.whl
  • Upload date:
  • Size: 2.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.10.12 Darwin/23.5.0

File hashes

Hashes for tbpore-0.7.1-py3-none-any.whl
Algorithm Hash digest
SHA256 81482904db5f3a07612f1a98e4ecb2253238bcf6407cf0109857ecb8ac730810
MD5 6a77f2561939a1e34a995a653d6b87fe
BLAKE2b-256 23e744aad6f3617fbd888f20e6a753d52b1374baa93d47c247b874f797db2ead

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page