mintyper: an outbreak-detection method for accurate and rapid SNP typing of clonal clusters with noisy long reads.
Project description
Read more about mintyper here:
https://academic.oup.com/biomethods/article/6/1/bpab008/6243723
Table of contents
Introduction
Mintyper is a tool designed to quickly estimate a distance matrix from a set of input sequencing reads. It can take both short read from Illumina's sequencing platforms and long reads from Oxford Nanopore's platform.
Installation
Dependencies
Mintyper requires the following dependencies:
- Python 3.6 or higher
- KMA (https://bitbucket.org/genomicepidemiology/kma/src/master/)
- CCphylo (https://bitbucket.org/genomicepidemiology/ccphylo/src/master/)
Dependencies installation
Conda (works on linux only due to KMA conda package)
git clone https://bitbucket.org/genomicepidemiology/mintyper.git
cd mintyper
conda env create -f environment.yml -n mintyper
conda activate mintyper
Manual installation (Linus and MacOS, requires libz-dev installed)
git clone https://bitbucket.org/genomicepidemiology/mintyper.git
cd mintyper
python3 scripts/install_dependencies.py
Mintyper installation
Using pip
The easiest way to install mintyper is using pip:
pip install mintyper
From source
git clone https://bitbucket.org/genomicepidemiology/mintyper.git
cd mintyper
python3 setup.py install
Database
Mintyper provides two options: Either the user can give a single fasta file as reference sequence by using the -ref argument, or a whole KMA indexed database can be used with the -db argument.
A guide for indexing a given database correctly can be found at https://bitbucket.org/genomicepidemiology/kma/src/master/
A database of complete bacterial genomes can be downloaded with:
wget https://cge.food.dtu.dk/services/MINTyper/bac_db.tar.gz .
Standard usage
mintyper --illumina /path/to/illlumina_files/* --nanopore /path/to/nanopore_files/* --db /path/to/database --output output_folder_name
Output
The output consists of a distance matrix in relaxed phylip format (distmatrix.phy), a list of template candidates (Template_kma_results.spa), the chosen template sequence (template_sequence.fasta), the combined VCF file of all analyzed reads (Combined.vcf.gz), a newick file of the generated phylogenetic tree (outtree.newick), a cluster file (Optional, only outputted if -cluster_length > 0) and a logfile. Additionally, a data folder containing the consensus sequences, the individual VCFs and alignment files is created.
Citation
- Malte B Hallgren, Søren Overballe-Petersen, Ole Lund, Henrik Hasman, Philip T L C Clausen, mintyper: an outbreak-detection method for accurate and rapid SNP typing of clonal clusters with noisy long reads, Biology Methods and Protocols, Volume 6, Issue 1, 2021, bpab008, https://doi.org/10.1093/biomethods/bpab008
Licens
Copyright (c) 2021, Malte Hallgren, Technical University of Denmark All rights reserved.
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
Additional information
As of April 2023 IQtree and FastTree are no longer supported automatically. If you wish to use these programs, please install them manually and use the distance matrix or multiple alignment as input.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file mintyper-1.1.0.tar.gz
.
File metadata
- Download URL: mintyper-1.1.0.tar.gz
- Upload date:
- Size: 8.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d4016baaf718679beb7908d25e912a93ca548404fcc0711e0b024f3fb83ddd70 |
|
MD5 | 5687d896ba3961dc513eee9929da74a0 |
|
BLAKE2b-256 | 3ff2feb8f8b09755ad88fec51136e8d7f3054276dced6119d52d55b195a8d540 |