Skip to main content

A tool for drawing ANI clustermap between all-vs-all microbial genomes

Project description

ANIclustermap

Python3 OS License Latest PyPI version Bioconda

Overview

ANIclustermap is easy-to-use tool for drawing ANI(Average Nucleotide Identity) clustermap between all-vs-all microbial genomes. ANI between all-vs-all genomes are calculated by fastANI (or skani) and clustermap is drawn using seaborn.

ANIclustermap.png
Fig1. ANI clustermap between all-vs-all 33 genomes.

ANIclustermap.png
Fig2. ANI clustermap between all-vs-all 18 genomes. If no similarity detected by fastANI, filled in gray.

Installation

Python 3.8 or later is required for installation. fastANI or skani is required to calculate ANI.

Install bioconda package:

conda install -c conda-forge -c bioconda aniclustermap

Install PyPI stable package:

pip install aniclustermap

Workflow

Description of ANIclustermap's automated workflow.

  1. Calculate ANI between all-vs-all microbial genomes by fastANI (or skani).
    If no similarity detected by fastANI, NA is output. In that case, NA is replaced by 0.0.
    If previous result available at the time of re-run, reuse previous result.
  2. Clustering ANI matrix by scipy's UPGMA method.
  3. Using clustered matrix, draw ANI clustermap by seaborn.

Usage

Basic Command

ANIclustermap -i [Genome fasta directory] -o [output directory]

Options

$ ANIclustermap --help
usage: ANIclustermap -i [Genome fasta directory] -o [output directory]

Draw ANI(Average Nucleotide Identity) clustermap

optional arguments:
  -i I, --indir I      Input genome fasta directory (*.fa|*.fna[.gz]|*.fasta)
  -o O, --outdir O     Output directory
  -m , --mode          ANI calculation mode ('fastani'[default]|'skani')
  -t , --thread_num    Thread number parameter (Default: MaxThread - 1)
  --overwrite          Overwrite previous ANI calculation result (Default: OFF)
  --fig_width          Figure width (Default: 10)
  --fig_height         Figure height (Default: 10)
  --dendrogram_ratio   Dendrogram ratio to figsize (Default: 0.15)
  --cmap_colors        cmap interpolation colors parameter (Default: 'lime,yellow,red')
  --cmap_gamma         cmap gamma parameter (Default: 1.0)
  --cmap_ranges        Range values (e.g. 80,90,95,100) for discrete cmap (Default: None)
  --cbar_pos           Colorbar position (Default: (0.02, 0.8, 0.05, 0.18))
  --annotation         Show ANI value annotation (Default: OFF)
  --annotation_fmt     Annotation value format (Default: '.3g')
  -v, --version        Print version information
  -h, --help           Show this help message and exit

Example Command

7 genomes minimal dataset. Click here to download dataset (Size=3.6MB).

ANIclustermap -i ./minimal_dataset/ -o ./ANIclustermap_result

Output Contents

ANIclustermap outputs 3 types of files.

  • ANIclustermap.[png|svg] (example1, example2)
    ANI clustermap result figure.

  • ANIclustermap_matrix.tsv (example)
    Clustered all-vs-all ANI matrix.

  • ANIclustermap_dendrogram.nwk (example)
    Newick format clustering dendrogram.

Gallery

Example gallery of 33 genomes normal dataset.
If you want to try it for yourself, click here to donwload dataset (Size=63.5MB).

Normal parameter:

ANIclustermap -i ./normal_dataset -o ./ANIclustermap_result \
              --fig_width 15

ANIclustermap.png

Change cmap_gamma parameter:

ANIclustermap -i ./normal_dataset -o ./ANIclustermap_result \ 
              --fig_width 15 --cmap_gamma 0.5

ANIclustermap.png

Change cmap_colors(=white,orange,red) paramter:

ANIclustermap -i ./normal_dataset -o ./ANIclustermap_result \ 
              --fig_width 15 --cmap_colors white,orange,red

ANIclustermap.png

Change cmap_ranges paramter:

ANIclustermap -i ./normal_dataset -o ./ANIclustermap_result \ 
              --fig_width 15 --cmap_ranges 80,85,90,92.5,95,97.5,100

See this issue for more details.

ANIclustermap.png

Add ANI value annotation parameter:

ANIclustermap -i ./normal_dataset -o ./ANIclustermap_result \ 
              --fig_width 20 --fig_height 15 --annotation

ANIclustermap.png

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aniclustermap-1.4.0.tar.gz (7.3 MB view details)

Uploaded Source

Built Distribution

aniclustermap-1.4.0-py3-none-any.whl (291.1 kB view details)

Uploaded Python 3

File details

Details for the file aniclustermap-1.4.0.tar.gz.

File metadata

  • Download URL: aniclustermap-1.4.0.tar.gz
  • Upload date:
  • Size: 7.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.12.4 Linux/6.5.0-1023-azure

File hashes

Hashes for aniclustermap-1.4.0.tar.gz
Algorithm Hash digest
SHA256 c0542543ee14801d448ad9aff1abc4d7c225f94dcae21601e7c4f93bf1b63f26
MD5 afd841096f180d3f94b4b4b4345c9ff7
BLAKE2b-256 d8dcbde99ebe1cd8633d513f6e551995583953056312c5e4410a84821dcfcd0f

See more details on using hashes here.

File details

Details for the file aniclustermap-1.4.0-py3-none-any.whl.

File metadata

  • Download URL: aniclustermap-1.4.0-py3-none-any.whl
  • Upload date:
  • Size: 291.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.12.4 Linux/6.5.0-1023-azure

File hashes

Hashes for aniclustermap-1.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 25f0731cc511d251c7bda28a3b501e15efd12ac0e73b789b1c32196016b7cb9e
MD5 96e1ea69e681f6354ebe306a1502f113
BLAKE2b-256 e98b87e95abcd0e486086c4bb4c575a5666f938f59a01987b267c9b877cab8f0

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page