Skip to main content

PanPhlAn is a strain-level metagenomic profiling tool for identifying the gene composition and *in-vivo* transcriptional activity of individual strains in metagenomic samples. PanPhlAn’s ability for strain-tracking and functional analysis of unknown pathogens makes it an efficient tool for culture-free infectious outbreak epidemiology and microbial population studies.

Project description

README

PanPhlAn 3 - strain detection and characterization

Pangenome-based Phylogenomic Analysis

PanPhlAn is a strain-level metagenomic profiling tool for identifying the gene composition of individual strains in metagenomic samples. PanPhlAn’s ability for strain-tracking and functional analysis of unknown pathogens makes it an efficient tool for culture-free microbial population studies.

PanPhlAn is written in Python and covers the 4 main tasks:

  • panphlan_download_pangenome.py, to download pangenome files (fasta, BowTie2 indexes and general information) for over 3,000 species
  • panphlan_map.py, to profile each metagenomic sample by mapping it against the species of interest
  • panphlan_profile.py, to merge and process the mapping results in order to get the final gene presence/absence matrix
  • panphlan_find_gene_grp.py, organise OPTICS clustering to find some group of gene with similar profile and assess if they could be mobile elements in the genome. Also plot the presence/absence matrix as Heatmap.

PanPhlAn runs under Ubuntu/Linux and requires the following software tools to be installed on your system:

  • Bowtie2
  • Samtools
  • Python 3

And the following Python libraries:

  • numpy
  • pandas
  • scipy
  • sklearn (only if using panphlan_find_gene_grp.py)
    If visualizations are made, one also needs :
  • matplotlib
  • seaborn

For any help see the wiki or the bioBakery forum


[PanPhlAn] is a project of the Computational Metagenomics Lab at CIBIO, University of Trento, Italy

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

panphlan-3.0.tar.gz (21.8 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page