Skip to main content

spora: Streamlined Phylogenomic Outbreak Report Analysis

Project description

spora: Streamlined Phylogenomic Outbreak Report Analysis

example workflow

snakemake and Python integrated workflow for intermediate file generation for COVID outbreak analysis

Installation

git clone https://github.com/matt-sd-watson/spora.git
conda env create -f ncov_spora/environments/environment.yml
conda activate ncov_spora
cd spora
pip install . 

Updating

conda activate ncov_spora
cd ~/spora
git checkout main
git pull
pip install . 

Usage

usage: 
    	spora -c <config.yaml> 
    	OR
    	spora --focal_list ...<input args>

spora: Streamlined Phylogenomic Outbreak Report Analysis

optional arguments:
  -h, --help            Show the help output and exit.
  -c CONFIG, --config CONFIG
                        Input config file in yaml format, all command line arguments can be passed via the config file.
  -f FOCAL_SEQS, --focal-sequences FOCAL_SEQS
                        Input .txt list or multi-FASTA focal samples for outbreak. Required
  -b BACKGROUND_SEQS, --background-sequences BACKGROUND_SEQS
                        Optional input .txt list or multi-FASTA background samples to add to analysis
  -m MASTER_FASTA, --master-fasta MASTER_FASTA
                        Master FASTA of genomic sequences to select from. Required if either --focal-sequences or --background-sequences are not supplied in FASTA format
  -o OUTDIR, --output-directory OUTDIR
                        Path to the desired output directory. If none is provided, a new folder named spora will be created in the current directory
  -r REFERENCE, --reference REFERENCE
                        .gb file containing the desired COVID-19 reference sequence. Required
  -p PREFIX, --prefix PREFIX
                        Prefix string to label all output files. Default: outbreak
  -t NTHREADS, --nthreads NTHREADS
                        Number of threads to use for processing. Default: 2
  -s, --snps-only       Generate a snps-only FASTA from the input FASTA. Default: False
  -rn, --rename         Rename the FASTA headers to be compatible with NML standards. Default: False
  -nc NAMES_CSV, --names-csv NAMES_CSV
                        Use the contents of a CSV to rename the input FASTA. Requires the following column headers: original_name, new_name
  -ncs, --no-constant-sites
                        Do not enable constant sites to be used for SNPs only tree generation. Default: Enabled
  -fi, --filter         Filter both the focal and background sequences based on genome completeness and length. Default: Not enabled
  -gc GENOME_COMPLETENESS, --genome-completeness GENOME_COMPLETENESS
                        Integer for the minimum genome completeness percentage for filtering. Default: 90
  -gl GENOME_LENGTH, --genome-length GENOME_LENGTH
                        Integer for the minimum genome length for filtering. Default: 29500
  -rp, --report         Generate a summary output report for the spora run. Default: Not enabled
  -v, --version         Show the current spora version then exit.

Documentation

More detailed documentation for spora usage and functionality can be found here

Acknowledgments

Inspiration for code structure and design for spora was inspired by pangolin and civet, and minor code blocks were adopted from these software.

The Background section in the documentation describing outbreak definitions was written by Mark Horsman.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spora-0.7.0.tar.gz (26.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

spora-0.7.0-py3-none-any.whl (34.6 kB view details)

Uploaded Python 3

File details

Details for the file spora-0.7.0.tar.gz.

File metadata

  • Download URL: spora-0.7.0.tar.gz
  • Upload date:
  • Size: 26.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for spora-0.7.0.tar.gz
Algorithm Hash digest
SHA256 3188d49b0ff9301c33881bafa6356ac7e9a15938aab8177025670d695eaf5692
MD5 9a9f18012d7ab795d8e7eea9cd995eb5
BLAKE2b-256 fb1b720b17264db563b593eb65d9dc3d45af655e9fd6394b754b44dec673417f

See more details on using hashes here.

File details

Details for the file spora-0.7.0-py3-none-any.whl.

File metadata

  • Download URL: spora-0.7.0-py3-none-any.whl
  • Upload date:
  • Size: 34.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for spora-0.7.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f53d8f3b84adb21f32bbb0fe87601b1688009ee8c4e82df7f84c3d1d674ecb19
MD5 ae21ccf6fd8134d40043d22ac22b4a0f
BLAKE2b-256 e2c0f28e6a25f698c6e2a7e0f5bd03d8bd0fa0f37691559585317561cdeafdc8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page