Skip to main content
Donate to the Python Software Foundation or Purchase a PyCharm License to Benefit the PSF! Donate Now

Remove outlier sequences from multiple sequence alignment

Project description

pysickle.py will try to remove sequences that cause misalignments from a multiple sequence alignment (MSA). It reads a given MSA in multi-fasta format and removes sequences with the highest penalty scores, then builds the next MSA without those sequences. This process is repeated until a user-specified cuttoff is reached or less than three sequences are left to be aligned.

Usage:

######################################
# pysickle.py
######################################
usage:
    pysickle.py -f multifasta alignment
options:
    -f, --fasta=FILE    multifasta alignment (eg "align.fas")
    OR
    -F, --fasta_dir=DIR directory with multifasta files (needs -s SUFFIX)
    -s, --suffix=SUFFIX will try to work with files that end with SUFFIX
                        (eg ".fas")

    -a, --msa_tool=STR  supported: "mafft" [default:"mafft"]
    -i, --max_iterations=NUM    force stop after NUM iterations
    -n, --num_threads=NUM   max number of threads to be executed in
                            parallel [default: 1]

    -h, --help      prints this

Currently supported multiple sequence aligners:

  • mafft (Katoh, Standley 2013 (Molecular Biology and Evolution 30:772-780) MAFFT multiple sequence alignment software version 7: improvements in performance and usability. http://mafft.cbrc.jp/alignment/software/)

Requirements

  • matplotlib
  • numpy

External Programs

  • mafft

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
pysickle-0.1.1-py2.7.egg (24.1 kB) Copy SHA256 hash SHA256 Egg 2.7
pysickle-0.1.1.tar.gz (20.6 kB) Copy SHA256 hash SHA256 Source None

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page