Skip to main content

Kalgin-based codon-aware aligner for multiple sequences

Project description

Kc-Align

Kc-Align is a codon-aware multiple aligner that uses Kalgin2 to produce in-frame gapped codon alignments for selection analysis of small genomes (mostly viral and some smaller bacterial genomes). Takes nucleotide seqeunces as inputs, converts them to their in-frame amino acid sequences, performs multiple alignment with Kalign, and then converts the alignments back to their original codon sequence while preserving the gaps. Produces two outputs: the gapped nucleotide alignments in FASTA format and in CLUSTAL format.

Kc-Align will also attempt to detect any frameshift mutations in the input reads. If a frameshift is detected, that sequence will not be included in the multiple alignment and its ID will be printed to stdout.

Modes

Kc-Align can be run in three different modes, depending on your input data.

In "genome" mode, the "reference" and "reads" input parameters are all full genome FASTA files. This mode also requires the 1-based start and end position numbers corresponding to the gene you are interested in aligning from the reference input.

If both the "reference" and "reads" inputs are already in-frame genes, the "gene" mode should be used. This mode does not require start and end position parameters as the reference is already in-frame.

For the case when your "reference" is an in-frame gene while the "reads" are whole genomes, the "mixed" mode can be used. Like gene mode, this mode does not require the start and end point position parameters.

USAGE:

kc-align --mode genome --reference [reference FASTA] --reads [reads FASTA] --start [start] --end [end]

kc-align --mode gene --reference [reference FASTA] --reads [reads FASTA]

kc-align --mode mixed --reference [reference FASTA] --reads [reads FASTA]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kcalign-0.5.2.tar.gz (9.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

kcalign-0.5.2-py3-none-any.whl (10.2 kB view details)

Uploaded Python 3

File details

Details for the file kcalign-0.5.2.tar.gz.

File metadata

  • Download URL: kcalign-0.5.2.tar.gz
  • Upload date:
  • Size: 9.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2.post20191201 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.3

File hashes

Hashes for kcalign-0.5.2.tar.gz
Algorithm Hash digest
SHA256 05516fb5623c9c475f888f5d6b904e9d1607cf8ac6264ac65d661ced7733ea32
MD5 fafa022984154268a2ba497621bb5c1b
BLAKE2b-256 9034b061b1bb50eea36945a11cc882309831c1b59f15518bdc166d0b2b0fb133

See more details on using hashes here.

File details

Details for the file kcalign-0.5.2-py3-none-any.whl.

File metadata

  • Download URL: kcalign-0.5.2-py3-none-any.whl
  • Upload date:
  • Size: 10.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2.post20191201 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.3

File hashes

Hashes for kcalign-0.5.2-py3-none-any.whl
Algorithm Hash digest
SHA256 968f3ad19c4263cc255edaf2b37dd160e68524640cb98a63b38db1806cf089cb
MD5 df13570330ebb6d3f35bc1bb25a6682b
BLAKE2b-256 f7eb603e1c7515d88757658ffce6ac1f0ae7f3a66332059e4c2e028de4cf888b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page