Skip to main content

Toolkit for cleaning and interpreting multiple sequence alignments

Project description

CI

CIAlign

CIAlign documentation is now available via ReadTheDocs

Installation

Requirements

  • python >= 3.6
  • matplotlib >= 2.1.1
  • numpy >= 1.16.3
  • scipy >= 1.3.0

The easiest way to install CIAlign is using conda or pip3.

Conda

conda install -c bioconda cialign

link

pip3 pip3 install cialign

link

Download The current release of CIAlign can also be downloaded directly using this link,

If you download the package directly, you will also need to add the CIAlign directory to your PATH environment variable as described here

Summary

CIAlign allows the user to:

Clean

  • Remove sources of noise from an MSA
    • Remove sequences above a threshold level percentage of divergence from the majority.
    • Remove insertions which are not present in the majority of sequences.
    • Crop poorly aligned sequence ends.
    • Remove short sequences below a threshold number of bases or amino acids.
    • Remove columns containing only gaps.
    • Remove either end of an alignment where columns don't meet a minimum identity threshold and coverage level.

Visualise

  • Visualise alignments.
    • Generate image files summarising the alignment.
    • Label these images to show how CIAlign has affected the alignment.
    • Draw sequence logos
    • Plot alignment statistics - visualise coverage and conservation at each position in the alignment.

Interpret

  • Generate consensus sequences.
  • Generate position frequency, position probability and position weight matrices
  • Format these matrices to be used as input for the BLAMM and MEME motif analysis tools.
  • Generate a similarity matrix showing the percentage identity between each sequence pair.

Edit

  • Extract a section of the alignment.
  • Unalign the alignment.
  • Replace U with T, or T with U in a nucleotide alignment.

CIAlign is designed to be highly customisable, allowing users to specify exactly which functions to run and which settings to use.

It is also transparent, generating a clear log file and alignment markup showing exactly how the alignment has changed and what has been removed by which function.

Citation

If you found CIAlign useful, please cite:

Tumescheit C, Firth AE, Brown K. 2022. CIAlign: A highly customisable command line tool to clean, interpret and visualise multiple sequence alignments. PeerJ 10:e12983 https://doi.org/10.7717/peerj.12983

Mailing List

Sign up here for updates when a new feature is added to CIAlign

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cialign-1.1.4.tar.gz (84.0 kB view details)

Uploaded Source

Built Distribution

cialign-1.1.4-py3-none-any.whl (131.7 kB view details)

Uploaded Python 3

File details

Details for the file cialign-1.1.4.tar.gz.

File metadata

  • Download URL: cialign-1.1.4.tar.gz
  • Upload date:
  • Size: 84.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.0

File hashes

Hashes for cialign-1.1.4.tar.gz
Algorithm Hash digest
SHA256 95e40f956e004e9e482e18249af7803276843df9fce7d81cc11f7fe46c083d1e
MD5 f1d9b9580800879e0ff9c5b4e032468e
BLAKE2b-256 7f08b78769b42f378bd56f3a63a87c49fe94c373e27a405ea18ecc9e8d8391b7

See more details on using hashes here.

File details

Details for the file cialign-1.1.4-py3-none-any.whl.

File metadata

  • Download URL: cialign-1.1.4-py3-none-any.whl
  • Upload date:
  • Size: 131.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.0

File hashes

Hashes for cialign-1.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 d6e30757a7fc518d0ed326818a1a200cff85b6a2bde59c0bff48b41082a73348
MD5 23c81af4914ed06a8306e128e5357895
BLAKE2b-256 573363d34ff84f3ef9cfea1b64d664eb375ff3d06fc2a4f980fbc609b65c91e9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page