Toolkit for cleaning and interpreting multiple sequence alignments
Project description
CIAlign
CIAlign documentation is now available via ReadTheDocs
Installation
Requirements
- python >= 3.6
- matplotlib >= 2.1.1
- numpy >= 1.16.3
- scipy >= 1.3.0
The easiest way to install CIAlign is using conda or pip3.
Conda
conda install -c bioconda cialign
pip3
pip3 install cialign
Download The current release of CIAlign can also be downloaded directly using this link,
If you download the package directly, you will also need to add the CIAlign directory to your PATH environment variable as described here
Summary
CIAlign allows the user to:
Clean
- Remove sources of noise from an MSA
- Remove sequences above a threshold level percentage of divergence from the majority.
- Remove insertions which are not present in the majority of sequences.
- Crop poorly aligned sequence ends.
- Remove short sequences below a threshold number of bases or amino acids.
- Remove columns containing only gaps.
- Remove either end of an alignment where columns don't meet a minimum identity threshold and coverage level.
Visualise
- Visualise alignments.
- Generate image files summarising the alignment.
- Label these images to show how CIAlign has affected the alignment.
- Draw sequence logos
- Plot alignment statistics - visualise coverage and conservation at each position in the alignment.
Interpret
- Generate consensus sequences.
- Generate position frequency, position probability and position weight matrices
- Format these matrices to be used as input for the BLAMM and MEME motif analysis tools.
- Generate a similarity matrix showing the percentage identity between each sequence pair.
Edit
- Extract a section of the alignment.
- Unalign the alignment.
- Replace U with T, or T with U in a nucleotide alignment.
CIAlign is designed to be highly customisable, allowing users to specify exactly which functions to run and which settings to use.
It is also transparent, generating a clear log file and alignment markup showing exactly how the alignment has changed and what has been removed by which function.
Citation
If you found CIAlign useful, please cite:
Tumescheit C, Firth AE, Brown K. 2022. CIAlign: A highly customisable command line tool to clean, interpret and visualise multiple sequence alignments. PeerJ 10:e12983 https://doi.org/10.7717/peerj.12983
Mailing List
Sign up here for updates when a new feature is added to CIAlign
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file cialign-1.1.4.tar.gz
.
File metadata
- Download URL: cialign-1.1.4.tar.gz
- Upload date:
- Size: 84.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 95e40f956e004e9e482e18249af7803276843df9fce7d81cc11f7fe46c083d1e |
|
MD5 | f1d9b9580800879e0ff9c5b4e032468e |
|
BLAKE2b-256 | 7f08b78769b42f378bd56f3a63a87c49fe94c373e27a405ea18ecc9e8d8391b7 |
File details
Details for the file cialign-1.1.4-py3-none-any.whl
.
File metadata
- Download URL: cialign-1.1.4-py3-none-any.whl
- Upload date:
- Size: 131.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d6e30757a7fc518d0ed326818a1a200cff85b6a2bde59c0bff48b41082a73348 |
|
MD5 | 23c81af4914ed06a8306e128e5357895 |
|
BLAKE2b-256 | 573363d34ff84f3ef9cfea1b64d664eb375ff3d06fc2a4f980fbc609b65c91e9 |