Skip to main content

Toolkit for cleaning and interpreting multiple sequence alignments

Project description

CI

CIAlign

CIAlign documentation is now available via ReadTheDocs

Summary

CIAlign allows the user to:

Clean

  • Remove sources of noise from an MSA
    • Remove sequences above a threshold level percentage of divergence from the majority.
    • Remove insertions which are not present in the majority of sequences.
    • Crop poorly aligned sequence ends.
    • Remove short sequences below a threshold number of bases or amino acids.
    • Remove columns containing only gaps.
    • Remove either end of an alignment where columns don't meet a minimum identity threshold and coverage level.

Visualise

  • Visualise alignments.
    • Generate image files summarising the alignment.
    • Label these images to show how CIAlign has affected the alignment.
    • Draw sequence logos
    • Plot alignment statistics - visualise coverage and conservation at each position in the alignment.

Interpret

  • Generate consensus sequences.
  • Generate position frequency, position probability and position weight matrices
  • Format these matrices to be used as input for the BLAMM and MEME motif analysis tools.
  • Generate a similarity matrix showing the percentage identity between each sequence pair.

Edit

  • Extract a section of the alignment.
  • Unalign the alignment.
  • Replace U with T, or T with U in a nucleotide alignment.

CIAlign is designed to be highly customisable, allowing users to specify exactly which functions to run and which settings to use.

It is also transparent, generating a clear log file and alignment markup showing exactly how the alignment has changed and what has been removed by which function.

Citation

If you found CIAlign useful, please cite:

Tumescheit C, Firth AE, Brown K. 2022. CIAlign: A highly customisable command line tool to clean, interpret and visualise multiple sequence alignments. PeerJ 10:e12983 https://doi.org/10.7717/peerj.12983

Mailing List

Sign up here for updates when a new feature is added to CIAlign

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cialign-1.1.0.tar.gz (50.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cialign-1.1.0-py3-none-any.whl (61.9 kB view details)

Uploaded Python 3

File details

Details for the file cialign-1.1.0.tar.gz.

File metadata

  • Download URL: cialign-1.1.0.tar.gz
  • Upload date:
  • Size: 50.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.1 pkginfo/1.8.2 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.64.0 CPython/3.7.8

File hashes

Hashes for cialign-1.1.0.tar.gz
Algorithm Hash digest
SHA256 8244cd7402ce7a9eea2bec2ca7954ef58b01408537c962611e60d7eb97a43966
MD5 d35bed4cafa2fecab578866c09ac5a98
BLAKE2b-256 3d0f23ce63b238948af1ee31e5bd55ba7775ab3a499a509a107c9bc2f03777da

See more details on using hashes here.

File details

Details for the file cialign-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: cialign-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 61.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.1 pkginfo/1.8.2 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.64.0 CPython/3.7.8

File hashes

Hashes for cialign-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b542bd20f37b6bffd4e5ed1279ce03acea08a354d982dc02e104e09cfebc5c25
MD5 073a464858e5f04795ea09d31659d267
BLAKE2b-256 353f02639f4776b895528b451a36d754952c8c0a79157b974dbcd7d436ade75c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page