I/O and utilities for the Consensus Multiple Alignment (CMA) file format.

Project description

Consensus Multiple Alignment format, using Biopython alignments.

I/O support and relevant functionality for the Consensus Multiple Alignment (CMA) format. This format represents protein sequence alignments. It is used by a few tools written by Dr. Andrew F. Neuwald, notably CHAIN and MAPGAPS.

Biopython objects and conventions are used where possible.

Installation

This is an ordinary Python package. You can install it from source with the setup.py script:

python setup.py build
python setup.py install

Stable releases are uploaded to PyPI as well, so you can install biocma with Python package managers:

pip install biocma

Or:

easy_install biocma

Some scripts that I find useful are in the scripts/ directory. They are not installed by default, but you can include them by uncommenting the line in setup.py that starts with scripts=glob.

Alternatively, you can just copy those scripts into another directory in your $PATH.

What can the CMA format do for me?

  • Like the A2M and Stockholm formats, alignments are shown with insertions as lowercase characters and deletions as dashes.

  • Like the A3M format, alignments are pairwise versus a profile (or “consensus” sequence), which also dictates which sites are indels. By compressing the insert columns, a large alignment of many divergent (but related) sequences can be shown without filling it with mostly gap characters, as Stockholm does.

  • Like Stockholm, but unlike A2M and A3M, more than one alignment can be contained in a single file.

  • Typically, an ungapped consensus sequence will be included as the first sequence.

  • A FASTA-like header contains additional, optional fields for the number of leading and trailing sites and NCBI taxonomy codes.
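The case and dash conventions described above can be illustrated with a minimal Python sketch. It is not the biocma API and does not parse real CMA files; the function names strip_inserts and count_states and the toy sequences are made up for illustration. It assumes uppercase letters are residues aligned to consensus columns, lowercase letters are insertions, and dashes are deletions.

```python
def strip_inserts(row):
    """Return only the consensus-aligned columns of one alignment row.

    Uppercase residues and '-' deletions occupy consensus columns;
    lowercase residues are insertions and are dropped.
    """
    return "".join(ch for ch in row if ch.isupper() or ch == "-")


def count_states(row):
    """Count match, insert, and delete characters in one alignment row."""
    counts = {"match": 0, "insert": 0, "delete": 0}
    for ch in row:
        if ch == "-":
            counts["delete"] += 1
        elif ch.islower():
            counts["insert"] += 1
        elif ch.isupper():
            counts["match"] += 1
    return counts


# Hypothetical toy data: an ungapped consensus and two rows aligned to it.
consensus = "MKVLA"
rows = ["MKtlV-A", "-KVLAgg"]

for row in rows:
    aligned = strip_inserts(row)
    # Once inserts are removed, every row spans exactly the consensus columns.
    assert len(aligned) == len(consensus)
    print(aligned, count_states(row))
```

Because each row is stored relative to the consensus, insertions in one sequence never force gap characters into the others, which is why the rows above can have different raw lengths.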

Who uses CMA?

The CMA format appears to have been invented by Dr. Andrew Neuwald at the University of Maryland, and is used in his programs, notably CHAIN and MAPGAPS.

Should I use CMA in my own work?

Unless you’re working with MAPGAPS or CHAIN, no.


