Skip to main content

Generated color schemes for sequence alignment visualizations

Project description

Gecos - Generated Color Schemes for sequence alignment visualizations

Multiple sequence alignments are often visualized by coloring the symbols according to some kind of properties. For example a color scheme for amino acids could use one color for hydrophobic residues, another color for positively charged residues and so forth. Usually, such color schemes are manually created by experienced people who have knowledge about the characteristics of the e.g. amino acids, so they can assign equal or similar colors to amino acids that share similar properties.

The Gecos software follows a different approach: Instead of looking at specific, sometimes subjective properties, it uses another source for estimating the similarity of symbols: the substitution matrix itself. Similar colors are assigned to high scoring pairs of symbols, low scoring pairs get distant colors - in a completely automatic manner. As a result the distance of two symbols in the substitution matrix corresponds to the perceptual differences in the color scheme.

How about an example? The following command line invocation creates a light color scheme. An example alignment using the newly generated color scheme is displayed below.

$ gecos --matrix BLOSUM62 --lmin 60 --lmax 75 -f awesome_colors.json
https://raw.githubusercontent.com/biotite-dev/gecos/master/doc/static/assets/figures/main_example_alignment.png

Installation

In order to use Gecos you need to have Python (at least 3.6) installed. Furthermore, the following Python packages are required:

  • biotite

  • numpy

  • matplotlib

  • scikit-image

If these prerequisites are met, Gecos is simply installed via

$ pip install gecos

Alternatively, Gecos can be installed via Conda:

$ conda install -c conda-forge gecos

Citation

If you use Gecos in a scientific publication, please cite:


P. Kunzmann, B. E. Mayer, K. Hamacher, “Substitution matrix based color schemes for sequence alignment visualization,” BMC Bioinformatics, vol. 21, pp. 209, 2020. doi: 10.1186/s12859-020-3526-6

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gecos-2.0.0.tar.gz (49.2 kB view details)

Uploaded Source

Built Distribution

gecos-2.0.0-py3-none-any.whl (58.5 kB view details)

Uploaded Python 3

File details

Details for the file gecos-2.0.0.tar.gz.

File metadata

  • Download URL: gecos-2.0.0.tar.gz
  • Upload date:
  • Size: 49.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.11.3 pkginfo/1.8.2 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.64.0 CPython/3.9.12

File hashes

Hashes for gecos-2.0.0.tar.gz
Algorithm Hash digest
SHA256 1b2e2b7be3f4977a191e5caf664307f5a463e73bdd1f22c175d8bafd4472a3fe
MD5 330ded9ffbe4df5aa8f6e5274dd45168
BLAKE2b-256 eb12a3cc14b604f495702e5c15144aa7a47838f737c22a92fdee88735c700c42

See more details on using hashes here.

File details

Details for the file gecos-2.0.0-py3-none-any.whl.

File metadata

  • Download URL: gecos-2.0.0-py3-none-any.whl
  • Upload date:
  • Size: 58.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.11.3 pkginfo/1.8.2 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.64.0 CPython/3.9.12

File hashes

Hashes for gecos-2.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d3866ec738f9f6ae01ed3f18a5bef92038e6826d01e2c30d67fec2e03ce7937f
MD5 8ce58232c9c48158ce1d6874f4d4b5ab
BLAKE2b-256 535412333e741a8767e26a95365bc6aa8b5b075d6aa505e5cb8a68d75da08fff

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page