Skip to main content

A simple BLOSUM toolbox without dependencies.

Project description

blosum logo

DOI Citation Badge Test Badge Code style: black
Download Badge Python Version Badge install with bioconda

The BLOcks SUbstitution Matrices (BLOSUM) are used to score alignments between protein sequences and are therefore mainly used in bioinformatics.

Reading such matrices is not particularly difficult, yet most off the shelf packages are overloaded with strange dependencies. And why do we need to implement the same reader again if there is a simple module for that.

blosum offers a robust and easy-to-expand implementation without relying on third-party libraries.

Installation

Using pip / pip3:

pip install blosum

Or by source:

git clone git@github.com:not-a-feature/blosum.git
cd blosum
pip install .

Or by conda:

conda install blosum

How to use

Default BLOSUM

This package provides the most commonly used BLOSUM matrices. You can choose from BLOSUM 45, 50, 62, 80 and 90.

To load a matrix:

import blosum as bl
matrix = bl.BLOSUM(62)
val = matrix["A"]["Y"]

Custom matrix

In addition, own matrices can be loaded. For this, the path is given as an argument.

import blosum as bl
matrix = bl.BLOSUM("path/to/blosum.file")
val = matrix["A"]["Y"]

The matrices are required to have following format:

# Comments should start with #
# Each value should be seperated by one or many whitespace
   A  R  N  D
A  5 -2 -1 -2
R -2  7  0 -1
N -1  0  6  2
D -2 -1  2  7

Getting values:

Once loaded the matrix behaves like a defaultdict. To get a value use:

val = matrix["A"]["Y"]

To get a defaultdict of the row with a given key use:

val_dict = matrix["A"]

If the key cannot be found, the default value float("-inf") is returned. It is possible to set a custom default score:

matrix = bl.BLOSUM(62, default=0)

License

Copyright (C) 2023 by Jules Kreuer - @not_a_feature

This piece of software is published unter the GNU General Public License v3.0 TLDR:

Permissions Conditions Limitations
✓ Commercial use Disclose source ✕ Liability
✓ Distribution License and copyright notice ✕ Warranty
✓ Modification Same license
✓ Patent use State changes
✓ Private use

Go to LICENSE.md to see the full version.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

blosum-2.2.0.tar.gz (24.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

blosum-2.2.0-py3-none-any.whl (21.5 kB view details)

Uploaded Python 3

File details

Details for the file blosum-2.2.0.tar.gz.

File metadata

  • Download URL: blosum-2.2.0.tar.gz
  • Upload date:
  • Size: 24.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for blosum-2.2.0.tar.gz
Algorithm Hash digest
SHA256 6200425b534c3b82d2dca303bbc685274e6adf34f9a257671f85291b20766c35
MD5 950d36699b0090246b3adda682f73171
BLAKE2b-256 461cfbded27d453242e4fb15bb327dbe4d828b0ff17b415d46dafea25f19fc8f

See more details on using hashes here.

File details

Details for the file blosum-2.2.0-py3-none-any.whl.

File metadata

  • Download URL: blosum-2.2.0-py3-none-any.whl
  • Upload date:
  • Size: 21.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for blosum-2.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 707b89fe268b8d1f81a7a3aca600f4eb6d01215f46ba233f4ea185a557e226bd
MD5 b9d3a99729e85cd1c420c1e2b37980c0
BLAKE2b-256 f9dd3bb3adebd4fca4e16072510fd54883075b4571bb84a37f5475965895bca7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page