Skip to main content

Hash-based phonemic sequence identifiers

Project description

Konstel(lations)

Tests PyPI

Not yet stable, proceed with caution

An extensible command line tool and library for generating memorable and pronounceable hash-based identifier schemes for sequences, biological or otherwise. For further details and my SARS-CoV-2 naming proposal, please read my blog post.

Install

# Python >= 3.6
pip install konstel

# Latest
pip install git+https://github.com/bede/konstel

Usage

Command line

$ konstel gen sars-cov-2-s.genome konstel/tests/data/spike2.genome.fa --output table
scheme               sars-cov-2-s   
hash                 S:w80qgz2k1fdds6x4mknxazm7psed5knd
hash-4               S:w80q         
id                   S:gofabil  

$ echo "ACGT" | konstel gen generic.nucl - --output table
scheme               generic        
hash                 4449jkgqyv6akzs3aaptjav527dger1m
id                   bodafanoja      

Python

>>> from konstel import konstel
>>> konstel.generate('sars-cov-2-s.protein', 'konstel/tests/data/spike.prot.fa')
{"scheme": "sars-cov-2-s", "hash": "S:c52gdyc7v1dfznny48hfkn1g8ax2yx8f", "hash-4": "S:c52g", "id": "S:dodidib"}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

konstel-0.7.1.tar.gz (8.7 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page