
tools for comparing DNA sequences with MinHash sketches

Project description

sourmash

License: 3-Clause BSD


Compute MinHash signatures for nucleotide (DNA/RNA) and protein sequences.
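To illustrate the idea behind a MinHash signature, here is a minimal bottom-k sketch in plain Python. It is a conceptual sketch only, not sourmash's implementation: the function names, the SHA-256 stand-in hash, and the small default values for ksize and num are all assumptions chosen for the example.

```python
import hashlib


def kmers(seq, ksize):
    """Yield every overlapping k-mer of length ksize in seq."""
    seq = seq.upper()
    for i in range(len(seq) - ksize + 1):
        yield seq[i:i + ksize]


def hash_kmer(kmer):
    """Map a k-mer to a 64-bit integer (stand-in hash, an assumption)."""
    digest = hashlib.sha256(kmer.encode("ascii")).digest()
    return int.from_bytes(digest[:8], "big")


def sketch(seq, ksize=5, num=20):
    """Bottom-k MinHash sketch: the num smallest distinct k-mer hashes."""
    return sorted({hash_kmer(k) for k in kmers(seq, ksize)})[:num]


def jaccard_estimate(a, b, num=20):
    """Estimate Jaccard similarity from two bottom-k sketches."""
    # Take the num smallest hashes of the combined sketches, then count
    # how many of them appear in both sketches.
    merged = sorted(set(a) | set(b))[:num]
    if not merged:
        return 0.0
    shared = set(a) & set(b)
    return sum(1 for h in merged if h in shared) / len(merged)


first = sketch("ACGTACGTACGTTTGACCA")
second = sketch("ACGTACGTACGTTTGACCA")
print(jaccard_estimate(first, second))  # identical inputs give 1.0
```

Because only the smallest hashes are kept, two sketches can be compared without revisiting the original sequences, which is what makes signatures cheap to store and compare.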

Usage:

sourmash compute *.fq.gz
sourmash compare *.sig -o distances
sourmash plot distances
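Conceptually, the compare step turns a set of signatures into a matrix of pairwise similarities. The following is a rough sketch of that idea, assuming each signature is just a Python set of integer hash values; the names jaccard and compare_all and the toy data are hypothetical and do not reflect sourmash's file formats or internal API.

```python
def jaccard(a, b):
    """Exact Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def compare_all(signatures):
    """Return (labels, matrix) where matrix[i][j] compares labels i and j."""
    labels = sorted(signatures)
    matrix = [
        [jaccard(signatures[x], signatures[y]) for y in labels]
        for x in labels
    ]
    return labels, matrix


# Toy hash sets standing in for real signatures (made-up values).
signatures = {
    "sample_a": {1, 2, 3, 4},
    "sample_b": {3, 4, 5, 6},
    "sample_c": {7, 8},
}

labels, matrix = compare_all(signatures)
for label, row in zip(labels, matrix):
    print(label, ["%.2f" % value for value in row])
```

The resulting matrix is symmetric with 1.0 on the diagonal, which is the kind of input a clustering or plotting step can consume.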

sourmash 1.0 is published in JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027).


The name is a riff on Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash is a product of the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine.

Installation

We recommend using bioconda to install sourmash:

conda install -c conda-forge -c bioconda sourmash

This will install the latest stable version of sourmash 3.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under both Python 2.7.x and Python 3.5+. The base requirements are screed and ijson, together with a Rust environment (for the extension code). We suggest using rustup to install the Rust environment:

curl https://sh.rustup.rs -sSf | sh

The comparison code (sourmash compare) uses numpy, and the plotting code uses matplotlib and scipy, but most of the code is usable without these.

For search and gather you also need khmer version 2.1+.
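Since several dependencies are only needed for particular subcommands, it can help to check what is importable before running them. The snippet below is a small sketch using the Python 3 standard library; the OPTIONAL mapping and the helper names are assumptions made for illustration, built from the requirements listed above, and it does not check version numbers such as khmer 2.1+.

```python
import importlib.util

# Features and the modules they rely on, as described in the text above.
OPTIONAL = {
    "compare": ["numpy"],
    "plot": ["matplotlib", "scipy"],
    "search/gather": ["khmer"],
}


def missing_modules(names):
    """Return the module names from names that cannot be located."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def report(optional=OPTIONAL):
    """Map each feature to the list of its modules that are not installed."""
    return {feature: missing_modules(mods) for feature, mods in optional.items()}


for feature, missing in report().items():
    status = "ok" if not missing else "missing: " + ", ".join(missing)
    print(feature, "->", status)
```

importlib.util.find_spec only locates a top-level module without importing it, so the check is cheap and has no side effects from the packages themselves.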

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda, add the bioconda channel as well as the other channels bioconda depends on. Once you have set up bioconda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge -c bioconda sourmash python=3.7
$ source activate sourmash_env
$ sourmash compute -h

which will install the latest alpha release.

Support

Please ask questions and file issues on GitHub.

Development

Development happens on GitHub at dib-lab/sourmash.

After installation, sourmash is the main command-line entry point. You can also run it with python -m sourmash, or use pip install -e /path/to/repo for a developer install in a virtual environment.

The sourmash/ directory contains the Python library and command-line interface code.

The src/core/ directory contains the Rust library implementing core functionality.

Tests require py.test and can be run with make test.

Please see the developer notes for more information.


CTB Jan 2020



