Skip to main content

Utitilies for constructing and manipulating models for non-local structural dependencies in genomic sequences

Project description

Quasinet

quasinet PyPI Downloads

PyPI version

Description

Infer non-local structural dependencies in genomic sequences. Genomic sequences are esentially compressed encodings of phenotypic information. This package provides a novel set of tools to extract long-range structural dependencies in genotypic data that define the phenotypic outcomes. The key capabilities implemented here are as follows:

  1. Compute the Quasinet (Q-net) given a database of nucleic acid sequences. The Q-net is a family of conditional inference trees that capture the predictability of each nucleotide position given the rest of the genome. The constructed Q-net for COVID-19 and Influenza A H1N1 HA 2008-9 is shown below.
COVID-19 INFLUENZA
  1. Compute a structure-aware evolution-adaptive notion of distance between genomes, which is demonstrably more biologically relevant compared to the standard edit distance.

  2. Draw samples in-silico that have a high probability of being biologically correct. For example, given a database of Influenza sequences, we can generate a new genomic sequence that has a high probability of being a valid influenza sequence.

Installation

To install with pip:

pip install quasinet

Dependencies

  • scikit-learn
  • scipy
  • numpy
  • numba
  • pandas
  • joblib
  • biopython

Usage

from quasinet import qnet

# initialize qnet
myqnet = qnet.Qnet()

# train the qnet
myqnet.fit(X)

# compute qdistance
qdist = qnet.qdistance(seq1, seq2, myqnet, myqnet) 

Examples

Examples are located here.

Authors

You can reach the ZED lab at: zed.uchicago.edu

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

quasinet-0.0.52.tar.gz (14.4 MB view details)

Uploaded Source

Built Distribution

quasinet-0.0.52-py3-none-any.whl (15.1 MB view details)

Uploaded Python 3

File details

Details for the file quasinet-0.0.52.tar.gz.

File metadata

  • Download URL: quasinet-0.0.52.tar.gz
  • Upload date:
  • Size: 14.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.1.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.7.7

File hashes

Hashes for quasinet-0.0.52.tar.gz
Algorithm Hash digest
SHA256 856248094794fadfc9ed6b2c5996e1c7fcc3fe7943c93cec44c90f8d514bcfe7
MD5 0619d205f3d7b1bfdce4a34d60bfae7c
BLAKE2b-256 ed29f83439f03eb3e5f400f91c4370bf34ab6244812ebd5758fef7c8ceab1784

See more details on using hashes here.

File details

Details for the file quasinet-0.0.52-py3-none-any.whl.

File metadata

  • Download URL: quasinet-0.0.52-py3-none-any.whl
  • Upload date:
  • Size: 15.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.1.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.7.7

File hashes

Hashes for quasinet-0.0.52-py3-none-any.whl
Algorithm Hash digest
SHA256 df886aa8d11b443d7f0f5821bab8e87e6295cfbdb39abbc138276232dd6cbcd0
MD5 fdc26bb3d89fd7bfc2159c0e46d4dca2
BLAKE2b-256 c99c74dea5eeed362783f78e18cfed66818a79850e7d893f12bef539afc458e6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page