Utitilies for constructing and manipulating models for non-local structural dependencies in genomic sequences
Project description
Quasinet
Description
Infer non-local structural dependencies in genomic sequences. Genomic sequences are esentially compressed encodings of phenotypic information. This package provides a novel set of tools to extract long-range structural dependencies in genotypic data that define the phenotypic outcomes. The key capabilities implemented here are as follows:
- Compute the Quasinet (Q-net) given a database of nucleic acid sequences. The Q-net is a family of conditional inference trees that capture the predictability of each nucleotide position given the rest of the genome. The constructed Q-net for COVID-19 and Influenza A H1N1 HA 2008-9 is shown below.
COVID-19 | INFLUENZA |
---|---|
-
Compute a structure-aware evolution-adaptive notion of distance between genomes, which is demonstrably more biologically relevant compared to the standard edit distance.
-
Draw samples in-silico that have a high probability of being biologically correct. For example, given a database of Influenza sequences, we can generate a new genomic sequence that has a high probability of being a valid influenza sequence.
Installation
To install with pip:
pip install quasinet
To fix error with Mac or Windows:
from quasinet.osfix import osfix
# for windows
osfix('win')
# for max x86_64 (macbook pro)
osfix('macx86')
# mac arm (macbook air)
osfix('macarm')
NOTE: If trying to reproduce the paper below, please use pip install quasinet==0.0.58
Dependencies
- scikit-learn
- scipy
- numpy
- numba
- pandas
- joblib
- biopython
Usage
from quasinet import qnet
# initialize qnet
myqnet = qnet.Qnet()
# train the qnet
myqnet.fit(X)
# compute qdistance
qdist = qnet.qdistance(seq1, seq2, myqnet, myqnet)
Examples
Examples are located here.
Documentation
For more documentation, see here.
Papers
For reference, please check out our paper:
Authors
You can reach the ZED lab at: zed.uchicago.edu
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file quasinet-0.1.21.tar.gz
.
File metadata
- Download URL: quasinet-0.1.21.tar.gz
- Upload date:
- Size: 14.5 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.6.0 importlib_metadata/4.6.3 pkginfo/1.7.1 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.64.1 CPython/3.10.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 106b2e03472e4088adf04655ab379912aeaf90edfee7b5b29e3c3f25c7fe70b7 |
|
MD5 | 082e65a4712b1db6b65d808418950e45 |
|
BLAKE2b-256 | c8e95968a37f136986eadbc81bbda96d68cdb9c3a6dfd97497ebc365ef02d7fc |
File details
Details for the file quasinet-0.1.21-py3-none-any.whl
.
File metadata
- Download URL: quasinet-0.1.21-py3-none-any.whl
- Upload date:
- Size: 15.2 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.6.0 importlib_metadata/4.6.3 pkginfo/1.7.1 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.64.1 CPython/3.10.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 12a0f897c24ea29bffad561b6222131b9befec7296f64cb3589c70c9cb7ee8fa |
|
MD5 | 0bf935072ea8ef377367ed0171e19e65 |
|
BLAKE2b-256 | 40de385bd1364e1d89cff97b589a9f5a6aa50d77770fef5cb40029ccd3fc787b |