Utitilies for constructing and manipulating models for non-local structural dependencies in genomic sequences
Project description
===============
quasinet
===============
.. figure:: https://badge.fury.io/py/quasinet.svg
:alt: ehrzero PyPI Downloads
.. image:: http://zed.uchicago.edu/logo/logozed1.png
:height: 400px
:scale: 50 %
:alt: alternate text
:align: center
.. class:: no-web no-pdf
:Author: ZeD@UChicago <zed.uchicago.edu>
:Description: Infer Non-local Structural Dependencies In Genomic Sequences. Genomic sequences are esentially compressed encodings of phenotypic information. This package provides a novel set of tools to extract long-range structural dependencies in genotypic data that define the phenotypic outcomes. The key capabilities implemented here are as follows: 1. computing the q-net given a databse of nucleic acid sequences, which is a family of conditional inference trees capturing the predictability of each nucleotide position given the rest of the genome. 2. Computing a structure-aware evolution-adaptive notion of distance between genomes, which demonstrably is much more biologically relevant compared to the standard edit distance 3. Ability to draw samples in-silico, that have a high probability of being biologically correct. For example, given a database of HIV sequences, we can generate a new genomic sequence, which has a high probability of being a valid encoding of a HIV virion. The constructed q-net for long term non-progressor clinical phenotype in HIV-1 infection is shown below.
.. image:: https://zed.uchicago.edu/data/img/q-ltnr.png
:height: 400px
:scale: 50 %
:alt: q-net for long term non-progressor clinical phenotype in HIV-1 infection
:align: center
quasinet
===============
.. figure:: https://badge.fury.io/py/quasinet.svg
:alt: ehrzero PyPI Downloads
.. image:: http://zed.uchicago.edu/logo/logozed1.png
:height: 400px
:scale: 50 %
:alt: alternate text
:align: center
.. class:: no-web no-pdf
:Author: ZeD@UChicago <zed.uchicago.edu>
:Description: Infer Non-local Structural Dependencies In Genomic Sequences. Genomic sequences are esentially compressed encodings of phenotypic information. This package provides a novel set of tools to extract long-range structural dependencies in genotypic data that define the phenotypic outcomes. The key capabilities implemented here are as follows: 1. computing the q-net given a databse of nucleic acid sequences, which is a family of conditional inference trees capturing the predictability of each nucleotide position given the rest of the genome. 2. Computing a structure-aware evolution-adaptive notion of distance between genomes, which demonstrably is much more biologically relevant compared to the standard edit distance 3. Ability to draw samples in-silico, that have a high probability of being biologically correct. For example, given a database of HIV sequences, we can generate a new genomic sequence, which has a high probability of being a valid encoding of a HIV virion. The constructed q-net for long term non-progressor clinical phenotype in HIV-1 infection is shown below.
.. image:: https://zed.uchicago.edu/data/img/q-ltnr.png
:height: 400px
:scale: 50 %
:alt: q-net for long term non-progressor clinical phenotype in HIV-1 infection
:align: center
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
quasinet-0.0.26.tar.gz
(27.7 kB
view details)
File details
Details for the file quasinet-0.0.26.tar.gz
.
File metadata
- Download URL: quasinet-0.0.26.tar.gz
- Upload date:
- Size: 27.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: Python-urllib/2.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 51374f1bc7236e326bc1b93c6eb5045ce5519acd755328c57eb7e97dab42be49 |
|
MD5 | 95e5d4ed5b0f18d240213b1d6840c952 |
|
BLAKE2b-256 | 1b5d26f65e3c04acaabfda35b7a519792ae15502175ea098f56d6f47b7ac204e |