A PyPI package to compute multivariate tensor-based orthogonal polynomials for sequence data and map phenotypes onto sequence space.
Project description
ortho_seqs
Converting sequence data (DNA/RNA/protein) to tensor-valued orthogonal polynomials and projecting phenotypes onto the polynomial space. Find out more about the approach in this paper Analyzing genomic data using tensor-based orthogonal polynomials with application to synthetic RNAs The paper gives an example of this method as applied to a case of synthetic RNA from a previously published dataset. Another manuscript detailing the use of this method to understand binding affinities of transcription factors (TFs) is currently in progress. The tool can also be used for protein sequence data and functionality to analyze amino acid sequences has been added to the CLI.
to install an environment with dependencies for this package
conda create -n ortho_seqs pip
pip install -r requirements.txt
conda activate ortho_seq
or
conda env create -f conda_environment.yml
conda activate ortho_seq
then to install the package
python setup.py install
input files needed
- You'll need a file with sequence data (.txt or .dat or .csv).
- You'll need a file with corresponding phenotypes (.txt or .dat or .csv) with the same length as the number of sequences.
to run the commandline tool
To use the code, you can run the sample command below::
ortho_seq orthogonal-polynomial ./ortho_seq_code/testdata/pho4_r4_s1_site_3-4.csv --phenotype ./ortho_seq_code/testdata/pho4_r4_s1_ddg.txt --pop-size 10000 --out-dir ../results_ortho_seq_testing
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for ortho_seq_code-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f8bc30f9fa3dd3a478af4fc10b03ae8365a9ab02abd88dfc5bbc45799024b00a |
|
MD5 | 9a0f87b6d6d8b6d0a555cefbba73dc07 |
|
BLAKE2b-256 | 8eda0c60b37e13fd8569632379eb15a63f8654134d8ed2b8d8a6698dc7d387a7 |