Python bindings to GenDisCal and addition of phylogenetic tree function
Project description
gendiscalpy
Python bindings to GenDisCal and addition of phylogenetic tree function
- GenDisCal repo: https://github.com/LM-UGent/GenDisCal
- GenDisCal paper: https://academic.oup.com/bioinformatics/article/36/8/2337/5695704
Installation
This package was created using Python 3.9
.
pip install git+https://github.com/MrTomRod/gendiscalpy
This will add two scripts to your PATH:
- install_gendiscal
- gendiscal_tree
Usage in terminal
install_gendiscal
This script downloads the GenDisCal binary and unpacks it to a path of your choosing.
$ install_gendiscal
Options:
1: /home/user/PycharmProjects/venvs/gendiscalpy10/bin
2: /home/user/miniconda3/condabin
3: /home/user/bin
4: /usr/bin
5: /usr/sbin
Please select a path to install GenDisCal to!
1
You chose: /home/user/PycharmProjects/venvs/gendiscalpy10/bin
Installed GenDiscal here: /home/user/PycharmProjects/venvs/gendiscalpy10/bin/GenDisCal
Now, you should be able to run GenDisCal in your terminal. Test with GenDisCal --version
.
gendiscal_tree
Create a phylogenetic tree based on GenDisCal a distance matrix. Returns Newick format.
This script uses the upmga
algorithm of the biotite
package. See
wikipedia and
biotite docs.
Requires GenDisCal to be in PATH, see also install_gendiscal
.
Examples:
gendiscal_tree from_files assembly1.fna assembly2.fna
gendiscal_tree from_files assemblies/*.fna
gendiscal_tree from_files assemblies/*.fna --preset PaSiT6 --method euclidian
Usage as Python class
from gendiscalpy import GenDisCal, GenDisCalTree
from gendiscalpy.install_gendiscal import install
# install GenDisCal to /home/user/bin/GenDisCal
install(path='/home/user/bin')
gdc = GenDisCal() # bin='/home/user/bin/GenDisCal'
# these commands return pandas.DataFrame objects
table1 = gdc.run('test-data/*.fna')
table2 = gdc.run('test-data/*.fna', preset='PaSiT6', method='euclidian')
distance_matrix = gdc.run('test-data/*.fna', distance_matrix=True)
histogram = gdc.run('test-data/*.fna', histogram=True)
# this command returns a float
val = gdc.compare_two(assembly_1='test-data/FAM3257-i1-1.fna', assembly_2='test-data/FAM13496-i1-1.fna')
# this command returns a string in Newick format
newick = GenDisCalTree.from_files('test-data/*.fna')
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file gendiscalpy-0.0.2.tar.gz
.
File metadata
- Download URL: gendiscalpy-0.0.2.tar.gz
- Upload date:
- Size: 6.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.10.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f5cca8dc27c04fa9fb4f5db967fea4a8809dcf3415d0b88a6e28f257283c4c5b |
|
MD5 | 79692dbf19f93fd39221b2d9b51963b2 |
|
BLAKE2b-256 | 08bb26890094c88ca3cdb9214d6e959c2b9a4f8e5fbad0fb4f025eae362ba8b3 |
File details
Details for the file gendiscalpy-0.0.2-py3-none-any.whl
.
File metadata
- Download URL: gendiscalpy-0.0.2-py3-none-any.whl
- Upload date:
- Size: 7.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.10.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | aed479359506f15c115b7d101c000441d9339b51264009663b1358680e94670d |
|
MD5 | 4fc96fa2da73570a0fde033178afca67 |
|
BLAKE2b-256 | a03100369ac1a3b7e21503bf469dc97b4c402e3e5cfa8134ede95a2c1e234b16 |