Skip to main content

Python bindings to GenDisCal and addition of phylogenetic tree function

Project description

gendiscalpy

Python bindings to GenDisCal and addition of phylogenetic tree function

Installation

This package was created using Python 3.9.

pip install git+https://github.com/MrTomRod/gendiscalpy

This will add two scripts to your PATH:

  • install_gendiscal
  • gendiscal_tree

Usage in terminal

install_gendiscal

This script downloads the GenDisCal binary and unpacks it to a path of your choosing.

$ install_gendiscal 
Options:
  1: /home/user/PycharmProjects/venvs/gendiscalpy10/bin
  2: /home/user/miniconda3/condabin
  3: /home/user/bin
  4: /usr/bin
  5: /usr/sbin
Please select a path to install GenDisCal to!
1
You chose: /home/user/PycharmProjects/venvs/gendiscalpy10/bin
Installed GenDiscal here: /home/user/PycharmProjects/venvs/gendiscalpy10/bin/GenDisCal

Now, you should be able to run GenDisCal in your terminal. Test with GenDisCal --version.

gendiscal_tree

Create a phylogenetic tree based on GenDisCal a distance matrix. Returns Newick format.

This script uses the upmga algorithm of the biotite package. See wikipedia and biotite docs.

Requires GenDisCal to be in PATH, see also install_gendiscal.

Examples:

gendiscal_tree from_files assembly1.fna assembly2.fna
gendiscal_tree from_files assemblies/*.fna
gendiscal_tree from_files assemblies/*.fna --preset PaSiT6 --method euclidian

Usage as Python class

from gendiscalpy import GenDisCal, GenDisCalTree
from gendiscalpy.install_gendiscal import install

# install GenDisCal to /home/user/bin/GenDisCal
install(path='/home/user/bin')

gdc = GenDisCal()  # bin='/home/user/bin/GenDisCal'

# these commands return pandas.DataFrame objects
table1 = gdc.run('test-data/*.fna')
table2 = gdc.run('test-data/*.fna', preset='PaSiT6', method='euclidian')
distance_matrix = gdc.run('test-data/*.fna', distance_matrix=True)
histogram = gdc.run('test-data/*.fna', histogram=True)

# this command returns a float
val = gdc.compare_two(assembly_1='test-data/FAM3257-i1-1.fna', assembly_2='test-data/FAM13496-i1-1.fna')

# this command returns a string in Newick format
newick = GenDisCalTree.from_files('test-data/*.fna')

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gendiscalpy-0.0.2.tar.gz (6.2 kB view details)

Uploaded Source

Built Distribution

gendiscalpy-0.0.2-py3-none-any.whl (7.7 kB view details)

Uploaded Python 3

File details

Details for the file gendiscalpy-0.0.2.tar.gz.

File metadata

  • Download URL: gendiscalpy-0.0.2.tar.gz
  • Upload date:
  • Size: 6.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.7

File hashes

Hashes for gendiscalpy-0.0.2.tar.gz
Algorithm Hash digest
SHA256 f5cca8dc27c04fa9fb4f5db967fea4a8809dcf3415d0b88a6e28f257283c4c5b
MD5 79692dbf19f93fd39221b2d9b51963b2
BLAKE2b-256 08bb26890094c88ca3cdb9214d6e959c2b9a4f8e5fbad0fb4f025eae362ba8b3

See more details on using hashes here.

File details

Details for the file gendiscalpy-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: gendiscalpy-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 7.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.7

File hashes

Hashes for gendiscalpy-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 aed479359506f15c115b7d101c000441d9339b51264009663b1358680e94670d
MD5 4fc96fa2da73570a0fde033178afca67
BLAKE2b-256 a03100369ac1a3b7e21503bf469dc97b4c402e3e5cfa8134ede95a2c1e234b16

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page