For matching species databases
Phylo-Match
Phylo-Match is a package for correcting misspelled or inconsistently labeled taxon names when matching a species database to a phylogenetic tree.
Installation
This project was built with Python 3.8 (python3). Later versions should work, but earlier versions are untested. It is not compatible with Python 2.
Check that you have python3 installed:
python3 --version
Output should be something like:
Python 3.8.12
If python3 --version does not print a version, follow the Python 3 installation instructions (or install Python 3 your own way).
Use the package manager pip3 to install phylo-match.
Install pip if you don't already have it:
python3 -m ensurepip --upgrade
Install Phylo-Match:
pip3 install phylo-match
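The two checks above (Python version and a successful install) can also be done from Python. This is only a minimal sketch using the standard library: the helper name installed_version is made up for illustration, and the 3.8 threshold is taken from the version this project says it was built with.

```python
import sys
from importlib import metadata  # standard library from Python 3.8 onward


def installed_version(package="phylo-match"):
    """Return the installed version of `package`, or None if pip has not installed it.

    `installed_version` is a hypothetical helper, not part of Phylo-Match.
    """
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# The README states the project was built with Python 3.8; earlier versions are untested.
python_ok = sys.version_info >= (3, 8)

print("Python 3.8 or later:", python_ok)
print("phylo-match version:", installed_version() or "not installed")
```

If the second line reports "not installed", rerun the pip3 command above and check that pip3 belongs to the same Python as python3.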
Usage
phylo-match
Use the GUI to select a database file (.csv) and a taxa tree (.nexus) to match the database to. Click run when you are happy with your selection.
Phylo-Match does all of its calculations and API requests up front, so you may have to wait 10-15 minutes after clicking run, depending on your internet speed.
You can shorten this wait by unchecking 'Lookup Taxa Info'. This is a good idea if you're very familiar with the taxa, but Phylo-Match will then provide no information about matches beyond the name.
Information about the database's taxa appears on the left-hand side. All similar entries in the .nexus file appear in the middle of the screen. To resolve an entry, click the name you'd like to change it to, enter a name manually at the bottom, or click 'same species' or 'same genus' for additional options, if available.
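To give a feel for how "similar entries" can be found, here is a rough sketch of approximate name matching. It uses difflib from the Python standard library as a stand-in. This README does not describe how Phylo-Match actually scores similarity, so the function suggest_matches, the cutoff of 0.6, and the sample tree list are all assumptions made for illustration.

```python
import difflib


def suggest_matches(db_name, tree_names, limit=5, cutoff=0.6):
    """Return tree names that look similar to `db_name`, best match first.

    Hypothetical helper: difflib's similarity ratio stands in for whatever
    scoring Phylo-Match uses internally.
    """
    return difflib.get_close_matches(db_name, tree_names, n=limit, cutoff=cutoff)


# Made-up tree taxa, loosely based on the examples in this README.
tree_names = ["Aotus azarai", "Aotus nancymaae", "Lama pacos", "Cercocebus torquatus"]

# A database spelling that differs from the tree by one letter.
print(suggest_matches("Aotus azarae", tree_names))
```

A one-letter difference such as azarae versus azarai scores very high, so that name is listed first. Unrelated names fall below the cutoff and are dropped.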
Once all selections have been made, a new .csv file will be created in the same directory as the original database .csv file.
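The final step can be pictured as applying a table of renames to the database and saving the result next to the original file. The sketch below is not Phylo-Match's own code: the column name "taxon", the "_matched" filename suffix, and the function write_matched_csv are all hypothetical, chosen only to make the idea concrete.

```python
import csv
from pathlib import Path


def write_matched_csv(db_path, renames, column="taxon"):
    """Write a copy of the CSV at `db_path` with names in `column` replaced.

    Hypothetical sketch: the column name and the "_matched" suffix are
    assumptions, not Phylo-Match's actual output format.
    """
    db_path = Path(db_path)
    # Same directory as the original file, with a different file name.
    out_path = db_path.with_name(db_path.stem + "_matched" + db_path.suffix)

    with db_path.open(newline="") as src, out_path.open("w", newline="") as dst:
        reader = csv.DictReader(src)
        writer = csv.DictWriter(dst, fieldnames=reader.fieldnames)
        writer.writeheader()
        for row in reader:
            # Names without a chosen replacement are kept unchanged.
            row[column] = renames.get(row[column], row[column])
            writer.writerow(row)
    return out_path
```

For example, write_matched_csv("primates.csv", {"Aotus azarae": "Aotus azarai"}) would write primates_matched.csv alongside primates.csv, leaving the original untouched.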
Examples:
The program (correctly) thinks my best bet to match the database taxon Aotus azarae is Aotus azarai from the tree. But if I don't like that option I can click on 'same species' to see other taxa in the tree with the species name "azarae" or 'same genus' to see other members of the genus Aotus.
Here's an example of the 'same species' option at work: the database has Vicugna pacos but the tree has Lama pacos. This option is useful if the genus has been split up.
Here's an example where I might want to use the 'same genus' option. This can be useful if you don't have many taxa in that genus and it doesn't really matter which species in the genus you use. Here I have data for Cercocebus atys, but that taxon isn't in the tree, so I could use a different Cercocebus species as a substitute. The 'removed suggestions' in the bottom left tells me that C. agilis is already matched between the data and the tree. However, C. atys and C. torquatus are sister species, both equally closely related to C. agilis, so I can use C. torquatus instead.
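The 'same species' and 'same genus' options amount to filtering the tree by one half of the binomial name. Here is a small sketch of that idea. It assumes names are written as "Genus species" separated by a space; the function names and the sample tree are invented for illustration and are not taken from Phylo-Match.

```python
def split_binomial(name):
    """Split 'Genus species' into (genus, species). Assumes a space separator."""
    genus, _, species = name.strip().partition(" ")
    return genus, species


def same_species(name, tree_taxa):
    """Tree taxa sharing the species epithet of `name` (hypothetical helper)."""
    species = split_binomial(name)[1]
    return [t for t in tree_taxa if t != name and split_binomial(t)[1] == species]


def same_genus(name, tree_taxa):
    """Tree taxa sharing the genus of `name` (hypothetical helper)."""
    genus = split_binomial(name)[0]
    return [t for t in tree_taxa if t != name and split_binomial(t)[0] == genus]


# Made-up tree taxa based on the examples above.
tree_taxa = ["Lama pacos", "Lama glama", "Cercocebus agilis", "Cercocebus torquatus"]

print(same_species("Vicugna pacos", tree_taxa))   # alpaca example: match on "pacos"
print(same_genus("Cercocebus atys", tree_taxa))   # mangabey example: match on "Cercocebus"
```

Names already matched elsewhere (C. agilis in the example above) would still need to be excluded by hand in this sketch.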
Contributing
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
Please make sure to update tests as appropriate.
License
File details
Details for the file phylo-match-0.0.8.tar.gz.
File metadata
- Download URL: phylo-match-0.0.8.tar.gz
- Upload date:
- Size: 22.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.8.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 1963a6aa8dee62b30cc9710bf6fd1843269b4cec7da55ef4c7c96a4db9107821 |
| MD5 | 0aa4b2bc134775711e4e02ea90730edf |
| BLAKE2b-256 | dbc31a171c4ca5be2ce8db11dd348cc4bd0a633ad43e643a26adda7c972fdb2e |
File details
Details for the file phylo_match-0.0.8-py3-none-any.whl.
File metadata
- Download URL: phylo_match-0.0.8-py3-none-any.whl
- Upload date:
- Size: 14.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.8.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 0159eb5a084d0ef4ebccd1125e535181d967457b86592bdac3587a7b22783d39 |
| MD5 | 0e1bbbd999399c9141219401ae4c30f8 |
| BLAKE2b-256 | 4225193493528560c417773f53ce9618b77a21038cb5998f8331a4d90ae7b1f2 |