Skip to main content

Converts TEI XML collations to NEXUS and other formats

Project description

https://raw.githubusercontent.com/jjmccollum/teiphy/main/docs/img/teiphy-logo.svg

license badge testing badge coverage badge docs badge black badge git3moji badge iqtree badge raxml badge mrbayes badge beast badge stemma badge joss badge doi badge

A Python package for converting TEI XML collations to NEXUS and other formats.

Textual scholars have been using phylogenetics to analyze manuscript traditions since the early 1990s. Many standard phylogenetic software packages accept as input the NEXUS file format. The teiphy program takes a collation of texts encoded using the Text Encoding Initiative (TEI) guidelines and converts it to a NEXUS format so that it can be used for phylogenetic analysis. It can also convert to other formats as well, including Hennig86 (for TNT), PHYLIP (for RAxML), FASTA, and the XML format used by BEAST 2.7.

Installation

The software can be installed using pip:

pip install teiphy

Alternatively, you can install the package by cloning this repository and installing it with poetry:

git clone https://github.com/jjmccollum/teiphy.git
cd teiphy
pip install poetry
poetry install

Once the package is installed, you can run all unit tests via the command

poetry run pytest

Usage

To use the software, run the teiphy command line tool:

teiphy <input TEI XML> <output file>

teiphy can export to NEXUS, Hennig86 (TNT), PHYLIP (in the relaxed form used by RAxML), FASTA, BEAST 2.7 XML, CSV, TSV, Excel and stemma formats. teiphy will try to infer the file format to export to from the extension of the output file. Accepted file extensions are: “.nex”, “.nexus”, “.nxs”, “.ph”, “.phy”, “.fa”, “.fasta”, “.xml”, “.tnt”, “.csv”, “.tsv”, “.xlsx”.

To explicitly say which format you wish to export to, use the --format option. For example:

teiphy <input TEI XML> <output file> --format nexus

For more information about the other options, see the help with:

teiphy --help

Or see the documentation with explanations about advanced usage.

The software can also be used in Python directly. See API Reference in the documentation for more information.

Credits

teiphy was designed by Joey McCollum (Australian Catholic University) and Robert Turnbull (University of Melbourne). We received additional help from Stephen C. Carlson (Australian Catholic University).

If you use this software, please cite the paper: Joey McCollum and Robert Turnbull, “teiphy: A Python Package for Converting TEI XML Collations to NEXUS and Other Formats,” JOSS 7.80 (2022): 4879, DOI: 10.21105/joss.04879.

@article{MT2022,
    author = {Joey McCollum and Robert Turnbull},
    title = {{teiphy: A Python Package for Converting TEI XML Collations to NEXUS and Other Formats}},
    journal = {Journal of Open Source Software},
    year = {2022},
    volume = {7},
    number = {80},
    pages = {4879},
    publisher = {The Open Journal},
    doi = {10.21105/joss.04879},
    url = {https://doi.org/10.21105/joss.04879}
}

Further details on the capabilities of teiphy, particularly in terms of the text-critically valuable features it can map from TEI XML collations to BEAST 2 inputs, are discussed in Joey McCollum and Robert Turnbull, “Using Bayesian Phylogenetics to Infer Manuscript Transmission History,” DSH 39.1 (2024): 258–279, DOI: 10.1093/llc/fqad089.

@article{MT2024,
    author = {Joey McCollum and Robert Turnbull},
    title = {{Using Bayesian Phylogenetics to Infer Manuscript Transmission History}},
    journal = {Digital Scholarship in the Humanities},
    year = {2024},
    volume = {39},
    number = {1},
    pages = {258--279},
    doi = {10.1093/llc/fqad089},
    url = {https://doi.org/10.1093/llc/fqad089}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

teiphy-0.1.21.tar.gz (42.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

teiphy-0.1.21-py3-none-any.whl (44.1 kB view details)

Uploaded Python 3

File details

Details for the file teiphy-0.1.21.tar.gz.

File metadata

  • Download URL: teiphy-0.1.21.tar.gz
  • Upload date:
  • Size: 42.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.12.3 Linux/6.11.0-1013-azure

File hashes

Hashes for teiphy-0.1.21.tar.gz
Algorithm Hash digest
SHA256 d47ef8420d56f87384313b83a004e79e0e2c033492ea3deba063c98d8b46a1c1
MD5 d2d95770aa072b7e635fa1fce666eb1a
BLAKE2b-256 676c31e5eb304ea9a55cb494898f84f27d18ba38833eb0b5964f3c0b0440a8a7

See more details on using hashes here.

File details

Details for the file teiphy-0.1.21-py3-none-any.whl.

File metadata

  • Download URL: teiphy-0.1.21-py3-none-any.whl
  • Upload date:
  • Size: 44.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.12.3 Linux/6.11.0-1013-azure

File hashes

Hashes for teiphy-0.1.21-py3-none-any.whl
Algorithm Hash digest
SHA256 6ed65c16c8fd84201e88562f96b1e99a76698fd76278c61911b8657c0b634c46
MD5 006e01d7d86e8a0d0fe703950e0f8777
BLAKE2b-256 91d53edd616682e87d5093602cb69a186f77a56b512b14bd9dc48f2ed639c9e2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page