Phage bioinformatics utilities (seqclust runner and friends).
Project description
phu - Phage Utilities
phu (phage utilities) or phutilities, is a modular toolkit for viral genomics workflows. It provides command-line tools to handle common steps in phage bioinformatics pipelines—wrapping complex utilities behind a consistent and intuitive interface.
Installation
You can install phu using mamba or conda from the bioconda channel:
mamba create -n phu bioconda::phu
Usage
As a command-line tool, phu follows a modular structure. You can access different functionalities through subcommands. The general syntax is:
phu <command> [options]
Commands
cluster: Cluster viral sequences into species or other operational taxonomic units (OTUs).simplify-taxa: Simplify vContact taxonomy prediction columns into compact lineage codes.
Contributing
We welcome contributions to phu! Please follow these steps:
- Fork the repository.
- Create a new branch for your feature or bugfix.
- Make your changes and commit them.
- Submit a pull request describing your changes.
Developers
You can also install the development version of phu directly from GitHub:
git clone https://github.com/camilogarciabotero/phu.git
cd phu
pip install -e .
phu is also available on PyPI:
pip install phu
References
This program uses several key tools and libraries, make sure to acknowledge them when using phu:
- vclust: A high-performance clustering tool for viral sequences:
Zielezinski A, Gudyś A, Barylski J, Siminski K, Rozwalak P, Dutilh BE, Deorowicz S. Ultrafast and accurate sequence alignment and clustering of viral genomes. Nat Methods. https://doi.org/10.1038/s41592-025-02701-7
- seqkit: A toolkit for FASTA/Q file manipulation.
Wei Shen*, Botond Sipos, and Liuyang Zhao. 2024. SeqKit2: A Swiss Army Knife for Sequence and Alignment Processing. iMeta e191. doi:10.1002/imt2.191.
- Prodigal: A gene prediction tool for prokaryotic genomes.
Hyatt, D., Chen, G. L., LoCascio, P. F., Land, M. L., Larimer, F. W., & Hauser, L. J. (2010). Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC bioinformatics, 11(1), 119. https://doi.org/10.1186/1471-2105-11-119
- pyrodigal: A tool for gene prediction in prokaryotic genomes.
Larralde, M., (2022). Pyrodigal: Python bindings and interface to Prodigal, an efficient method for gene prediction in prokaryotes. Journal of Open Source Software, 7(72), 4296, https://doi.org/10.21105/joss.04296
- HMMER: A suite of tools for sequence analysis using profile hidden Markov models.
Eddy, S. R. (2011). Accelerated Profile HMM Searches. PLoS Computational Biology, 7(10), e1002195. https://doi.org/10.1371/journal.pcbi.1002195
- pyHMMER: Python bindings for HMMER.
Larralde, M., & Zeller, G. (2023). PyHMMER: a Python library binding to HMMER for efficient sequence analysis. Bioinformatics, 39(5). https://doi.org/10.1093/bioinformatics/btad214
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file phu-0.4.0.tar.gz.
File metadata
- Download URL: phu-0.4.0.tar.gz
- Upload date:
- Size: 20.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f30d31e59a8ac05739b6a65cba75d07823649ffd3254c73b6a16b03a1cf61586
|
|
| MD5 |
66f56c4330d59e37e3711daad00cd69c
|
|
| BLAKE2b-256 |
47c4bbc008bfd92629dcd4484b7de238a82a02d3bafccc31ce3fdde338c66534
|
Provenance
The following attestation bundles were made for phu-0.4.0.tar.gz:
Publisher:
python-publish.yml on camilogarciabotero/phu
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
phu-0.4.0.tar.gz -
Subject digest:
f30d31e59a8ac05739b6a65cba75d07823649ffd3254c73b6a16b03a1cf61586 - Sigstore transparency entry: 513213664
- Sigstore integration time:
-
Permalink:
camilogarciabotero/phu@259418b9931155d9b267a20379ae6d6dd69b35dc -
Branch / Tag:
refs/tags/v0.4.0 - Owner: https://github.com/camilogarciabotero
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-publish.yml@259418b9931155d9b267a20379ae6d6dd69b35dc -
Trigger Event:
release
-
Statement type:
File details
Details for the file phu-0.4.0-py3-none-any.whl.
File metadata
- Download URL: phu-0.4.0-py3-none-any.whl
- Upload date:
- Size: 21.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f1a1f7801aff1e1f45ee461955642092bfd9f1d544cf59d1647cdd893f30b072
|
|
| MD5 |
c3b076b34fa4eda575688e21a057d9c7
|
|
| BLAKE2b-256 |
f0d491e8bd7f6a0680edb37797e93c71cbd088dd4a6257b225eae98689235ca9
|
Provenance
The following attestation bundles were made for phu-0.4.0-py3-none-any.whl:
Publisher:
python-publish.yml on camilogarciabotero/phu
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
phu-0.4.0-py3-none-any.whl -
Subject digest:
f1a1f7801aff1e1f45ee461955642092bfd9f1d544cf59d1647cdd893f30b072 - Sigstore transparency entry: 513213670
- Sigstore integration time:
-
Permalink:
camilogarciabotero/phu@259418b9931155d9b267a20379ae6d6dd69b35dc -
Branch / Tag:
refs/tags/v0.4.0 - Owner: https://github.com/camilogarciabotero
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-publish.yml@259418b9931155d9b267a20379ae6d6dd69b35dc -
Trigger Event:
release
-
Statement type: