A tool for purging host sequences from metagenomic reads
Project description
HostPurge is a tool for removing host contamination from sequencing reads. It supports four modes, each designed for different scenarios based on the level of host contamination and the need for precision.
Table of contents
Introduction
HostPurge is a versatile tool designed to remove host contamination from sequencing reads. It supports multiple modes of operation, allowing it to handle different levels of host contamination and precision requirements.
Requirements
- Linux or macOS
- Python 3.8 or later
- KneadData 0.12.0 or later
- Kraken2 2.1.2 or later
Installation
Install from source
These instructions install the most up-to-date version of HostPurge:
conda install hostpurge
Usage
Run HostPurge with the desired mode:
hostpurge input_1.fastq input_2.fastq output_dir --mode [a|b|c|d] [additional options]
For detailed help information:
hostpurge --help
Modes
HostPurge supports four modes, each suited for different contamination levels and precision needs:
Choosing the right mode
Mode a: Best for samples with low host contamination. This mode is slower but more thorough, making it ideal for sensitive samples where precision is critical.
Mode b: Fastest mode but with lower accuracy. Use this mode when you need quick results and the host contamination level is manageable.
Mode c: Recommended for samples with high host contamination. This mode balances speed and accuracy, providing reliable results for most scenarios.
Mode d: Best for samples with low host contamination but requiring high precision. This mode ensures the highest accuracy by using both KneadData and Kraken2 in sequence.
Output files
HostPurge generates the following output files in the specified output directory:
cleaned_reads_1.fastq: Cleaned reads from input_1
cleaned_reads_2.fastq: Cleaned reads from input_2
host_contamination_report.txt: Report detailing the host contamination removal process
log.txt: Log file with detailed process information
Acknowledgements
HostPurge was developed with contributions from the bioinformatics community. Special thanks to the developers of KneadData and Kraken2, whose tools are integral to the functionality of HostPurge.
License
HostPurge is licensed under the MIT License.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file hostpurge-1.1.4.tar.gz
.
File metadata
- Download URL: hostpurge-1.1.4.tar.gz
- Upload date:
- Size: 6.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.8.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7037845b575f045122891b65e77704a15e68cc7a569225d66ed76b9bc28ebdd3 |
|
MD5 | cbc0addf52d0f8ae4eaa7c0d6f555c49 |
|
BLAKE2b-256 | b44e77bc094d80c7c6214af9c06cdefbf08afcc4ffb11deab647546ca905033a |
File details
Details for the file hostpurge-1.1.4-py3-none-any.whl
.
File metadata
- Download URL: hostpurge-1.1.4-py3-none-any.whl
- Upload date:
- Size: 7.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.8.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d7e578c451ee7fd18912e50f3e543723591bac3f6b9dcf8bbb0cffef0a10daf1 |
|
MD5 | 814eabd6eac769d972228c4c5b682b07 |
|
BLAKE2b-256 | 687a960d9129a541072d3c4d25d663e7e3985f4a286a8ad1d2e4cfc5ea028445 |