Skip to main content

A tool for purging host sequences from metagenomic reads

Project description

HostPurge

HostPurge is a tool for removing host contamination from sequencing reads. It supports four modes, each designed for different scenarios based on the level of host contamination and the need for precision.

Table of contents

Introduction

HostPurge is a versatile tool designed to remove host contamination from sequencing reads. It supports multiple modes of operation, allowing it to handle different levels of host contamination and precision requirements.

Requirements

  • Linux or macOS
  • Python 3.8 or later
  • KneadData 0.12.0 or later
  • Kraken2 2.1.2 or later

Installation

Install from source

These instructions install the most up-to-date version of HostPurge:

conda install hostpurge

Usage

Run HostPurge with the desired mode:

hostpurge input_1.fastq input_2.fastq output_dir --mode [a|b|c|d] [additional options]

For detailed help information:

hostpurge --help

Modes

HostPurge supports four modes, each suited for different contamination levels and precision needs:

Choosing the right mode

Mode a: Best for samples with low host contamination. This mode is slower but more thorough, making it ideal for sensitive samples where precision is critical.

Mode b: Fastest mode but with lower accuracy. Use this mode when you need quick results and the host contamination level is manageable.

Mode c: Recommended for samples with high host contamination. This mode balances speed and accuracy, providing reliable results for most scenarios.

Mode d: Best for samples with low host contamination but requiring high precision. This mode ensures the highest accuracy by using both KneadData and Kraken2 in sequence.

Output files

HostPurge generates the following output files in the specified output directory:

cleaned_reads_1.fastq: Cleaned reads from input_1

cleaned_reads_2.fastq: Cleaned reads from input_2

host_contamination_report.txt: Report detailing the host contamination removal process

log.txt: Log file with detailed process information

Acknowledgements

HostPurge was developed with contributions from the bioinformatics community. Special thanks to the developers of KneadData and Kraken2, whose tools are integral to the functionality of HostPurge.

License

HostPurge is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hostpurge-1.1.3.tar.gz (7.7 kB view details)

Uploaded Source

Built Distribution

hostpurge-1.1.3-py3-none-any.whl (8.2 kB view details)

Uploaded Python 3

File details

Details for the file hostpurge-1.1.3.tar.gz.

File metadata

  • Download URL: hostpurge-1.1.3.tar.gz
  • Upload date:
  • Size: 7.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.8.19

File hashes

Hashes for hostpurge-1.1.3.tar.gz
Algorithm Hash digest
SHA256 13cb4a84c94b8d96eb6a02e2d89dd0ffd0fa8cd4998162c776987b4d621bddf7
MD5 56112d211f5263dd1a0d3d9fa09fcf61
BLAKE2b-256 c5b09975f4963ea564c64ad233de4f00470e7ab1f2f1cbc4f181b94fb6cd9c43

See more details on using hashes here.

File details

Details for the file hostpurge-1.1.3-py3-none-any.whl.

File metadata

  • Download URL: hostpurge-1.1.3-py3-none-any.whl
  • Upload date:
  • Size: 8.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.8.19

File hashes

Hashes for hostpurge-1.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 9098fbcb3f65aa22e3f2327bda34ec81f0003b514bff1ecdcd8fff49d01bde37
MD5 485d2b7feafc299a8c4e8c3a876ad094
BLAKE2b-256 8750e164819f3d26d95c8d0a86ebae85c17256e46e9c4e5aa35635b3d9f5a1d4

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page