Removing reads mapping to the lambda genome

Remove reads mapping to the lambda phage genome from a fastq file.
This script uses Heng Li’s minimap2 and his mappy Python binding.
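
The filtering idea can be sketched in Python as follows. This is a minimal illustration, not NanoLyse's actual source: the names `read_fastq`, `filter_reads` and `mappy_predicate` are hypothetical, the `map-ont` preset is an assumption, and the parser assumes plain 4-line fastq records. The mappy import is deferred so the rest of the sketch runs without it installed.

```python
import io
from typing import Callable, Iterator, TextIO, Tuple

FastqRecord = Tuple[str, str, str]  # (header without '@', sequence, quality)


def read_fastq(handle: TextIO) -> Iterator[FastqRecord]:
    """Yield records from a 4-line-per-record fastq stream (assumed layout)."""
    while True:
        header = handle.readline()
        if not header:          # end of stream
            return
        if not header.strip():  # skip stray blank lines between records
            continue
        seq = handle.readline().rstrip("\n")
        handle.readline()       # '+' separator line, ignored
        qual = handle.readline().rstrip("\n")
        yield header.rstrip("\n")[1:], seq, qual


def filter_reads(src: TextIO, dst: TextIO,
                 is_contaminant: Callable[[str], bool]) -> int:
    """Write every read for which is_contaminant(seq) is False; return the count kept."""
    kept = 0
    for header, seq, qual in read_fastq(src):
        if is_contaminant(seq):
            continue
        dst.write(f"@{header}\n{seq}\n+\n{qual}\n")
        kept += 1
    return kept


def mappy_predicate(reference_path: str) -> Callable[[str], bool]:
    """Hypothetical helper: a read counts as contaminant if it has any alignment."""
    import mappy  # third-party minimap2 binding; imported only when needed

    aligner = mappy.Aligner(reference_path, preset="map-ont")  # preset is an assumption
    if not aligner:
        raise RuntimeError("failed to load or build the minimap2 index")
    return lambda seq: next(aligner.map(seq), None) is not None


if __name__ == "__main__":
    # Toy demonstration with a stand-in predicate instead of a real aligner.
    reads = io.StringIO("@r1\nACGT\n+\nIIII\n@r2\nGGGG\n+\nIIII\n")
    out = io.StringIO()
    filter_reads(reads, out, lambda seq: seq.startswith("GG"))
    print(out.getvalue(), end="")
```

With mappy installed, something like `filter_reads(sys.stdin, sys.stdout, mappy_predicate("lambda.fa.gz"))` would mirror the stdin-to-stdout behaviour described under USAGE; the reference file name here is a placeholder.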

INSTALLATION

pip install NanoLyse

USAGE

Reads fastq from stdin and writes to stdout.

NanoLyse [-h] [-v] [-r REFERENCE]

Remove reads mapping to the lambda genome.
Reads fastq from stdin and writes to stdout.

Example usage:
zcat reads.fastq.gz | NanoLyse | gzip > reads_without_lambda.fastq.gz


optional arguments:
  -h, --help            show this help message and exit
  -v, --version         Print version and exit.
  -r REFERENCE, --reference REFERENCE
                        Specify a reference fasta file against which to filter.

WARNING

If some reads from your genome of interest are sufficiently similar to the lambda genome, those reads will be removed together with the lambda reads.
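
One way to check what was discarded is to compare read identifiers before and after filtering. The sketch below is not part of NanoLyse; `read_ids` and `removed_ids` are hypothetical helpers, and they assume uncompressed fastq with exactly four lines per record.

```python
from typing import Set, TextIO


def read_ids(handle: TextIO) -> Set[str]:
    """Collect read identifiers (first word after '@') from 4-line fastq records."""
    ids = set()
    for index, line in enumerate(handle):
        # Only every fourth line is a header; quality lines may also start with '@'.
        if index % 4 == 0 and line.startswith("@"):
            fields = line[1:].split()
            if fields:
                ids.add(fields[0])
    return ids


def removed_ids(original: TextIO, filtered: TextIO) -> Set[str]:
    """Return identifiers present in the original file but absent after filtering."""
    return read_ids(original) - read_ids(filtered)
```

Called on open handles for the input and the filtered output, `removed_ids` returns the set of discarded reads, which can then be inspected for reads you expected to keep.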

EXAMPLES

gunzip -c reads.fastq.gz | NanoLyse | gzip > reads_without_lambda.fastq.gz

In combination with NanoFilt:

gunzip -c reads.fastq.gz | NanoLyse | NanoFilt -q 12 | gzip > filtered_reads_without_lambda.fastq.gz

Using a different genome to filter on (rather than lambda phage):

gunzip -c reads.fastq.gz | NanoLyse --reference mygenome.fa.gz | gzip > reads_without_mygenome.fastq.gz

I welcome all suggestions, bug reports, feature requests and contributions. Please leave an issue or open a pull request. I will usually respond within a day, or rarely within a few days.
