Skip to main content

A reference-based, UMI-aware, 5ʹ-trimming-aware PCR duplicate removal pipeline.

Project description

Dupligänger

Dupligänger is a reference-based, UMI-aware, 5’-trimming-aware PCR duplicate removal pipeline.

Usage: dupliganger [options] <command> [<args>…]

Dupligänger is a pipeline. Each stage of the pipeline is run by passing a ‘command’ to Dupligänger. The commands / pipeline-steps (in order) are as follows:

remove-umi         1. Annotate read names with UMIs (clip inline UMIs if needed).
remove-adapter     2. Remove adapters ('Cutadapt' wrapper).
qtrim              3. Quality trim ('Trimmomatic' wrapper).
annotate-qtrim     4. Annotates quality trimmed file(s).
align              5. Align reads to a reference genome assembly (performed manually by user).
dedup              6. Use the alignment to remove PCR duplicates.

While generally used only by the developers of Dupligänger, the ‘dedup’ command is comprised of the following Dupligänger commands run in the following order:

build-read-db      1. Build a database of aligned reads.
build-location-db  2. Build a database of locations of aligned reads.
build-dup-db       3. Build a database of PCR duplicates.

Options:

-o OUT_DIR      Place results in directory OUT_DIR.
--compress      Compress output.
Note:

Dupligänger supports (and autodetects) input FASTQ files that are gzipped.

See ‘dupliganger help <command>’ for more information on a specific command.

Documentation

For further information on Dupligänger, please see the full documentation at https://github.com/uoregon-postlethwait/dupliganger

Authors

  • Jason Sydes - Conceptual Design and Software Engineering

  • Peter Batzel - Conceptual Design

  • John H. Postlethwait - Project Advisor

Funding

Dupligänger has been funded by the following grants:

  • NIH R01 OD011116 - Resources for Teleost Gene Duplicates and Human Disease

  • NIH R24 OD011199 - Advancing the Scientific Potential of Transcriptomics in Aquatic Models

  • NIH R24 OD018555 - Development of Aquatic Model Resources for Therapeutic Screens

  • NSF PLR-1543383 - Antarctic Fish and MicroRNA Control of Development and Physiology

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dupliganger-0.99rc0.tar.gz (92.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dupliganger-0.99rc0-py2.py3-none-any.whl (62.3 kB view details)

Uploaded Python 2Python 3

File details

Details for the file dupliganger-0.99rc0.tar.gz.

File metadata

  • Download URL: dupliganger-0.99rc0.tar.gz
  • Upload date:
  • Size: 92.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/49.6.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.6

File hashes

Hashes for dupliganger-0.99rc0.tar.gz
Algorithm Hash digest
SHA256 f9487bdf5be5ddad7ffa26e4a73f37ebf0f7765c9374958c9ab53890efd51ea0
MD5 be9b5ce2b25b4d3e191289c58fcf5ec0
BLAKE2b-256 11e8b39efefa9da2ff905104406e45344e562c27906408e77655c9632883449d

See more details on using hashes here.

File details

Details for the file dupliganger-0.99rc0-py2.py3-none-any.whl.

File metadata

  • Download URL: dupliganger-0.99rc0-py2.py3-none-any.whl
  • Upload date:
  • Size: 62.3 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/49.6.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.6

File hashes

Hashes for dupliganger-0.99rc0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 208a043d1e1f5ddf02d4e50fb2293edd029f590b06b4c5eba7b33828aaeb3b22
MD5 d7f91ef44c47969fa086e26c21ef5655
BLAKE2b-256 0dd5e7de6e274953d3f0852e13e614102472e82057fccef29dd0da2f25fec89e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page