Sequence salvaging script from corrupted BAM files

bamsalvage

INTRODUCTION

bamsalvage is a tool that recovers as many sequence reads as possible from possibly corrupted BAM files. It shares its purpose with bamrescue by Jérémie Roquet (https://bamrescue.arkanosis.net/). bamrescue detects corrupted BGZF blocks using CRC32 checksums and skips them, a method that works well if every block begins with a new read.
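
The CRC32 check mentioned above can be illustrated with a minimal sketch. It is not taken from bamrescue or bamsalvage; it only follows the BGZF block layout in the SAM/BAM specification and assumes the usual case where the gzip extra field holds nothing but the "BC" subfield (XLEN = 6). The helper names make_bgzf_block and block_is_intact are hypothetical.

```python
import struct
import zlib

# 18-byte BGZF header: ID1 ID2 CM FLG | MTIME | XFL OS | XLEN | SI1 SI2 | SLEN | BSIZE
BGZF_HEADER = struct.Struct("<4BI2BH2BHH")


def make_bgzf_block(payload: bytes) -> bytes:
    """Build one BGZF block around payload (only used to create demo data)."""
    comp = zlib.compressobj(6, zlib.DEFLATED, -15)  # raw deflate, no zlib wrapper
    cdata = comp.compress(payload) + comp.flush()
    bsize = BGZF_HEADER.size + len(cdata) + 8 - 1   # total block size minus 1
    header = BGZF_HEADER.pack(31, 139, 8, 4, 0, 0, 255, 6, 66, 67, 2, bsize)
    trailer = struct.pack("<II", zlib.crc32(payload), len(payload))
    return header + cdata + trailer


def block_is_intact(block: bytes) -> bool:
    """Return True if block is a complete BGZF block whose CRC32 matches."""
    if len(block) < BGZF_HEADER.size + 8:
        return False
    (id1, id2, cm, flg, _mtime, _xfl, _os,
     xlen, si1, si2, slen, bsize) = BGZF_HEADER.unpack_from(block)
    # Assumption: the extra field contains only the "BC" subfield.
    if (id1, id2, cm, flg, xlen, si1, si2, slen) != (31, 139, 8, 4, 6, 66, 67, 2):
        return False
    if bsize + 1 != len(block):
        return False
    crc_expected, isize = struct.unpack("<II", block[-8:])
    try:
        data = zlib.decompress(block[BGZF_HEADER.size:-8], -15)
    except zlib.error:
        return False
    return len(data) == isize and zlib.crc32(data) == crc_expected
```

A block whose stored checksum or length no longer matches its contents is reported as damaged, which is the signal a recovery tool can use to decide that the block should be skipped.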

With long-read sequences, however, a single read can span more than one BGZF block, because the maximum block size is smaller than the reads that long-read sequencers produce.
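
A rough calculation shows why long reads cross block boundaries. The sketch below assumes the limits given in the SAM/BAM specification: at most 64 KiB of uncompressed data per BGZF block, 4 bits per base for the sequence, and 1 byte per base for the quality string. It ignores the read name, CIGAR and tags, so it gives a lower bound only. The function names are hypothetical.

```python
import math

# Upper bound on uncompressed bytes per BGZF block (writers often use slightly less).
BGZF_MAX_PAYLOAD = 65536


def record_size_lower_bound(n_bases: int) -> int:
    """Lower bound on the size in bytes of a BAM record holding n_bases bases."""
    fixed = 36                 # 4-byte block_size plus 32 bytes of fixed fields
    seq = (n_bases + 1) // 2   # two bases packed per byte
    qual = n_bases             # one quality byte per base
    return fixed + seq + qual


def min_blocks_spanned(n_bases: int) -> int:
    """Smallest number of BGZF blocks that a record of this size can occupy."""
    return math.ceil(record_size_lower_bound(n_bases) / BGZF_MAX_PAYLOAD)


print(min_blocks_spanned(150))      # short read: fits in a single block
print(min_blocks_spanned(100_000))  # 100 kb read: needs at least 3 blocks
```

If any one of those blocks is corrupted, the read is cut in the middle, and the block that follows no longer starts at a record boundary.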

Skipping corrupted blocks does not solve this problem; it often causes Samtools to terminate and sequence recovery to fail.

When bamsalvage detects a corrupted block, it scans for the next available start position. Since the goal of the software is to rescue sequences, bamsalvage does not recover all the information in the BAM file; it retrieves only the read sequences and their quality strings.
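
One way to resynchronise after damage is sketched below. This is an illustration under stated assumptions, not bamsalvage's actual implementation: it searches the raw bytes for the BGZF magic number, accepts a candidate only if its header, length and CRC32 are consistent, and otherwise moves on by one byte. It assumes blocks whose extra field holds only the "BC" subfield, and the names try_block and salvage_blocks are hypothetical. Locating the next read start inside the decompressed data is a further step that is not shown.

```python
import struct
import zlib

MAGIC = b"\x1f\x8b\x08\x04"  # gzip ID1, ID2, CM=deflate, FLG=FEXTRA


def try_block(buf: bytes, pos: int):
    """Return (payload, next_pos) if a valid BGZF block starts at pos, else None."""
    if pos + 18 > len(buf):
        return None
    xlen, si1, si2, slen, bsize = struct.unpack_from("<HBBHH", buf, pos + 10)
    if (xlen, si1, si2, slen) != (6, 66, 67, 2):
        return None
    end = pos + bsize + 1
    if bsize + 1 < 26 or end > len(buf):
        return None
    crc, isize = struct.unpack_from("<II", buf, end - 8)
    try:
        data = zlib.decompress(buf[pos + 18:end - 8], -15)  # raw deflate
    except zlib.error:
        return None
    if len(data) != isize or zlib.crc32(data) != crc:
        return None
    return data, end


def salvage_blocks(buf: bytes):
    """Yield the payload of every intact block, resynchronising after damage."""
    pos = 0
    while True:
        pos = buf.find(MAGIC, pos)
        if pos < 0:
            return
        hit = try_block(buf, pos)
        if hit is None:
            pos += 1           # not a usable block here; keep scanning
            continue
        data, pos = hit
        yield data
```

Checking the CRC32 before accepting a candidate guards against magic bytes that appear by chance inside compressed data.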

INSTALL

pip install bamsalvage

USAGE

bamsalvage -i [FILE] -o [DIRECTORY] --mode [fa,fa.gz,fastq,fastq.gz] [--verbose]
Options:
  -i, --input <FILE>     Input BAM file
  -o, --output <FILE>    Output filename
  -l, --limit <integer>  Limiting counts [default: 0]
  --mode                 Output format (fa: fasta, fa.gz: gzipped fasta, fastq: fastq, fastq.gz: gzipped fastq) [default: fastq.gz]
  -v, --verbose          verbosity
  -h, --help             Print help
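
As a hedged illustration of the options above, the following sketch assembles a command line from Python using only the flags listed in this section. The file names are placeholders, and the helper build_command is hypothetical; it is not part of bamsalvage.

```python
import subprocess

MODES = ("fa", "fa.gz", "fastq", "fastq.gz")


def build_command(input_bam, output, mode="fastq.gz", limit=0, verbose=False):
    """Return the argument list for one bamsalvage run."""
    if mode not in MODES:
        raise ValueError(f"mode must be one of {MODES}")
    cmd = ["bamsalvage", "-i", input_bam, "-o", output, "--mode", mode]
    if limit:
        cmd += ["-l", str(limit)]
    if verbose:
        cmd.append("--verbose")
    return cmd


cmd = build_command("broken.bam", "recovered", mode="fastq.gz", verbose=True)
print(" ".join(cmd))
# To actually run it (requires bamsalvage to be installed):
# subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell quoting problems when file names contain spaces.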

Download files

Source Distribution

bamsalvage-0.1.7.tar.gz (156.3 kB)

Uploaded Source

Built Distribution

bamsalvage-0.1.7-py3-none-any.whl (10.8 kB)

Uploaded Python 3

File details

Details for the file bamsalvage-0.1.7.tar.gz.

File metadata

  • Download URL: bamsalvage-0.1.7.tar.gz
  • Upload date:
  • Size: 156.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.0

File hashes

Hashes for bamsalvage-0.1.7.tar.gz
Algorithm    Hash digest
SHA256       9cbeb3822054c1e0029f6c78e032f858e62a96c5510db39a37a309344d7ca754
MD5          4876fa85b81780be11fdd037f5096847
BLAKE2b-256  4b5e409e91c5403a16e62f6e97dba03b84aa69e8e22eaf5a5761ffe20aed6346

File details

Details for the file bamsalvage-0.1.7-py3-none-any.whl.

File metadata

  • Download URL: bamsalvage-0.1.7-py3-none-any.whl
  • Upload date:
  • Size: 10.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.0

File hashes

Hashes for bamsalvage-0.1.7-py3-none-any.whl
Algorithm    Hash digest
SHA256       108610ab3f913404e10bb0066a8e7813ac53f1dec10fa8df68f2b1c76f60f284
MD5          7c90e93eafe17a62bf6863ebcf93b786
BLAKE2b-256  2fddef276361aba0c4e3df93a0035a2141458c4f3ac1b91054d8901e45a1ce4b
