Sequence salvaging script from corrupted BAM files
Project description
bamsalvage
INTRODUCTION
bamsalvage is a tools to recover sequence reads as much as possible from possibly corrupted BAM files. This software share the common purpose with bamrescue by Jérémie Roquet (https://bamrescue.arkanosis.net/). bamrescue detects corrupted BGZF block using CRC32 checksums and skip corrupted blocks and the method works well if all blocks begin with new reads.
When we would like to recover long-read sequences, a read can span more than one BGZF blocks since the maximum block size is less than sequencer outputs.
Skipping corrupted blocks does not solve such the troubles and often results in termination of Samtools and failure of sequence recovery.
bamsalvage scans next available start positions when any corrupted blocks are detected. Since the goal of the software is rescuing sequences, bamsalvage do not recover all information included in BAM file but retrieves reads and qual sequences.
INSTALL
pip install bamsalvage
USAGE
bamsalvage -i [FILE] -o {DIRECTORY] --mode [fa,fa.gz,fastq,fastq.gz] [--verbose]
Options:
-i, --input <FILE> Input BAM file
-o, --output <FILE> Output filename
-l, --limit <integer> Limiting counts [default: 0]
--mode Output format (fa:fasta, fa.gz:gzipped fasta, fastq:fastq, fastq.gz(default):gzipped fastq)
-v, --verbose verbosity
-h, --help Print help
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file bamsalvage-0.1.7.tar.gz
.
File metadata
- Download URL: bamsalvage-0.1.7.tar.gz
- Upload date:
- Size: 156.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9cbeb3822054c1e0029f6c78e032f858e62a96c5510db39a37a309344d7ca754 |
|
MD5 | 4876fa85b81780be11fdd037f5096847 |
|
BLAKE2b-256 | 4b5e409e91c5403a16e62f6e97dba03b84aa69e8e22eaf5a5761ffe20aed6346 |
File details
Details for the file bamsalvage-0.1.7-py3-none-any.whl
.
File metadata
- Download URL: bamsalvage-0.1.7-py3-none-any.whl
- Upload date:
- Size: 10.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 108610ab3f913404e10bb0066a8e7813ac53f1dec10fa8df68f2b1c76f60f284 |
|
MD5 | 7c90e93eafe17a62bf6863ebcf93b786 |
|
BLAKE2b-256 | 2fddef276361aba0c4e3df93a0035a2141458c4f3ac1b91054d8901e45a1ce4b |