Skip to main content

SNV variant caller that uses discordant overlapping read pairs to calculate sample-specific base error rates.

Project description

BBQ (BetterBaseQuals)

A somatic SNV variant caller that uses discordant overlapping read pairs to calculate sample-specific base error rates.

Requirements

bbq requires Python 3.8 or above.

Install

bbq can be installed using pip:

pip install betterbasequals

or using pipx:

pipx install betterbasequals

Usage

The bbq tool has different commands that can be run. To get at list of the different command you can use -h or --help:

bbq -h

To see the options for a specific command you can use -h after a command:

bbq count -h

To call variants in a sample you will need to first gather training data then train the model on the sample and finally call variants using the model. Typically you will want to split this into multiple jobs and we recommend using the snakemake workflow bbq_pipeline that does this. That github project also contain training data that can be used to test BBQ.

If you want to run both the error counting, model training and variant calling using a single command that is also possible using the following command (but we strongly recommend using the above mentioned workflow):

bbq call --bam_file cfdna.bam --filter_bam_file  pbmc.bam --twobit_file ~/Data/2bit/hg38.2bit

For some of the command it is possible to run it on only a part of the genome using the -r {chrom}:{start}-{end} option Fx:

bbq call_only --bam_file tmp.bam --twobit_file ~/Data/2bit/hg38.2bit --input_file_kmerpapa model.txt -r chr22:30000000-31000000 

Several of the commands requires the reference genome in 2bit format. If the bam file is mapped to a known reference genome the 2bit file can be downloaded from: https://hgdownload.cse.ucsc.edu/goldenpath/{genome}/bigZips/{genome}.2bit where {genome} is a valid UCSC genome assembly name (fx. "hg38").

Otherwise a 2bit file can be created from a fasta file using the faToTwoBit command. A linux binary of the tool can be downloaded from UCSC:

wget https://hgdownload.cse.ucsc.edu/admin/exe/linux.x86_64/faToTwoBit

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

betterbasequals-0.3.0.tar.gz (31.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

betterbasequals-0.3.0-py3-none-any.whl (35.5 kB view details)

Uploaded Python 3

File details

Details for the file betterbasequals-0.3.0.tar.gz.

File metadata

  • Download URL: betterbasequals-0.3.0.tar.gz
  • Upload date:
  • Size: 31.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.9.15 Darwin/24.3.0

File hashes

Hashes for betterbasequals-0.3.0.tar.gz
Algorithm Hash digest
SHA256 510c1f6329bb646529e3ba7e4d7c84840910010922bc13c2717762f58a2956bc
MD5 9427707ab58aeee20a4b737c74fc5682
BLAKE2b-256 0c50e22ffd7fe863886957c68e81e975d52bdd4eb5faa60a759f3474bd2f4fd9

See more details on using hashes here.

File details

Details for the file betterbasequals-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: betterbasequals-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 35.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.9.15 Darwin/24.3.0

File hashes

Hashes for betterbasequals-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b709dad043b21600faaa35c1a145aae57dc18b72cb43ea56f4fe8e765888ab86
MD5 4dc78b86252d8b500cb7f527015111c2
BLAKE2b-256 cd14641da880ce12f72fbcca3d7e48bf96abb5cb94c33ed3987913d7afc6c7f5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page