Skip to main content

Call ISBT alleles from VCF/s

Project description

RBCeq2: blood group allele inference

Lifeblood Logo

[!WARNING] NOT FOR CLINICAL USE

Version v2.4.1

RBCeq2 reads in genomic variant data in the form of variant call files (VCF) and outputs blood group (BG) genotype and phenotype inference.

At the highest level RBCeq2 finds all possible alleles, then filters out those that fail certain logic checks. This allows for an auditable trail of why it has reached a certain result. Every effort has been made to be explicit both in encoding alleles in our database and while writing code. This results in verbose but unambiguous results. Last, some liberties have been taken to standardise syntax and nomenclature across blood groups.

The initial release of RBCeq2 was focused on perfecting the calling of International Society for Blood Transfusion (ISBT) defined BG alleles from simple variants; single nucleotide variants (SNVs) and small insertions and deletions (indels). v2.3.0 focussed on phasing. V2.4.0 (current) focuses on large variants and the RH region, using long read sequencing data.

Bugs

This software is extensively tested and accurately reports genotypes/phenotypes based on our inhouse definitions of the ‘correct’ answer, however, there are some examples where the ‘correct’ answer is subjective. The docs are detailed – if you find what you think is a bug in the results from RBCeq2 please take the time to understand if it inline with what we intended or not (use --debug and look to see what happened).

Documentation

Documentation can be downloaded from the release page, you will need to be signed in to github to access it.

How To

Install via pip (python3.12+) or clone the git repository:

pip install RBCeq2

rbceq2 -h

usage: rbceq2 --vcf example_multi_sample.vcf.gz --out example --reference_genome GRCh37

options:
  -h, --help            show this help message and exit
  -v, --version         Show programs version number and exit.
  --vcf VCF             Path to VCF file/s. Give a folder if you want to pass multiple separate files (file names must end in .vcf or .vcf.gz), or alternatively give a file if using a multi-sample VCF. (default: None)
  --out OUT             Prefix for output files (default: None)
  --depth DEPTH         Minimum number of reads for a variant (default: 10)
  --quality QUALITY     Minimum average genotype quality for a variant (default: 10)
  --processes PROCESSES
                        Number of processes. I.e., how many CPUs are available? ~1GB RAM required per process (default: 1)
  --reference_genome {GRCh37,GRCh38}
                        GRCh37/8 (default: None)
  --phased              Use phase information (default: False)
  --microarray          Input is from a microarray. (default: False)
  --debug               Enable debug logging. If not set, logging will be at info level. (default: False)
  --validate            Enable VCF validation. Doubles run time. Might help you identify input issues (default: False)
  --PDFs                Generate a per sample PDF report (default: False)
  --HPAs                Generate results for HPA (default: False)
  --RH                  Generate results for RHD and RHCE. WARNING! Long read only! (default: False)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rbceq2-2.4.1.tar.gz (385.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rbceq2-2.4.1-py3-none-any.whl (317.3 kB view details)

Uploaded Python 3

File details

Details for the file rbceq2-2.4.1.tar.gz.

File metadata

  • Download URL: rbceq2-2.4.1.tar.gz
  • Upload date:
  • Size: 385.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.11

File hashes

Hashes for rbceq2-2.4.1.tar.gz
Algorithm Hash digest
SHA256 9f97a1ab0d4369fc5a806f46e76d6c3b9cd6432239d3933e334e7d8ad4e03b16
MD5 ffffd47ca4cdf095dbc8186321c64dd7
BLAKE2b-256 7f6822e2481b25a044aa6ccef0aabd383f5df1508e4a27c9058f7dbf109ef1e1

See more details on using hashes here.

File details

Details for the file rbceq2-2.4.1-py3-none-any.whl.

File metadata

  • Download URL: rbceq2-2.4.1-py3-none-any.whl
  • Upload date:
  • Size: 317.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.11

File hashes

Hashes for rbceq2-2.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 4f5865815dceec98d456930e7bb2cebb03449b8bfec7859ace72b077decbe864
MD5 9895b12f4055444ea23d2431444b81c0
BLAKE2b-256 e16cee86a54359e74cb7869919c9f741cad4dac067ffe4de712ef427e540d88b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page