A complete suite for gene-by-gene schema creation and strain identification.

These details have been verified by PyPI

Maintainers

cimendes jacarrico mickaelsilva pcerqueira rfm

These details have not been verified by PyPI

Project links

GitHub Statistics

Project description

chewBBACA

chewBBACA stands for "BSR-Based Allele Calling Algorithm". The "chew" part could be thought of as "Comprehensive and Highly Efficient Workflow" but at this point it still needs a bit of work to make that claim, so we just add "chew" to add extra coolness to the software name. BSR stands for BLAST Score Ratio as proposed by Rasko DA et al.

chewBBACA is a comprehensive pipeline including a set of functions for the creation and validation of whole genome and core genome MultiLocus Sequence Typing (wg/cgMLST) schemas, providing an allele calling algorithm based on Blast Score Ratio that can be run in multiprocessor settings and a set of functions to visualize and validate allele variation in the loci. chewBBACA performs the schema creation and allele calls on complete or draft genomes.

Check the documentation for implementation details and guidance on using chewBBACA.

News

3.3.3 - 2024-02-23

Fixed warning related with BLASTp --seqidlist parameter. For BLAST>=2.9, the TXT file with the sequence IDs is converted to binary format with blastdb_aliastool.
The Bio.Application modules are deprecated and might be removed from future Biopython versions. Modified the function that calls MAFFT so that it uses the subprocess module instead of Bio.Align.Applications.MafftCommandline. Changed the Biopython version requirement to >=1.79.
Added a pyproject.toml configuration file and simplified the instructions in setup.py. The use of setup.py as a command line tool is deprecated and the pyproject.toml configuration file allows to install and build packages through the recommended method.
Updated the Dockerfile to install chewBBACA with python3 -m pip install . instead of the deprecated python setup.py install command.
Removed FASTA header integer conversion before running BLASTp. This was done to avoid a warning from BLAST related to sequence header length exceeding 50 characters.
The seqids and coordinates of the CDSs closest to contig tips are stored in a dictionary during gene prediction to simplify LOTSC and PLOT5/3 determination (in many cases this reduces runtime by ~20%).
Limited the number of values stored in memory while creating the results_contigsInfo.tsv and results_alleles.tsv output files to reduce memory usage.
Adding data to the FASTA and TSV files for the missing classes per locus instead of storing the complete per input data to reduce memory usage.
The data for novel alleles is saved to files to reduce memory usage.
Fixed the in-frame stop codon count values displayed in the reports created by the SchemaEvaluator module.
The UniprotFinder module now exits cleanly if the output directory already exists.
Improved info printed to the stdout by the CreateSchema and AlleleCall modules, added comments, and changed variable names to better match data being stored.

Check our Changelog to learn about the latest changes.

Citation

When using chewBBACA, please use the following citation:

Silva M, Machado MP, Silva DN, Rossi M, Moran-Gilad J, Santos S, Ramirez M, Carriço JA. 2018. chewBBACA: A complete suite for gene-by-gene schema creation and strain identification. Microb Genom 4:000166. doi:10.1099/mgen.0.000166

Project details

These details have been verified by PyPI

Maintainers

cimendes jacarrico mickaelsilva pcerqueira rfm

These details have not been verified by PyPI

Project links

GitHub Statistics

Release history Release notifications | RSS feed

3.3.8

Jul 2, 2024

3.3.7

Jun 27, 2024

3.3.6

Jun 12, 2024

3.3.5

Apr 18, 2024

3.3.4

Mar 22, 2024

This version

3.3.3

Feb 23, 2024

3.3.2

Jan 16, 2024

3.3.1

Nov 3, 2023

3.3.0

Oct 11, 2023

3.2.0

Apr 27, 2023

3.1.2

Mar 13, 2023

3.1.1

Mar 9, 2023

3.1.0

Jan 16, 2023

3.0.0

Dec 14, 2022

2.8.5

Jul 5, 2021

2.8.4

Apr 27, 2021

2.8.3

Apr 27, 2021

2.7.0

Mar 15, 2021

2.6.0

Feb 8, 2021

2.5.6

Nov 10, 2020

2.5.5

Sep 15, 2020

2.5.4

Aug 10, 2020

2.1.0

Dec 6, 2019

2.0.17.2

Sep 2, 2019

2.0.17.1

Jun 18, 2019

2.0.17

Feb 9, 2019

2.0.16

Nov 17, 2018

2.0.15

Nov 6, 2018

2.0.14

Nov 6, 2018

2.0.13

Sep 18, 2018

2.0.12

Aug 1, 2018

2.0.11

Jun 5, 2018

2.0.10

May 21, 2018

2.0.9

Apr 11, 2018

2.0.8

Feb 26, 2018

2.0.7

Feb 23, 2018

2.0.6

Feb 22, 2018

2.0.5

Feb 14, 2018

2.0.4

Feb 5, 2018

2.0.3

Feb 2, 2018

2.0.2

Feb 2, 2018

2.0.1

Feb 2, 2018

2.0.0

Feb 2, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chewBBACA-3.3.3.tar.gz (10.9 MB view hashes)

Uploaded Feb 23, 2024 Source

Built Distribution

chewBBACA-3.3.3-py3-none-any.whl (10.9 MB view hashes)

Uploaded Feb 23, 2024 Python 3

Hashes for chewBBACA-3.3.3.tar.gz

Hashes for chewBBACA-3.3.3.tar.gz
Algorithm	Hash digest
SHA256	`c29e34bdca769b38f8e9c7f7bd3547563f9e1e1effe5266e7b00674d9a7ca12a`
MD5	`b923af85588591d753830ceb23488bd6`
BLAKE2b-256	`6b41137d5fcc3e173fa7e78aeec06424e2dfe6288f75abb1d479389b109ab6b0`

Hashes for chewBBACA-3.3.3-py3-none-any.whl

Hashes for chewBBACA-3.3.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6eee0f4046c6f21f0a2ee7cbb9e63f92f8bde5694bbee10f9ec58a33c10099f7`
MD5	`be760b11b9b7f42037e8f310fbbc9c1e`
BLAKE2b-256	`9e0540df1ddf85d9af0af89a884d49f98b4827d2adab21533a1c155f09e66841`