Skip to main content

cgMLST analysis tool

Project description

cvmcgmlst

PYPI Static Badge

cvmcgmlst is a tool developed based on the cvmmlst for core genome MLST analysis .

Usage: cvmcgmlst -i <genome assemble directory> -o <output_directory> -db database_name

Author: Qingpo Cui(SZQ Lab, China Agricultural University)

options:
  -h, --help            show this help message and exit
  -i I                  <input_file>: the PATH of assembled genome file
  -db DB                <database_path>: name of cgMLST database
  -o O                  <output_directory>: output PATH
  -minid MINID          <minimum threshold of identity>, default=95
  -mincov MINCOV        <minimum threshold of coverage>, default=90
  -t T                  <number of threads>: default=8
  -v, --version         Display version

cvmcgmlst subcommand:
  {show_db,init,create_db}
    show_db             <show the list of all available database>
    init                <initialize the reference database>
    create_db           <add custome database, use cvmcgmlst createdb -h for help>

Installation

Using pip

pip3 install cvmcgmlst

Dependency

  • BLAST+ >2.7.0

you should add BLAST in your PATH

Blast installation

Windows

Following this tutorial: Add blast into your windows PATH

Linux/Mac

The easyest way to install blast is:

conda install -c bioconda blast

Usage

1. Create reference cgmlst database

Users could create their own core genome database. All you need is a FASTA file of nucleotide sequences. The sequence IDs should have the format >locus_allelenumber, where LOCUS is the loci name, ALLELENUMBER is the number of this allele. The curated core genome fasta file should like this:

>GBAA_RS00015_1
TTGGAAAACATCTCTGATTTATGGAACAGCGCCTTAAAAGAACTCGAAAAAAAGGTCAGT
AAACCAAGTTATGAAACATGGTTAAAATCAACAACCGCACATAATTTAAAGAAAGATGTA
AAGTCAGTTGCCTTTCCTCGCCAAATTGCAATGTATTTGTCACGCGAACTGACAGATTCC
TCCTTACCTAAAATAGGTGAAGAATTTGGTGGACGTGATCATACAACCGTTATCCATGCC
CATGAAAAAATTTCTAAGCTACTTAAGACGGATACGCAATTACAAAAACAAGTTGAAGAA
ATTAACGATATTTTAAAGTAG
>GBAA_RS00015_2
TTGGAAAACATCTCTGATTTATGGAACAGCGCCTTAAAAGAACTCGAAAAAAAGGTCAGT
AAACCAAGTTATGAAACATGGTTAAAATCAACAACCGCACATAATTTAAAGAAAGATGTA
TTAACAATTACGGCTCCAAATGAATTCGCCCGTGATTGGTTAGAATCTCATTATTCAGAG
CTAATTTCGGAAACACTTTATGATTTAACGGGGGCAAAATTAGCTATTCGCTTTATTATT
GCTAAAGCATATAATCCCCTCTTTATTTATGGGGGAGTTGGACTTGGAAAAACCCATTTA
>GBAA_RS00015_3
ATGCTTTATATCGCAAATCAAATCGATTCAAATATTCGTGAACTAGAAGGTGCACTCATC
CGCGTTGTAGCTTATTCATCTTTAATTAACAAGGATATTAATGCTGATTTAGCAGCTGAA
AAAGCTGTTGGAGATGTTTATCAAGTAAAATTAGAAGATTTCAAGGCGAAAAAGCGCACA
AAGTCAGTTGCCTTTCCTCGCCAAATTGCAATGTATTTGTCACGCGAACTGACAGATTCC
CATGAAAAAATTTCTAAGCTACTTAAGACGGATACGCAATTACAAAAACAAGTTGAAGAA
ATTAACGATATTTTAAAGTAG
...

After finish installation, you should first initialize the reference database using following command

cvmcgmlst create_db -file YOUR_REF.fasta -name DBNAME

2. Show available database

You could list all available databases using the show_db subcommand.

cvmcgmlst show_db

The shell will print available databases

DB_name No. of seqs Update_date
demo 1 2025-02-25
DBNAME Number of locus Date

Run with your genome

# Single Genome Mode
cvmcgmlst -i /PATH_TO_ASSEBLED_GENOME/sample.fa -db DBNAME -o PATH_TO_OUTPUT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cvmcgmlst-0.2.6.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cvmcgmlst-0.2.6-py3-none-any.whl (8.2 kB view details)

Uploaded Python 3

File details

Details for the file cvmcgmlst-0.2.6.tar.gz.

File metadata

  • Download URL: cvmcgmlst-0.2.6.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.32.3 requests-toolbelt/1.0.0 urllib3/1.26.7 tqdm/4.66.5 importlib-metadata/4.10.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.6 CPython/3.9.8

File hashes

Hashes for cvmcgmlst-0.2.6.tar.gz
Algorithm Hash digest
SHA256 f6d41666bcdad76cb4d2b5983fa4c24d081e67e58c4f3cb66b71e5cafed39a8a
MD5 29aa0854bad0f69ef85914164b22adee
BLAKE2b-256 17a9b99790e6fd4267c9380d33fa8bba739a098eec5e80cede946b0ff6394851

See more details on using hashes here.

File details

Details for the file cvmcgmlst-0.2.6-py3-none-any.whl.

File metadata

  • Download URL: cvmcgmlst-0.2.6-py3-none-any.whl
  • Upload date:
  • Size: 8.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.32.3 requests-toolbelt/1.0.0 urllib3/1.26.7 tqdm/4.66.5 importlib-metadata/4.10.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.6 CPython/3.9.8

File hashes

Hashes for cvmcgmlst-0.2.6-py3-none-any.whl
Algorithm Hash digest
SHA256 4a45c5e4e700616a629e351f2d38bd1f636d362be88f2311b0c29f7bc7adea7d
MD5 0667aec51ff857b788d8c34b9349ed1a
BLAKE2b-256 e152f6672586835c46200af4f56dc08462a029410d55de343051a12324454942

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page