cgMLST analysis tool
Project description
cvmcgmlst
cvmcgmlst is a tool developed based on the cvmmlst for core genome MLST analysis .
Usage: cvmcgmlst -i <genome assemble directory> -o <output_directory> -db database_name
Author: Qingpo Cui(SZQ Lab, China Agricultural University)
options:
-h, --help show this help message and exit
-i I <input_file>: the PATH of assembled genome file
-db DB <database_path>: name of cgMLST database
-o O <output_directory>: output PATH
-minid MINID <minimum threshold of identity>, default=95
-mincov MINCOV <minimum threshold of coverage>, default=90
-t T <number of threads>: default=8
-v, --version Display version
cvmcgmlst subcommand:
{show_db,init,create_db}
show_db <show the list of all available database>
init <initialize the reference database>
create_db <add custome database, use cvmcgmlst createdb -h for help>
Installation
Using pip
pip3 install cvmcgmlst
Dependency
- BLAST+ >2.7.0
you should add BLAST in your PATH
Blast installation
Windows
Following this tutorial: Add blast into your windows PATH
Linux/Mac
The easyest way to install blast is:
conda install -c bioconda blast
Usage
1. Create reference cgmlst database
Users could create their own core genome database. All you need is a FASTA file of nucleotide sequences. The sequence IDs should have the format >locus_allelenumber, where LOCUS is the loci name, ALLELENUMBER is the number of this allele. The curated core genome fasta file should like this:
>GBAA_RS00015_1
TTGGAAAACATCTCTGATTTATGGAACAGCGCCTTAAAAGAACTCGAAAAAAAGGTCAGT
AAACCAAGTTATGAAACATGGTTAAAATCAACAACCGCACATAATTTAAAGAAAGATGTA
AAGTCAGTTGCCTTTCCTCGCCAAATTGCAATGTATTTGTCACGCGAACTGACAGATTCC
TCCTTACCTAAAATAGGTGAAGAATTTGGTGGACGTGATCATACAACCGTTATCCATGCC
CATGAAAAAATTTCTAAGCTACTTAAGACGGATACGCAATTACAAAAACAAGTTGAAGAA
ATTAACGATATTTTAAAGTAG
>GBAA_RS00015_2
TTGGAAAACATCTCTGATTTATGGAACAGCGCCTTAAAAGAACTCGAAAAAAAGGTCAGT
AAACCAAGTTATGAAACATGGTTAAAATCAACAACCGCACATAATTTAAAGAAAGATGTA
TTAACAATTACGGCTCCAAATGAATTCGCCCGTGATTGGTTAGAATCTCATTATTCAGAG
CTAATTTCGGAAACACTTTATGATTTAACGGGGGCAAAATTAGCTATTCGCTTTATTATT
GCTAAAGCATATAATCCCCTCTTTATTTATGGGGGAGTTGGACTTGGAAAAACCCATTTA
>GBAA_RS00015_3
ATGCTTTATATCGCAAATCAAATCGATTCAAATATTCGTGAACTAGAAGGTGCACTCATC
CGCGTTGTAGCTTATTCATCTTTAATTAACAAGGATATTAATGCTGATTTAGCAGCTGAA
AAAGCTGTTGGAGATGTTTATCAAGTAAAATTAGAAGATTTCAAGGCGAAAAAGCGCACA
AAGTCAGTTGCCTTTCCTCGCCAAATTGCAATGTATTTGTCACGCGAACTGACAGATTCC
CATGAAAAAATTTCTAAGCTACTTAAGACGGATACGCAATTACAAAAACAAGTTGAAGAA
ATTAACGATATTTTAAAGTAG
...
After finish installation, you should first initialize the reference database using following command
cvmcgmlst create_db -file YOUR_REF.fasta -name DBNAME
2. Show available database
You could list all available databases using the show_db subcommand.
cvmcgmlst show_db
The shell will print available databases
| DB_name | No. of seqs | Update_date |
|---|---|---|
| demo | 1 | 2025-02-25 |
| DBNAME | Number of locus | Date |
Run with your genome
# Single Genome Mode
cvmcgmlst -i /PATH_TO_ASSEBLED_GENOME/sample.fa -db DBNAME -o PATH_TO_OUTPUT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file cvmcgmlst-0.2.5.tar.gz.
File metadata
- Download URL: cvmcgmlst-0.2.5.tar.gz
- Upload date:
- Size: 6.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.32.3 requests-toolbelt/1.0.0 urllib3/1.26.7 tqdm/4.66.5 importlib-metadata/4.10.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.6 CPython/3.9.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2fe05ea39b4e15a032ff1c4ce7ad0dddcf3f7d428d9349389bceba517a9d91bc
|
|
| MD5 |
ca6108dc970564070d94d14502d68b46
|
|
| BLAKE2b-256 |
263616db1e6cb6cb8fb96790341861872b6726fd888d92e045a1bdc52c69321d
|
File details
Details for the file cvmcgmlst-0.2.5-py3-none-any.whl.
File metadata
- Download URL: cvmcgmlst-0.2.5-py3-none-any.whl
- Upload date:
- Size: 8.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.32.3 requests-toolbelt/1.0.0 urllib3/1.26.7 tqdm/4.66.5 importlib-metadata/4.10.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.6 CPython/3.9.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
628aeebb2bffe73066b3a4714da21555f08f4767b247ca155316a336ff29fb02
|
|
| MD5 |
62bf8c2dc5b8893d350cc4f489d24cd5
|
|
| BLAKE2b-256 |
fc692ca18b3ee9081db7b9b0ffeafac98959f90e25e059671e71c916a04ed68d
|