Classify C. Liberibacter solanacearum haplotypes
Project description
Haplotype-LSO
Haplotype assignment of Candidatus Liberibacter solanacearum following IPPC (International Plant Protection Convention) standard DP 21: Candidatus Liberibacter solanacearum.
Quick Facts
License: MIT
Programming Language Python
Input / Output - What it Does!
This program takes as the input Sanger sequences from the 16S, 16S-23S, and 50S primers from the IPPC standard DP21. It then aligns them to the GenBank reference sequences EU812559 and EU834131 (as specified in DP21). Based on the alignments and the document DP21, sequence identity is computed and haplotyping is performed, yielding:
sequence identity to EU822559 for identifying the species C. Liberibacter solanacearum, and
haplotyping of the read based on variation from the reference sequence.
Sample names can be inferred from the read names or from a separate mapping TSV file.
Quick Start
This is gonna be really quick!
Installation
We recommend using Bioconda.
First install Bioconda. Then (clsify is the old package name of Haplotype-Lso and it will be renamed soon):
# conda install -y clsify
And – tadaa – you’re ready to go!
Running
You can have one FASTA (or FASTQ) file with all of your reads or one file for each. If you have a single sequence per FASTA (or FASTQ) file then you can use the file name instead of the sequence name.
# hlso -o result.tsv INPUT.fasta
## OR
# hlso [--use-file-name] -o result.tsv INPUT1.fasta INPUT2.fasta [...]
## e.g.,
# hlso [--use-file-name] -o result.tsv INPUT*.fasta
Developer Guide
Releasing Packages
For the PyPi package:
$ python setup.py sdist
$ twine upload --repository-url https://test.pypi.org/legacy/ dist/hlso-*.tar.gz
$ twine upload dist/hlso-*.tar.gz
For the Bioconda package, see the great documentation. The Docker image will automatically be created as a BioContainer when the Bioconda package is built.
History
v0.4.4
Fix file conversion problem.
v0.4.3
Fixing issue with AB1/SCF support.
v0.4.2
Fix for multi-region sequences.
Fixing collision, variant must be identified by sequence, position, and reference.
v0.4.1
Fixing bug with display of BLAST match.
Adding link-out to NCBI WWWBLAST submission.
Removing some cruft from repository.
Fixing bug in phylogenetic coputation in case of BLAST all-to-all matches not a square number.
v0.4.0
Compute region-wise phylogenetic tree.
v0.3.4
Support for uploading to PyPi.
v0.3.3
Fixing regular expression.
v0.3.2
Changing file name pattern do dot-separated scheme.
Greatly extending documentation.
v0.3.1
Starting out with tutorial and manual.
v0.3.0
Adding tutorial.
Rebranding as “Haplotype-LSO” (hlso).
Properly normalizing indels according to Tan et al. (2015).
Adding support for haplotyping with indels.
v0.2.0
Removing dependency on bcftools. Haplotyping is done from BLAST match now.
Adding tests for blast module.
Rewrite of the whole BLAST and haplotyping interface and architecture.
v0.1.1
Zapping gremlins in haplotype table.
Change formatting of README.
v0.1.0
Initial release.
Everything is new!
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file hlso-0.4.4.tar.gz
.
File metadata
- Download URL: hlso-0.4.4.tar.gz
- Upload date:
- Size: 1.4 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.2.0.post20200210 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8cfda457e2f2512ce7d7baeba05ab9807266c8efcff93d6ff009550bba290ca8 |
|
MD5 | 06b89511945bcef1c6554d4602607439 |
|
BLAKE2b-256 | 9eb0d60b67e60386806ce414f5fbd888b70747cc76c7e3d70f311e2f646fadbd |