Skip to main content

THAPBI Phytophthora ITS1 Classifier Tool (PICT).

Project description

THAPBI PICT on the Python Package Index (PyPI) THAPBI PICT on BioConda THAPBI PICT TravisCI build status THAPBI PICT CircleCI build status Documentation Status Code style: black

THAPBI Phytophthora ITS1 Classifier Tool (PICT)

THAPBI PICT an ITS1-based diagnostic/profiling tool from the UK BBSRC funded Tree Health and Plant Biosecurity Initiative (THAPBI) Phyto-Threats project, focused on identifying Phytophthora species present in Illumina sequenced environmental samples.

Phytophthora (from Greek meaning plant-destroyer) species are economically important plant pathogens, important in both agriculture and forestry. ITS1 is short for Internal Transcribed Spacer one, which is a region of eukaryotes genomes between the 18S and 5.8S rRNA genes. This is commonly used for molecular barcoding, where sequencing this short region can identify species.

This software repository continues earlier work including:

Installation

We recommend installing this tool on Linux using the Conda packaging system, via the BioConda channel, which will handle all the dependencies:

$ conda install thapbi_pict

The same should work on macOS, but the command line dependencies likely rule out using Windows directly.

Alternatively, since the software is on the Python Package Index (PyPI), the following command will install it along with its Python dependencies:

$ pip install thapbi_pict

However, in this case you will still need to install various external command line tools like hmmer, and others which are only used for some classifiers (like blast and swarm). If you have BioConda setup, use the following:

$ conda install blast cutadapt flash hmmer swarm trimmomatic

On a typical Linux system most of the tools listed will be available via the default distribution packages, although not always under the same package name.

On Debian (with the efforts of DebianMed), or Ubuntu Linux, try:

$ sudo apt-get install ncbi-blast+ cutadapt hmmer swarm trimmomatic

If you want to install the very latest unreleased code, you must download the source code from GitHub - see the CONTRIBUTING.md file for more details.

Once installed, you should be able to run the tool using:

$ thapbi_pict

This should automatically find the installed copy of the Python code. Use thapbi_pict -v to report the version, or thapbi_pict -h for help.

Release History

Version Date Notes
v0.2.4 2019-06-21 Synthetic controls in DB, fixed 3 Hyaloperonospora also in Peronospora.
v0.2.3 2019-06-18 Sample count rather than total read abundance for node size in edit-graph.
v0.2.2 2019-06-12 New edit-graph command for use with Cytoscape etc, or PDF via GraphViz.
v0.2.1 2019-05-27 Cope better with multiple (short) ITS1 fragments during classification.
v0.2.0 2019-05-14 Limit ITS1 length, 100 to 250bp. Exclude uncultured NCBI entries from DB.
v0.1.12 2019-05-09 Sort read-summary output by species. Set coloring group at command line.
v0.1.11 2019-05-06 Excel output from read-summary with formatting applied.
v0.1.10 2019-05-03 Tweaking command line API, renamed plate-summary to read-summary.
v0.1.9 2019-05-02 Implemented pipeline subcommand (prepare reads, classify, and report).
v0.1.8 2019-05-01 Standard errors for missing external tools; versions logged in verbose mode.
v0.1.7 2019-05-01 Changed default classifier method from 'identity' to more lienient 'onebp'.
v0.1.6 2019-04-30 Include ready to use binary ITS1 database in source tar-ball & wheel files.
v0.1.5 2019-04-29 Reworked optional metadata integration and its display in summary reports.
v0.1.4 2019-04-25 Sort samples using the optional metadata fields requested in reports.
v0.1.3 2019-04-24 Can optionally display sample metadata from TSV file in summary reports.
v0.1.2 2019-04-17 Keep searching if onebp classifier perfect match is at genus-level only.
v0.1.1 2019-04-16 Expand default taxonomy and database from Peronosporaceae to Peronosporales.
v0.1.0 2019-04-04 Include a bundled ITS1 database.
v0.0.15 2019-04-03 Support for genus-level only entries in the database.
v0.0.14 2019-04-01 MD5 in dump output. Fixed importing sequences failing taxonomic validation.
v0.0.13 2019-03-22 Remove conserved 32bp when primer trim. Assess at sample level by default.
v0.0.12 2019-03-11 Fixed bug in swarmid classifier.
v0.0.11 2019-03-08 Speed up FASTQ preparation by using flash instead of pear v0.9.6.
v0.0.10 2019-03-06 Replace primer code allowing only 1bp differences with cutadapt.
v0.0.9 2019-03-05 Looks for expected primers, discards mismatches. Caches HMM files locally.
v0.0.8 2019-02-21 Fix multi-class TN under-counting. New loss metric, swarmid classifier.
v0.0.7 2019-02-12 Added plate-summary command, onebp classifier.
v0.0.6 2019-02-07 Misc. cleanup and import fixes.
v0.0.5 2019-02-06 Hamming Loss in assessement output.
v0.0.4 2019-01-24 Added seq-import command, blast classifier, multi-taxon predictions.
v0.0.3 2019-01-22 Simplified generated filenames.
v0.0.2 2019-01-21 Added assess command.
v0.0.1 2019-01-17 Initial framework with identity and swarm classifiers.

Development Notes

Please see the CONTRIBUTING.md file for details of the development setup including Python style conventions, git pre-commit hook, continuous integration and test coverage, and release process.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

thapbi_pict-0.2.4.tar.gz (2.9 MB view details)

Uploaded Source

Built Distribution

thapbi_pict-0.2.4-py3-none-any.whl (1.1 MB view details)

Uploaded Python 3

File details

Details for the file thapbi_pict-0.2.4.tar.gz.

File metadata

  • Download URL: thapbi_pict-0.2.4.tar.gz
  • Upload date:
  • Size: 2.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.31.1 CPython/3.7.2

File hashes

Hashes for thapbi_pict-0.2.4.tar.gz
Algorithm Hash digest
SHA256 a3d59d36294cd401559a27314084cb03d8bdf67ec9a4d23150877488e4eefec3
MD5 a4386c2c1d490d9e3150d9d3f026cd8d
BLAKE2b-256 6e0a76ccb14fe7c86f8efbbd85d221d3f73ba74525d59c43207acff70c2c379f

See more details on using hashes here.

File details

Details for the file thapbi_pict-0.2.4-py3-none-any.whl.

File metadata

  • Download URL: thapbi_pict-0.2.4-py3-none-any.whl
  • Upload date:
  • Size: 1.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.31.1 CPython/3.7.2

File hashes

Hashes for thapbi_pict-0.2.4-py3-none-any.whl
Algorithm Hash digest
SHA256 a464cf2316e50916298e9a08c7e8f2f90fea8840abba61416606f4385a7f5997
MD5 79bc799a33c00b6b30930edeb0a00ac4
BLAKE2b-256 64a4d628bfda98654e13736ac034c4a58330e739a9febaa56f8556bfa90e84e8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page