Create interactive comparative sequencing coverage plots from virus sequencing data.
Project description
wgscovplot
wgscovplot generates interactive comparative sequencing coverage plots in self-contained, offline-friendly HTML files with optional annotation of variant calling results, PCR amplicon coverage and genetic features.
Installation
PyPI
Install from PyPI with pip
pip install wgscovplot
If the installation was successful, you should be able to type wgscovplot --help
and get a help message on how to use the tool.
Source
Use pip
to install from source:
# optionally, create a virtual environment
python -m venv venv
source venv/bin/activate
# install from GitHub repo
pip install git+https://github.com/nhhaidee/wgscovplot.git
# run wgscovplot
wgscovplot --help
wgscovplot /path/to/results_folder
Features
- Easy-to-use: Simply provide a Nextflow output directory containing and
wgscovplot
will figure out what files it needs to generate its interactive sequencing coverage plots- Compatible workflows:
- nf-core/viralrecon
- [CFIA-NCFAD/nf-virontus]
- CFIA-NCFAD/nf-flu
- Compatible workflows:
- Fully-interactive plots featuring:
- Zoom, scroll, pan, select regions of interest
- Informative tooltips highlighting variant calling results and coverage statistics across all samples being shown
- Change the y-axis scale to linear or log scale
- Select which samples to show (and which Influenza gene segments to show)
- Highlight regions of interest (e.g. genetic features, primer/probe binding sites, low coverage regions)
- Annotate coverage plots with variant calling results from multiple different variant callers and variant effect results from SnpEff/SnpSift
- Supported variant callers: iVar, Nanopolish, Longshot, Medaka, Clair3
- Compare sequencing coverage across multiple samples
Usage
Basic usage will output a wgscovplot.html
file in the current directory:
wgscovplot /path/to/results_folder
Show help info with $ wgscovplot --help
:
Usage: wgscovplot [OPTIONS] INPUT_DIR
╭─ Arguments ──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ * input_dir PATH Directory containing Mosdepth and variant calling results from sequence analysis. For example, the output directory from execution of the nf-core/viralrecon or CFIA-NCFAD/nf-flu Nextflow workflow │
│ [default: None] │
│ [required] │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Options ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ --output-html -o PATH wgscovplot HTML output file [default: wgscovplot.html] │
│ --primers-fasta -p PATH FASTA file containing real-time PCR primer/probe sequences. [default: None] │
│ --low-coverage-threshold -l INTEGER Low sequencing coverage threshold. [default: 10] │
│ --edit-distance -d INTEGER The maximum differences or 'edits' allowed between real-time PCR primer/probe sequences and the sample sequences. [default: 0] │
│ --compress-depths --no-compress-depths Compress coverage depth arrays? [default: compress-depths] │
│ --verbose -v Verbose logs │
│ --force -f Force overwrite of existing output files │
│ --version --no-version Print wgscovplot version and exit [default: no-version] │
│ --install-completion [bash|zsh|fish|powershell|pwsh] Install completion for the specified shell. [default: None] │
│ --show-completion [bash|zsh|fish|powershell|pwsh] Show completion for the specified shell, to copy it or customize the installation. [default: None] │
│ --help Show this message and exit. │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
wgscovplot version 0.3.0; Python 3.11.6
Dependencies
- Python (>=3.9)
- Typescript/Javascript
- Echarts for performant generating interactive plots
- SolidJS for reactive UI components
- Vite for TS/JS dev and building bundle
- Tailwind CSS for styling
Development
This project has two main components: a Python "backend" (CLI that spits out a templated HTML with built JS embedded) and a Javascript frontend.
Python backend development is done in the wgscovplot
directory.
Web frontend development is done in the web
directory. The frontend is built with Vite, SolidJS and ECharts.
Environment
Python development is recommended with PyCharm. Jetbrains IDEs have great support for Python development and virtual environments.
Jetbrains IDEs work great for Typescript/Javascript development as well, but any editor will do if you have Vite live reload enabled.
Setup
# clone repo
git clone https://github.com/nhhaidee/wgscovplot.git
cd wgscovplot
# optionally, create a virtual environment
python -m venv venv
source venv/bin/activate
# install dev dependencies
pip install hatch
# start shell with Hatch
hatch shell
# run linting with Hatch
hatch run lint:all
# run tests with Hatch
hatch run cov
Frontend development
See web/README.md for more details.
Authors
- Development Lead: Peter Kruczkiewicz
- Software Developer: Hai Nguyen
License
Copyright 2024 Canadian Food Inspection Agency of Canada, Government of Canada.
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this work except in compliance with the License. You may obtain a copy of the License at:
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file wgscovplot-1.0.0.tar.gz
.
File metadata
- Download URL: wgscovplot-1.0.0.tar.gz
- Upload date:
- Size: 319.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/4.0.2 CPython/3.11.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6385c8fc4b6c4b4b15b48f3b4d0e5e634b6f9fd8fc74ce9733f7be0c15bddad1 |
|
MD5 | ed6f347170447e2f260fe96c597e1f18 |
|
BLAKE2b-256 | ef57bf55f18c9934d501a2ca40816f507076de0689d2160066d0475e97bdaa49 |
Provenance
File details
Details for the file wgscovplot-1.0.0-py3-none-any.whl
.
File metadata
- Download URL: wgscovplot-1.0.0-py3-none-any.whl
- Upload date:
- Size: 325.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/4.0.2 CPython/3.11.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 13462efb1c3ed012999a49f8f92f7ec5af32c164f6a3741201d1d69166398fe3 |
|
MD5 | da54f776b902ea58355a67309d0a29b5 |
|
BLAKE2b-256 | 250729df02260a8c2f80e8e0ececaf3b71a96128d46ca22f0dae4556c40ff181 |