ANOSPP data analysis
Project description
anospp-analysis
Python package for ANOSPP data analysis
ANOSPP is the multiplexed amplicon sequencing assay for Anopheles mosquito species identification and Plasmodium detection. This repository contains the code for analysis of the sequencing results pre-processed with nf-core ampliseq pipeline.
Installation
For latest released version
conda install -c bioconda anospp-analysis
For development setup, see instructions below
Usage
Key analysis steps are implemented as standalone scripts:
anospp-preptakes DADA2 output files and targets primer sequences, demultiplexes the amplicons and yields haplotypes tableanospp-qctakes haplotypes table, DADA2 stats table and samples manifest and produces QC plotsanospp-plasmblasts Plasmodium sequences against reference dataset to determine species and infer sample infection statusanospp-nncompares k-mer profiles of mosquito targets against a reference dataset and provides probabilistic species callsanospp-vaeprovides finer scale species prediction for An. gambiae complex with VAE projectionanospp-aggcombines all results into a single table
Development
Setup
Installation is hybrid with conda + poetry:
git clone git@github.com:malariagen/anospp-analysis.git
cd anospp-analysis
git checkout dev
conda env create -f environment.yml
conda activate anospp_analysis_dev
poetry install
Usage & testing
The code in this repository can be accessed via wrapper scripts:
anospp-qc \
--haplotypes test_data/haplotypes.tsv \
--samples test_data/samples.csv \
--stats test_data/stats.tsv \
--outdir test_data/qc
Besides, individual components are available as a python API:
$ python
>>> from anospp_analysis.util import *
>>> PLASM_TARGETS
['P1', 'P2']
Adding Python deps
Introducing python dependencies should be done via poetry:
poetry add package_name
This should update both pyproject.toml and poetry.lock files
If the package should be used in development environment only, use
poetry add package_name --dev
To update environment after changes made to pyproject.toml and/or poetry.lock
poetry install
Adding non-Python deps
Introducing non-python dependencies should be done via conda: edit environment.yml,
then re-create the conda environment and poetry deps:
conda env create -f environment.yml
conda activate anospp_analysis
poetry install
If changes in conda environment introduce changes to the python installation, one should update poetry lock file
poetry lock
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file anospp_analysis-0.5.0.tar.gz.
File metadata
- Download URL: anospp_analysis-0.5.0.tar.gz
- Upload date:
- Size: 43.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.4 CPython/3.10.20 Darwin/25.4.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
259529d9ea49d2bbe1bdb465705601181fc93956903b7dd4fc33c6de61582da8
|
|
| MD5 |
2468daa3b812f033f97bddb0c6378d15
|
|
| BLAKE2b-256 |
2a00a980f663d7bb18df8a30597e015679fa794fa3eb16bcf2658718f723e639
|
File details
Details for the file anospp_analysis-0.5.0-py3-none-any.whl.
File metadata
- Download URL: anospp_analysis-0.5.0-py3-none-any.whl
- Upload date:
- Size: 48.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.4 CPython/3.10.20 Darwin/25.4.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
85a938b693d8dfd00bb266a4d56b165d2583c9a0d040e2049e183e2d8e9b8d59
|
|
| MD5 |
287f178d900ec76bc1509d5bee7098a2
|
|
| BLAKE2b-256 |
74e61f7afcd44686e519535bc03974a54f9ecab64667a6868e0162f67bd6c3bc
|