Skip to main content

ANOSPP data analysis

Project description

anospp-analysis

Python package for ANOSPP data analysis

ANOSPP is the multiplexed amplicon sequencing assay for Anopheles mosquito species identification and Plasmodium detection. This repository contains the code for analysis of the sequencing results pre-processed with nf-core ampliseq pipeline.

Installation

For latest released version

conda install -c bioconda anospp-analysis

For development setup, see instructions below

Usage

Key analysis steps are implemented as standalone scripts:

  • anospp-prep takes DADA2 output files and targets primer sequences, demultiplexes the amplicons and yields haplotypes table
  • anospp-qc takes haplotypes table, DADA2 stats table and samples manifest and produces QC plots
  • anospp-plasm blasts Plasmodium sequences against reference dataset to determine species and infer sample infection status
  • anospp-nn compares k-mer profiles of mosquito targets against a reference dataset and provides probabilistic species calls
  • anospp-vae provides finer scale species prediction for An. gambiae complex with VAE projection
  • anospp-agg combines all results into a single table

Development

Setup

Installation is hybrid with conda + poetry:

git clone git@github.com:malariagen/anospp-analysis.git
cd anospp-analysis
git checkout dev
conda env create -f environment.yml
conda activate anospp_analysis_dev
poetry install

Usage & testing

The code in this repository can be accessed via wrapper scripts:

anospp-qc \
    --haplotypes test_data/haplotypes.tsv \
    --samples test_data/samples.csv \
    --stats test_data/stats.tsv \
    --outdir test_data/qc

Besides, individual components are available as a python API:

$ python
>>> from anospp_analysis.util import *
>>> PLASM_TARGETS
['P1', 'P2']

Adding Python deps

Introducing python dependencies should be done via poetry:

poetry add package_name

This should update both pyproject.toml and poetry.lock files

If the package should be used in development environment only, use

poetry add package_name --dev

To update environment after changes made to pyproject.toml and/or poetry.lock

poetry install

Adding non-Python deps

Introducing non-python dependencies should be done via conda: edit environment.yml, then re-create the conda environment and poetry deps:

conda env create -f environment.yml
conda activate anospp_analysis
poetry install

If changes in conda environment introduce changes to the python installation, one should update poetry lock file

poetry lock

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

anospp_analysis-0.5.0.tar.gz (43.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

anospp_analysis-0.5.0-py3-none-any.whl (48.6 kB view details)

Uploaded Python 3

File details

Details for the file anospp_analysis-0.5.0.tar.gz.

File metadata

  • Download URL: anospp_analysis-0.5.0.tar.gz
  • Upload date:
  • Size: 43.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.4 CPython/3.10.20 Darwin/25.4.0

File hashes

Hashes for anospp_analysis-0.5.0.tar.gz
Algorithm Hash digest
SHA256 259529d9ea49d2bbe1bdb465705601181fc93956903b7dd4fc33c6de61582da8
MD5 2468daa3b812f033f97bddb0c6378d15
BLAKE2b-256 2a00a980f663d7bb18df8a30597e015679fa794fa3eb16bcf2658718f723e639

See more details on using hashes here.

File details

Details for the file anospp_analysis-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: anospp_analysis-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 48.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.4 CPython/3.10.20 Darwin/25.4.0

File hashes

Hashes for anospp_analysis-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 85a938b693d8dfd00bb266a4d56b165d2583c9a0d040e2049e183e2d8e9b8d59
MD5 287f178d900ec76bc1509d5bee7098a2
BLAKE2b-256 74e61f7afcd44686e519535bc03974a54f9ecab64667a6868e0162f67bd6c3bc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page