Species identification pipeline for both single species and metagenomic samples.
Project description
HAYSTAC: High-AccuracY and Scalable Taxonomic Assignment of MetagenomiC data
Introduction
Alignment based metagenomics
haystack
is an easy to use pipeline for metagenomic identifications
You can easily:
- Construct a database
- Prepare your sample for analysis, including trimming sequencing adapters, collapsing paired end reads and also downloading data from the SRA
- Perform species identification
- Perform a chemical damage pattern analysis
Setup
haystac
can be run on any unix based system. It needs the conda package manager to run and it easy to install.
Quick Start
Four commands/modules that allow you to run the method from start to finish
# install haystack from conda
conda install -c bioconda haystac
# build a target database of species you are interested in
haystac database \
--query '("Yersinia"[Organism] OR "Yersinia"[Organism]) AND "complete genome"[All Fields]' \
--output yersinia_db
# prepare a sample for analyis
haystac sample \
--sra SRR12157896 \
--collapse \
--output SRR12157896
# analyse a sample against a specific database
haystac analyse \
--mode abundances \
--database yersinia_db \
--sample SRR12157896 \
--output yersinia_SRR12157896
example commmands when I got them
License
MIT
Citation
Happening soon
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
haystac-0.1.tar.gz
(45.8 kB
view hashes)
Built Distribution
haystac-0.1-py2.py3-none-any.whl
(69.7 kB
view hashes)
Close
Hashes for haystac-0.1-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6f48cea72efb5fcb731db4c3eff518515d9d637d020e24c82db556881da477ea |
|
MD5 | 18019a1e12c6aa397ea7df136a710d00 |
|
BLAKE2b-256 | db22933d185e03576620c5e3770f23abf4145f72cc540f975428104b4b7abc95 |