Apclusterv: Clustering viral genomes with Affinity Propagation
Apclusterv is a clustering tool for viral genomes. The input genomes can be either complete genomes or contigs from a metagenomic assembly. The program is based on protein-protein alignment and is written in Python.
The current stable version is 1.2.4.
Dependencies:
python>=3.8
pandas
numpy
networkx >= 2.8.4
scipy >=1.8.1
scikit-learn >= 1.1.2
MCL
diamond >= 0.9.14
prodigal >= 2.6.3
R>=3.6.1
Installation:
If you are working in a conda environment, install MCL, prodigal (for ORF prediction), diamond (for alignment), and R (if it is not already installed; only the stats library from r-base is needed):
conda install diamond -c bioconda
conda install mcl -c bioconda
conda install prodigal -c bioconda
conda install r-base
pip install apclusterv==1.2.4
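After installation, it can help to confirm that the external programs are reachable before starting a run. The following is a minimal sketch, not part of apclusterv: the function name missing_tools is hypothetical, and it assumes the executables are installed under their usual names (diamond, mcl, prodigal, and Rscript from r-base).

```python
import shutil

# Executable names as installed by the conda packages listed above.
# Rscript ships with r-base; adjust the list if your setup differs.
REQUIRED_TOOLS = ["diamond", "mcl", "prodigal", "Rscript"]


def missing_tools(tools=REQUIRED_TOOLS):
    """Return the names in `tools` that cannot be found on PATH."""
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("All external tools were found.")
```

This only checks that each program exists on PATH; it does not verify the minimum versions listed under Dependencies.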
Getting Started:
option 1. start with contigs
step1. predict ORFs from the DNA file with the following command:
prepare contig_dna_fasta
(contig_dna_fasta is the path to the DNA sequences to be clustered)
step2. execute clustering with the following command:
apclusterv -contig contig_dna_fasta
option 2. start with proteins. If you already have protein sequences predicted from the contigs, you can run apclusterv with the protein file and a protein-contig map file.
Examples of the protein file and the map file are data/experiment1.faa and data/experiment1.csv:
apclusterv -protein experiment1.faa -csv experiment1.csv
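If the proteins were predicted with prodigal, a protein-contig map can be derived from the protein IDs, because prodigal names each protein as the contig ID followed by an underscore and a gene index. The sketch below illustrates that idea only. The two-column layout and the header names protein_id and contig_id are assumptions, as are the function names; compare with data/experiment1.csv for the format apclusterv actually expects.

```python
import csv


def protein_to_contig(protein_id):
    """Strip the trailing "_<gene index>" that prodigal appends to the contig ID."""
    contig, _, _ = protein_id.rpartition("_")
    return contig


def write_map(faa_path, csv_path):
    """Write one (protein, contig) row per FASTA header; return the row count."""
    rows = 0
    with open(faa_path) as faa, open(csv_path, "w", newline="") as out:
        writer = csv.writer(out)
        # Assumed header; the real map file may use different column names.
        writer.writerow(["protein_id", "contig_id"])
        for line in faa:
            if not line.startswith(">"):
                continue  # sequence line, not a header
            fields = line[1:].split()
            if not fields:
                continue  # header without an ID
            protein_id = fields[0]
            writer.writerow([protein_id, protein_to_contig(protein_id)])
            rows += 1
    return rows
```

For example, write_map("experiment1.faa", "my_map.csv") would map a protein named k141_7_1 to the contig k141_7. Protein files produced by other gene callers may follow a different naming scheme.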
Help message and parameter setting
apclusterv -h
Results
The program creates a tmp/ directory. The clustering result is tmp/cluster_result.i.r.csv (cluster_result.3.4.csv by default).
The simulation profile used in the manuscript is data/profile.csv.
The script for evaluating RI and ARI is data/eval.py.
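For readers unfamiliar with the two metrics, the Rand index (RI) is the fraction of genome pairs on which two clusterings agree, and the adjusted Rand index (ARI) corrects that value for chance agreement. The following is an independent, standard-library sketch of the pair-counting formulas. It does not reproduce data/eval.py, and the function name rand_indices and the plain label lists are illustrative assumptions.

```python
from collections import Counter
from math import comb


def rand_indices(truth, pred):
    """Return (RI, ARI) for two equal-length sequences of cluster labels."""
    if len(truth) != len(pred):
        raise ValueError("label sequences must have the same length")
    total = comb(len(truth), 2)  # number of item pairs
    if total == 0:
        return 1.0, 1.0  # fewer than two items: trivially identical

    # Pairs placed together in both clusterings, in truth only, in pred only.
    joint = sum(comb(c, 2) for c in Counter(zip(truth, pred)).values())
    same_truth = sum(comb(c, 2) for c in Counter(truth).values())
    same_pred = sum(comb(c, 2) for c in Counter(pred).values())

    # Pairs separated in both clusterings (inclusion-exclusion).
    apart_both = total - same_truth - same_pred + joint
    ri = (joint + apart_both) / total

    # ARI: (index - expected index) / (max index - expected index).
    expected = same_truth * same_pred / total
    max_index = (same_truth + same_pred) / 2
    ari = 1.0 if max_index == expected else (joint - expected) / (max_index - expected)
    return ri, ari
```

Both scores depend only on which items are grouped together, so relabeling the clusters (for example swapping cluster 0 and cluster 1) leaves them unchanged.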