Skip to main content

No project description provided

Project description

Apclusterv: Clustering viral genomes with Affinity Propagation

Apclusterv is a novel clustering software for viral genomes. The input genomes can be either complete genomes or contigs from metagenomic assembly. The program is based on protein-protein alignment and written in python
The current version is 1.1.2

Dependencies:

python>=3.7
pandas
numpy
networkx >= 2.8.4
scipy >=1.8.1
scikit-learn >= 1.1.2
MCL
diamond >= 0.9.34
prodigal >= 2.6.3
R>=3.6.1

Installation:

Suppose you are in a conda environment, you need to install MCL, prodigal (for ORF prediction),diamond (for alignment) and R(if not already installed, we just need stats library in r-base)

conda install diamond -c bioconda 
conda install mcl -c bioconda
conda install prodigal -c bioconda
conda install r-base 

pip install apclusterv==1.1.2

Getting Started:

option 1. start with contigs

step1. preduct ORFs from the DNA file with the following command:

prepare  contig_dna_fasta 

(contig_dna_fasta is the path to the dna sequences for clustering)

step2. execute clustering with the following command:

apclusterv -contig contig_dna_fasta 

option 2. if you already have protein sequences from the contigs, you can run apclusterv by the proteins and a protein-contig map file.

An example of protein file and mapfile are data/experiment1.faa and data/experiment1.csv

apclusterv -protein experiment1.faa -csv experiment1.csv

Help message and parameter setting

apclusterv -h

Results

The program will create tmp/ directory. The clustering result is tmp/cluster_result.i.r.csv (cluster_result.3.4.csv by default) Simulation profile used in the manuscript is in data/profile.csv RI and ARI for evalation script is data/eval.py

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

apclusterv-1.2.2.tar.gz (31.2 kB view details)

Uploaded Source

Built Distribution

apclusterv-1.2.2-py3-none-any.whl (37.7 kB view details)

Uploaded Python 3

File details

Details for the file apclusterv-1.2.2.tar.gz.

File metadata

  • Download URL: apclusterv-1.2.2.tar.gz
  • Upload date:
  • Size: 31.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.7.16

File hashes

Hashes for apclusterv-1.2.2.tar.gz
Algorithm Hash digest
SHA256 77c13d4a4593e18dd7b2be2b4e0b880d55240c891dacbc864f8ab4d3da76f4b1
MD5 13da9acf55c1d4d82d4ae519d2f5fd73
BLAKE2b-256 8ec79cbb3b3628aed4156f1ccda806513b684a43f23a3e2ef381c993e8b7c673

See more details on using hashes here.

File details

Details for the file apclusterv-1.2.2-py3-none-any.whl.

File metadata

  • Download URL: apclusterv-1.2.2-py3-none-any.whl
  • Upload date:
  • Size: 37.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.7.16

File hashes

Hashes for apclusterv-1.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 e51a9e069f1e4ebf55763dd15e8d84646e0b7f9de968b31dd404d559c142f3f8
MD5 9e3b4f447c1e935511ad1bcb30c85685
BLAKE2b-256 71bd8f4dc6e77daa8df95c1d4146e0faa7dce4673c024fc20cee2a7bbb887003

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page