a python package to generate and analyse protein similarity networks
Project description
networker
Description
This repository contains scripts to create and analyse protein (or nucleotide) identity networks.
Scirpts
1. networker.py
Usage:
python networker.py proteinseqs.faa
or,
python networker.py diamondorblastdatabase.tsv
If provided with a DIAMOND or BLAST table (.tsv), or an amino acid fasta file (.faa) 'networker.py' will produce a protein/nucleotide identity network ('xxx.html') and a list of nodes used in the network ('xxx.txt').
It is recommended that you use .faa as an input, however if you want to use your own table, the file must have the format: qseqid | sseqid | pident
For example, to make a usable DIAMOND search:
diamond makedb --db example --in example.faa
diamond blastp --db example --query example.faa --out example.tsv --outfmt 6 qseqid sseqid pident
All steps are currently automatic, including thresholding. However, the recommended threshold should work for most putposes. In future, you will be able to provide a custom threshold.
2. subnetworker.py
WIP
3. histogramer.py
WIP
Example Data
Example data and output can be found in the example_data folder. The file example.faa was used to generate the DIAMOND database and, subsequentally, the network ('example.html') and the node list ('example.txt').
To Do
- Enable custom thresholds.
- Create 'histogramer.py' to assist with threshold analysis.
- Create 'subnetworker.py' for analysing subnetworks.
- Compile into software package.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file networker-0.1.0.tar.gz
.
File metadata
- Download URL: networker-0.1.0.tar.gz
- Upload date:
- Size: 4.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.7.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d2330f422f66b751144fe6a85e5e76d882e4c640759fc9f627846fd695cfa12f |
|
MD5 | ac5bead517f16ac3b12ca2a72c0a3226 |
|
BLAKE2b-256 | d92188cf10fb8aea54a895a4ec5810a4fd04a704a1619db2fcecc6aa6850ba5e |