A Python package to generate and analyse protein similarity networks

networker

Description

This repository contains scripts to create and analyse protein (or nucleotide) identity networks.

Scripts

1. networker.py

Usage:

python networker.py proteinseqs.faa

or,

python networker.py diamondorblastdatabase.tsv

If provided with a DIAMOND or BLAST table (.tsv) or an amino acid FASTA file (.faa), 'networker.py' produces a protein/nucleotide identity network ('xxx.html') and a list of the nodes used in the network ('xxx.txt').

Using a .faa file as input is recommended. If you want to use your own table instead, the file must have the format: qseqid | sseqid | pident
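Before passing your own table to 'networker.py', it can help to check that it really has the three expected columns. The following is a minimal sketch, not part of networker itself: the function name `read_identity_table` is hypothetical, and it assumes the table is tab-separated with no header row, as DIAMOND and BLAST produce with output format 6.

```python
import csv
import io


def read_identity_table(handle):
    """Yield (qseqid, sseqid, pident) tuples from a tab-separated table.

    Hypothetical helper for illustration; assumes no header row.
    """
    for line_no, row in enumerate(csv.reader(handle, delimiter="\t"), start=1):
        if not row:
            continue  # skip blank lines
        if len(row) != 3:
            raise ValueError(
                f"line {line_no}: expected 3 columns, got {len(row)}"
            )
        qseqid, sseqid, pident = row
        # pident is a percentage, so convert it to a number
        yield qseqid, sseqid, float(pident)


# Made-up rows, used only to show the expected layout
example = "seqA\tseqB\t87.5\nseqA\tseqC\t42.0\n"
rows = list(read_identity_table(io.StringIO(example)))
```

For a real file, the same function could be called on `open("example.tsv", newline="")`.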

For example, to make a usable DIAMOND search:

diamond makedb --db example --in example.faa

diamond blastp --db example --query example.faa --out example.tsv --outfmt 6 qseqid sseqid pident
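If you prefer to drive the two DIAMOND commands above from Python, one possible sketch is shown here. The helper names `diamond_commands` and `run_diamond` are hypothetical and not part of networker; the sketch assumes the `diamond` executable is on your PATH and simply reproduces the two command lines given above.

```python
import subprocess


def diamond_commands(fasta, db, out):
    """Build the makedb and blastp command lines (hypothetical helper)."""
    makedb = ["diamond", "makedb", "--db", db, "--in", fasta]
    blastp = [
        "diamond", "blastp",
        "--db", db,
        "--query", fasta,
        "--out", out,
        # tabular output restricted to the three columns networker needs
        "--outfmt", "6", "qseqid", "sseqid", "pident",
    ]
    return makedb, blastp


def run_diamond(fasta, db, out):
    """Run both commands in order; raises CalledProcessError on failure."""
    for cmd in diamond_commands(fasta, db, out):
        subprocess.run(cmd, check=True)


# Building the commands does not require DIAMOND to be installed
makedb_cmd, blastp_cmd = diamond_commands("example.faa", "example", "example.tsv")
```

Calling `run_diamond("example.faa", "example", "example.tsv")` would then run the all-against-all search and write 'example.tsv'.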

All steps, including thresholding, are currently automatic. The recommended threshold should work for most purposes. In future, you will be able to provide a custom threshold.
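To illustrate what thresholding means for an identity network in general, the sketch below keeps only the pairs whose percent identity reaches a cutoff and stores them as an undirected adjacency map. This is an assumption-laden illustration, not networker's own method: how networker chooses its threshold is not described here, and the function name `build_network`, the cutoff of 60.0 and the example rows are all made up.

```python
from collections import defaultdict


def build_network(rows, threshold):
    """Return {node: set of neighbours}, keeping edges with pident >= threshold.

    Hypothetical helper for illustration only.
    """
    network = defaultdict(set)
    for qseqid, sseqid, pident in rows:
        if qseqid == sseqid:
            continue  # ignore self-hits
        if pident >= threshold:
            # treat identity as symmetric: add the edge in both directions
            network[qseqid].add(sseqid)
            network[sseqid].add(qseqid)
    return dict(network)


# Made-up (qseqid, sseqid, pident) rows
rows = [
    ("seqA", "seqA", 100.0),
    ("seqA", "seqB", 87.5),
    ("seqA", "seqC", 42.0),
    ("seqB", "seqC", 61.0),
]
network = build_network(rows, threshold=60.0)  # 60.0 is an arbitrary cutoff
```

With these rows, the seqA/seqC pair falls below the cutoff and is dropped, so seqA and seqC are connected only through seqB.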

2. subnetworker.py

WIP

3. histogramer.py

WIP

Example Data

Example data and output can be found in the example_data folder. The file example.faa was used to generate the DIAMOND database and, subsequently, the network ('example.html') and the node list ('example.txt').

To Do

  1. Enable custom thresholds.
  2. Create 'histogramer.py' to assist with threshold analysis.
  3. Create 'subnetworker.py' for analysing subnetworks.
  4. Compile into software package.
