Skip to main content

CathPy - Python Bioinformatics Toolkit for CATH (Protein Classification).

Project description


Documentation Status

Getting Started

Get code

$ git clone
$ cd cathpy

Install code into virtual environment

$ python3 -m venv venv
$ source venv/bin/activate
$ pip install -e .

If everything is installed and working properly then the following should work:

$ cath-align-summary -d tests/data/funfams/
file aln_len seq_count dops gap_per
tests/data/funfams/                          69     51  61.53  12.53
tests/data/funfams/                          66    429 100.00  13.04
tests/data/funfams/                           59     14   7.81   3.15
tests/data/funfams/                          63    203  95.88  17.70


What is cathpy?

cathpy is a python package that contains bioinformatics tools and libraries used in CATH (protein structure classification resource at UCL).

Hmmm.. that sounds like Yet Another Python Bioinformatics Toolkit?

Well it is... sort of.

Should I be using it?

If you are looking for a general Bioinformatics toolkit, you probably want to look at BioPython first.

This project does contain generic libraries and tools, however the code is new and the API may move around. It has been published mainly for internal use (within CATH), however it has been released as open source in case others find the tools helpful.

So, why doesn't this use BioPython?

We may well merge in some BioPython modules in the future. There are few features that BioPython does not currently handle (eg regarding fairly low-level manipulation of alignments). At some point, we may look into turning some of this code into suggestions / pull-requests.


This code base contains external tools that are not written and maintained by the authors of this project. If you use the results of these tools, please reference the relevant papers.


Capra JA and Singh M. Characterization and Prediction of Residues Determining Protein Functional Specificity. Bioinformatics, 24(13): 1473-1480, 2008.


Valdar WSJ (2002). Scoring residue conservation. Proteins: Structure, Function, and Genetics. 43(2): 227-241.

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for cathpy, version 0.1.3
Filename, size File type Python version Upload date Hashes
Filename, size cathpy-0.1.3.linux-x86_64.tar.gz (279.6 kB) File type Source Python version None Upload date Hashes View hashes
Filename, size cathpy-0.1.3-py3-none-any.whl (232.8 kB) File type Wheel Python version py3 Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page