A package to match Pfam accession numbers to corresponding GO terms
Project description
pfam2go
The pfam2go package provides a short and simple interface to match the Pfam accesion numbres to Gene Ontology annotation data.
The Pfam - Go term mapping was taken from:
http://current.geneontology.org/ontology/external2go/pfam2go
Mitchell et al. (2015) Nucl. Acids Res. 43 :D213-D221
The Go term information is taken from QuickGO:
Installation
pfam2go can be installed via pip:
pip install pfam2go
Usage
pfam2go(pfam_seqs: Union[Iterable[str], str]) -> pd.DataFrame
Input:
pfam_seqs: string or an Iterable object containing strings (e.g., list or pd.Series).
Returns:
pd.Dataframe containing GO terms for all corresponding Pfam numbers. Dataframe contains 5 string fields:
- Pfam accession number
- GO accession number
- GO name
- GO definition
- GO functional aspect
One Pfam number can correspond to several GO terms.
In case the information about a specific GO term has not been found in QuickGO the last 3 columns will be assigned to NaN.
Example
from pfam2go import pfam2go
pfam_list = ['PF00032', 'PF00049', 'PF08463']
data = pfam2go(pfam_list)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for pfam2go-1.1.2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 69ffdd9003c83aaed40fe9ce80049ae666ca6750fa65d84f94e7fe68a7de8737 |
|
MD5 | 90a38c6a036d40023ce7914b7f1e991d |
|
BLAKE2b-256 | 8f2f5f70b0233efaa89f55455474e0b6a368add5fc1eae7b3b46bd8087071c65 |