Getting Uniprot Data from Uniprot Accession ID through Uniprot REST API
Project description
UniProt Database Web Parser Project
TLDR: This parser can be used to parse UniProt accession id and obtain related data from the UniProt web database.
To parse UniProt accession
from uniprotparser.parser import UniprotSequence
protein_id = "seq|P06493|swiss"
acc_id = UniprotSequence(protein_id, parse_acc=True)
#Access ACCID
acc_id.accession
#Access isoform id
acc_id.isoform
To get additional data from UniProt online database
from uniprotparser.parser import UniprotParser
from io import StringIO
#Install pandas first to handle tabulated data
import pandas as pd
protein_accession = "P06493"
parser = UniprotParser([protein_accession])
#To get tabulated data
result = []
for i in parser.parse("tab"):
tab_data = pd.read_csv(i, sep="\t")
last_column_name = tab_data.columns[-1]
tab_data.rename(columns={last_column_name: "query"}, inplace=True)
result.append(tab_data)
fin = pd.concat(result, ignore_index=True)
#To get fasta sequence
with open("fasta_output.fasta", "wt") as fasta_output:
for i in parser.parse():
fasta_output.write(i)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
uniprotparser-1.0.1.tar.gz
(3.5 kB
view hashes)
Built Distribution
Close
Hashes for uniprotparser-1.0.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b47e9b2141d6da9a453417151572b104b61bb40cf3f8589ed931ffb110aac1c9 |
|
MD5 | a6ec51025fb3de6710fd18acfd9c6192 |
|
BLAKE2b-256 | bf0cc7aa03e29bb23a704176011d193a08cd2d2b64c0792d3285235999d3ad2d |