Getting UniProt Data from UniProt Accession IDs through the UniProt REST API
Project description
UniProt Database Web Parser Project
TLDR: This parser can be used to parse UniProt accession IDs and obtain related data from the UniProt web database.
As of version 1.0.2, support for the new UniProt REST API has been added under the betaparser module of the package.
To use this new module, follow the example below:
from uniprotparser.betaparser import UniprotParser
from io import StringIO
import pandas as pd
example_acc_list = ["Q99490", "Q8NEJ0", "Q13322", "P05019", "P35568", "Q15323"]
parser = UniprotParser()
df = []
# Each yielded result covers up to 500 accession IDs
for r in parser.parse(ids=example_acc_list):
    df.append(pd.read_csv(StringIO(r), sep="\t"))
# If more than one result was returned, consolidate them into one DataFrame
if len(df) > 1:
    df = pd.concat(df, ignore_index=True)
else:
    df = df[0]
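The comment in the example above says that results arrive for 500 accession IDs at a time. The following is a minimal sketch of that kind of batching, not the package's own implementation; `batch_ids` and its `batch_size` parameter are hypothetical names introduced here for illustration.

```python
from typing import Iterator, List, Sequence


def batch_ids(ids: Sequence[str], batch_size: int = 500) -> Iterator[List[str]]:
    """Yield consecutive slices of `ids`, each at most `batch_size` long.

    Hypothetical helper for illustration; not part of uniprotparser.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(ids), batch_size):
        yield list(ids[start:start + batch_size])


example_acc_list = ["Q99490", "Q8NEJ0", "Q13322", "P05019", "P35568", "Q15323"]
# A small batch size is used here only to make the split visible
batches = list(batch_ids(example_acc_list, batch_size=4))
# batches[0] holds the first four IDs, batches[1] the remaining two
```

With the default `batch_size=500`, a list of six IDs would come back as a single batch, which is why the example above may produce only one DataFrame.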
To parse a UniProt accession with the legacy API:
from uniprotparser.parser import UniprotSequence
protein_id = "seq|P06493|swiss"
acc_id = UniprotSequence(protein_id, parse_acc=True)
#Access ACCID
acc_id.accession
#Access isoform id
acc_id.isoform
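To illustrate what pulling an accession out of an identifier such as "seq|P06493|swiss" can involve, here is a rough sketch built on the accession pattern published in the UniProt help pages. It is not how `UniprotSequence` is implemented; `ACCESSION_RE` and `extract_accession` are hypothetical names, and treating a trailing "-N" as the isoform number is an assumption.

```python
import re
from typing import Optional, Tuple

# Accession pattern from the UniProt help pages, followed by an optional
# "-N" isoform suffix (the suffix handling is an assumption of this sketch).
ACCESSION_RE = re.compile(
    r"([OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2})"
    r"(?:-(\d+))?"
)


def extract_accession(identifier: str) -> Optional[Tuple[str, Optional[str]]]:
    """Return (accession, isoform) for the first match, or None if absent.

    Hypothetical helper for illustration; not part of uniprotparser.
    """
    match = ACCESSION_RE.search(identifier)
    if match is None:
        return None
    return match.group(1), match.group(2)


result = extract_accession("seq|P06493|swiss")
```

The pattern is not anchored to word boundaries, so it can also match inside longer uppercase strings; stricter input may call for splitting on "|" first.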
To get additional data from the UniProt online database:
from uniprotparser.parser import UniprotParser
from io import StringIO
#Install pandas first to handle tabulated data
import pandas as pd
protein_accession = "P06493"
parser = UniprotParser([protein_accession])
#To get tabulated data
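result = []
for i in parser.parse("tab"):
    # Wrap the result in StringIO so pandas reads it as text, not as a file path
    # (assumes parse yields strings, as the FASTA example below implies)
    tab_data = pd.read_csv(StringIO(i), sep="\t")
    # Rename the last column to "query"
    last_column_name = tab_data.columns[-1]
    tab_data.rename(columns={last_column_name: "query"}, inplace=True)
    result.append(tab_data)
fin = pd.concat(result, ignore_index=True)
#To get fasta sequence
with open("fasta_output.fasta", "wt") as fasta_output:
    for i in parser.parse():
        fasta_output.write(i)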
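Once the FASTA output has been written, it may be useful to read it back into memory. The sketch below uses only the standard library; `read_fasta` is a hypothetical helper, and the sample records are made-up placeholders rather than real UniProt output.

```python
from typing import Dict, Iterable, Optional


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Map each FASTA header (without the leading '>') to its sequence.

    Hypothetical helper for illustration; not part of uniprotparser.
    """
    records: Dict[str, str] = {}
    header: Optional[str] = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            # Sequence lines are concatenated under the current header
            records[header] += line
    return records


# Made-up two-record example (placeholder sequences, not real UniProt output)
sample = ">entry_one\nMEDYTK\nIEKIGE\n>entry_two\nMKTAYI\n"
records = read_fasta(sample.splitlines())
```

Because the function accepts any iterable of lines, an open file handle for "fasta_output.fasta" could be passed in place of `sample.splitlines()`.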