Skip to main content

Getting Uniprot Data from Uniprot Accession ID through Uniprot REST API

Project description

UniProt Database Web Parser Project

TLDR: This parser can be used to parse UniProt accession id and obtain related data from the UniProt web database.

With version 1.0.2, support for the new UniProt REST API have been added under betaparser module of the package.

In order to utilize this new module, you can follow the example bellow

from uniprotparser.betaparser import UniprotParser
from io import StringIO

import pandas as pd
example_acc_list = ["Q99490", "Q8NEJ0", "Q13322", "P05019", "P35568", "Q15323"]
parser = UniprotParser()
df = []
#Yield result for 500 accession ids at a time
for r in parser.parse(ids=example_acc_list):
    df.append(pd.read_csv(StringIO(r), sep="\t"))

#Check if there were more than one result and consolidate them into one dataframe
if len(df) > 0:
    df = pd.concat(df, ignore_index=True)
else:
    df = df[0]

To parse UniProt accession with legacy API

from uniprotparser.parser import UniprotSequence

protein_id = "seq|P06493|swiss"

acc_id = UniprotSequence(protein_id, parse_acc=True)

#Access ACCID
acc_id.accession

#Access isoform id
acc_id.isoform

To get additional data from UniProt online database

from uniprotparser.parser import UniprotParser
from io import StringIO
#Install pandas first to handle tabulated data
import pandas as pd

protein_accession = "P06493"

parser = UniprotParser([protein_accession])

#To get tabulated data
result = []
for i in parser.parse("tab"):
    tab_data = pd.read_csv(i, sep="\t")
    last_column_name = tab_data.columns[-1]
    tab_data.rename(columns={last_column_name: "query"}, inplace=True)
    result.append(tab_data)
fin = pd.concat(result, ignore_index=True)

#To get fasta sequence
with open("fasta_output.fasta", "wt") as fasta_output:
    for i in parser.parse():
        fasta_output.write(i)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

uniprotparser-1.0.4.tar.gz (5.5 kB view details)

Uploaded Source

Built Distribution

uniprotparser-1.0.4-py3-none-any.whl (6.1 kB view details)

Uploaded Python 3

File details

Details for the file uniprotparser-1.0.4.tar.gz.

File metadata

  • Download URL: uniprotparser-1.0.4.tar.gz
  • Upload date:
  • Size: 5.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.12 CPython/3.9.6 Windows/10

File hashes

Hashes for uniprotparser-1.0.4.tar.gz
Algorithm Hash digest
SHA256 caa764dc6f0947b4d653a6f21a7c6ac8746f38dda7c236be5a1edc9770818f01
MD5 0353b3003e4dd9a5e0159f883ad7252d
BLAKE2b-256 2bf2b4361bdce745e1cb3d2b26d1a0161193052f2d0be6c7c6c4ac3d43d53a35

See more details on using hashes here.

File details

Details for the file uniprotparser-1.0.4-py3-none-any.whl.

File metadata

  • Download URL: uniprotparser-1.0.4-py3-none-any.whl
  • Upload date:
  • Size: 6.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.12 CPython/3.9.6 Windows/10

File hashes

Hashes for uniprotparser-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 53217a6299a0849e75e5a1eab880f4ab0fb68f8341e7a7db4968dc7b0d74e240
MD5 da0136c8c8cd83cd851935dae4d821ad
BLAKE2b-256 7be9a55b1255878ebaeaaec12625ad9cd210269ec5f17268396dec444844d86d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page