Package for the computation of distances from a residue to the catalytic active residues.

Project description

Mutation_Checker

Python package for checking the distance of a mutation from an active center of a protein.

Installation

Run the following to install:

pip3 install MutationChecker

Usage

This package consists of several modules: MAPPER, EMBL, PROSITE and STRUCTURE. Each one has its own methods.

MAPPER MODULE

The mapper module consists of three functions: GeneToUniprot, GeneToPDB and GeneToFasta. These functions convert identifiers between the main databases.

GeneToUniprot

This function takes the name of a gene and maps it to a reviewed Uniprot ID. By default it uses the human species.

@ input - gene (str) Name of Gene
@ input - specie (str) Name of the Specie. Default: Human
@ output - Uniprot ID (str) - Uniprot ID Code

Example: Get the Uniprot ID for the EIF2B5 gene

from MutationChecker.mapper import GeneToUniprot
GeneToUniprot("EIF2B5")

GeneToPDB

This function takes the name of a gene and maps it to a list of PDB IDs. By default it uses the human species.

@ input - gene (str) Name of Gene
@ input - specie (str) Name of the Specie. Default: Human
@ output - PDB ID (list of str) - PDB ID Codes

Example: Get the PDB codes for the EIF2B5 gene

from MutationChecker.mapper import GeneToPDB
GeneToPDB("EIF2B5")

GeneToFasta

This function takes the name of a gene and extracts its sequence from Uniprot.

@ input - gene (str) Name of Gene
@ input - specie (str) Name of the Specie. Default: Human
@ output - Uniprot Fasta (str)

Example: Get the Fasta sequence for the EIF2B5 gene

from MutationChecker.mapper import GeneToFasta
GeneToFasta("EIF2B5")
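The README does not say whether the string returned by GeneToFasta includes a FASTA header line. As a minimal sketch, assuming it might, the helper below (fasta_to_sequence, a hypothetical name that is not part of MutationChecker) reduces a FASTA record to the bare residue string; the record shown is made up for illustration.

```python
def fasta_to_sequence(fasta: str) -> str:
    """Return the residues of a single FASTA record as one string.

    Header lines (starting with '>') are dropped and the wrapped
    sequence lines are joined. A bare sequence is returned unchanged.
    """
    lines = fasta.strip().splitlines()
    return "".join(line.strip() for line in lines if not line.startswith(">"))


# Made-up record, used only to exercise the helper.
example_record = ">sp|EXAMPLE|DEMO_HUMAN Example protein\nMKTAYIAKQR\nQISFVKSHFS\n"
sequence = fasta_to_sequence(example_record)
```

The same helper leaves an already bare sequence untouched, so it is safe to apply before passing a sequence on to other functions.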

STRUCTURE MODULE

This module contains methods related to computations of the PDB structure file of a protein.

DownloadPDB

This function takes a PDB code (str) or a list of PDB codes and downloads the structure file into the working folder. When given a list, it downloads the longest PDB structure.

@ input - List of Strings (or standalone str) - Code of PDBs
@ output - PDB file (str) - Path of the downloaded file.

Example: Download 1UBQ

from MutationChecker.structure import DownloadPDB
DownloadPDB("1UBQ")

PDBtoSequence

This function takes a PDB file and extracts the sequence of the structure.

@ input - PDB File path (str)
@ output - Fasta Sequence (str) - Sequence of the structure.

Example: Get sequence for 1UBQ file

from MutationChecker.structure import PDBtoSequence
PDBtoSequence("./1UBQ")

MapUniprotToPDB

This function takes a Uniprot sequence, a PDB sequence of the same protein and a Uniprot residue number, and returns the corresponding residue number on the structure.

@ input - Uniprot Fasta (str), can be obtained with the method GeneToFasta
@ input - PDB Fasta (str), can be obtained with the method PDBtoSequence
@ input - Uniprot residue number (int)
@ output - PDB residue number that matches the given Uniprot residue number (int)

from MutationChecker.mapper import GeneToFasta
from MutationChecker.structure import MapUniprotToPDB, PDBtoSequence
MapUniprotToPDB(GeneToFasta("EIF2B5"), PDBtoSequence("3JUI"), 45)
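The README does not describe how the two sequences are aligned. As an illustration of the general idea only, the sketch below maps a position between two sequences using matching blocks from Python's standard difflib module. The function name map_residue and the toy sequences are assumptions made for this example, and the result is an index into the PDB sequence string, which may differ from the residue numbering stored in the PDB file.

```python
from difflib import SequenceMatcher
from typing import Optional


def map_residue(uniprot_seq: str, pdb_seq: str, uniprot_pos: int) -> Optional[int]:
    """Map a 1-based position in uniprot_seq to a 1-based index in pdb_seq.

    Returns None when the residue is not covered by any matching block,
    for example when the structure lacks that part of the protein.
    """
    matcher = SequenceMatcher(None, uniprot_seq, pdb_seq, autojunk=False)
    target = uniprot_pos - 1  # switch to 0-based indexing
    for block in matcher.get_matching_blocks():
        # block.a and block.b are the start offsets of an identical
        # stretch in uniprot_seq and pdb_seq; block.size is its length.
        if block.a <= target < block.a + block.size:
            return block.b + (target - block.a) + 1
    return None


# Toy sequences: the "structure" covers only residues 4 to 14.
full_sequence = "MKTAYIAKQRQISFVKSHFSRQ"
structure_sequence = "AYIAKQRQISF"

mapped = map_residue(full_sequence, structure_sequence, 5)
missing = map_residue(full_sequence, structure_sequence, 1)
```

Here mapped refers to the tyrosine at position 5 of the full sequence, which is the second residue of the structure sequence, while missing is None because the first residue is absent from the structure.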

CheckDistances

This function takes a residue number, a list of other residue numbers and a PDB structure file, and computes the physical distance between the first residue and each residue in the list.

@ input - PDB residue number (int)
@ input - List of PDB residue numbers (list)
@ input - PDB file path (str)
@ output - List of distances between the first input residue and the ones in the list (float)

from MutationChecker.structure import CheckDistances
CheckDistances(1, [5, 6, 7], "./pdb3jui.ent")
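The README does not state which atoms CheckDistances measures between. As a minimal sketch of the underlying calculation, assuming one representative coordinate per residue (for instance the C-alpha atom), the distance is the Euclidean norm between two points. The coordinates and the function name distances_from are hypothetical and are not read from a real structure.

```python
import math
from typing import Dict, List, Tuple

Coordinate = Tuple[float, float, float]

# Hypothetical coordinates in angstroms, keyed by residue number.
ca_coords: Dict[int, Coordinate] = {
    1: (0.0, 0.0, 0.0),
    5: (3.0, 4.0, 0.0),
    6: (0.0, 6.0, 8.0),
    7: (1.0, 2.0, 2.0),
}


def distances_from(residue: int, others: List[int],
                   coords: Dict[int, Coordinate]) -> List[float]:
    """Return the Euclidean distance from `residue` to each residue in `others`."""
    origin = coords[residue]
    return [math.dist(origin, coords[other]) for other in others]


result = distances_from(1, [5, 6, 7], ca_coords)
```

With these made-up coordinates the three distances are 5, 10 and 3 angstroms. math.dist requires Python 3.8 or later.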

PROSITE MODULE

This module has methods to search and parse the PROSITE database.

PrositeRequest

This function takes a Uniprot ID and returns a JSON with information about the domains and motifs of the protein.

@ input - Uniprot ID (str)
@ output - Information about protein domains (json)

from MutationChecker.prosite import PrositeRequest
PrositeRequest("Q13144")

CheckMutationProsite

This function takes the number of a residue in the Uniprot sequence and the Uniprot ID of the protein. It searches the Prosite database to extract the motifs, and checks whether the mutation falls inside one of them.

@ input - num_residue (int) - Number of the residue to check in the sequence
@ input - uniprot_id (str) - String of the Uniprot identifier of the protein to check.
@ output - Bool - The "num_residue" falls into the domain found by Prosite

from MutationChecker.prosite import CheckMutationProsite
CheckMutationProsite(45, "Q13144")

RetrieveDomain

This function takes the number of a residue in the Uniprot sequence and the Uniprot ID of the protein. It searches the Prosite database to extract the motifs, and returns the domain in which the residue falls.

@ input - num_residue (int) - Number of the residue to check in the sequence
@ input - uniprot_id (str) - String of the Uniprot identifier of the protein to check.
@ output - tuple of str - Tuple with the parameters (Name of the domain found at num_residue, Accession code of Prosite of the domain.)

from MutationChecker.prosite import RetrieveDomain
RetrieveDomain(45, "Q13144")
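Both CheckMutationProsite and RetrieveDomain come down to asking whether a residue number lies inside the start and stop positions of a motif. The sketch below shows that interval test on a hand-written list of records. The field names ("name", "accession", "start", "stop"), the record values and the helper names are assumptions made for this example; the actual JSON returned by PrositeRequest may be structured differently.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical, simplified motif records (not real Prosite entries).
matches: List[Dict[str, object]] = [
    {"name": "EXAMPLE_DOMAIN_1", "accession": "PS_EXAMPLE_1", "start": 30, "stop": 60},
    {"name": "EXAMPLE_DOMAIN_2", "accession": "PS_EXAMPLE_2", "start": 120, "stop": 180},
]


def find_domain(num_residue: int,
                records: List[Dict[str, object]]) -> Optional[Tuple[str, str]]:
    """Return (name, accession) of the first motif containing num_residue.

    Start and stop are treated as inclusive, 1-based positions.
    Returns None when no motif covers the residue.
    """
    for record in records:
        if record["start"] <= num_residue <= record["stop"]:
            return (record["name"], record["accession"])
    return None


def falls_in_domain(num_residue: int, records: List[Dict[str, object]]) -> bool:
    """Return True when num_residue lies inside any motif."""
    return find_domain(num_residue, records) is not None


hit = find_domain(45, matches)
outside = falls_in_domain(100, matches)
```

Treating the bounds as inclusive is a design choice of this sketch; a residue sitting exactly on a motif boundary counts as inside.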

EMBL MODULE

This module has functions for parsing the active site described on EMBL.

ObtainActiveCenterResidues

This function takes the Uniprot ID of a protein and returns a list of the residue numbers that make up the active site according to EMBL-EBI.

@ input - Uniprot ID (str) Uniprot ID of the protein
@ output - List of Strings - Active Site residue numbers.

If the protein does not have an active site mapped on EMBL-EBI, it returns None.

Example: Get Active Site residues for LTA4H

from MutationChecker.embl import ObtainActiveCenterResidues
from MutationChecker.mapper import GeneToUniprot, GeneToFasta

UniprotID = GeneToUniprot("LTA4H")
ObtainActiveCenterResidues(UniprotID)
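Because the result is documented as either a list of strings or None, calling code has to handle both cases before using the values as residue numbers. A minimal sketch of one way to normalise the result follows; to_residue_numbers is a hypothetical helper that is not part of MutationChecker, and the sample values are placeholders rather than real active-site residues.

```python
from typing import List, Optional


def to_residue_numbers(active_site: Optional[List[str]]) -> List[int]:
    """Convert a documented ObtainActiveCenterResidues-style result to ints.

    None (no active site mapped) becomes an empty list, so callers can
    iterate over the result without a separate None check.
    """
    if active_site is None:
        return []
    return [int(residue) for residue in active_site]


# Placeholder values standing in for a real result.
numbers = to_residue_numbers(["10", "25"])
empty = to_residue_numbers(None)
```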

CheckDistanceToActiveSite

This function takes the name of a gene and a residue number, and computes the physical distance in angstroms between that residue and the active site residues.

@ input - gene (str) Name of the gene
@ input - residue number (int) - Number of residue to check
@ output - List of tuples (Name of active site residue, number)

Example: Get distances to the Active site from ASN488 in LTA4H

from MutationChecker.embl import CheckDistanceToActiveSite
CheckDistanceToActiveSite("LTA4H", 488)
