Find Lysine Acylation & other Modification Sites
Project description
FLAMS: Find Lysine Acylation & other Modification Sites
A command-line tool to analyze the conservation of lysine modifications, by means of a position-based search against the CPLM database v.4.
Table of contents
Introduction
FLAMS is a tool that looks for conservation of modifications on lysine residues by first looking for similar proteins in the Compendium of Protein Lysine Modification Sites (CPLM v.4, Zhang, W. et al. Nucleic Acids Research. 2021, 44 (5): 243–250.), and then extracting those proteins that contain a modified lysine at the queried position. The aim of this analysis is to easily identify conserved lysine modifications, to aid in identifying functional lysine modification sites and the comparison of the results of PTM identification studies across species.
The tool takes as input a protein sequence and the position of a lysine. This repository contains a command-line tool FLAMS
to obtain an overview of the conserved lysine modifications matching your query, by using the following scripts:
- input.py: processing the user-provided input
- cplm4.py and setup.py: downloading and preparing the modification-specific databases
- run_blast.py: searching your query against the databases of proteins with lysine modifications
- display.py: formatting the list of conserved lysine modifications to a tab delimeted output file
System requirements
Linux 64-bit and Mac OS supported. Windows users are advised to run the tool through Anaconda.
General dependencies
- Python3 (>=3.10)
- shutil
- BioPython
Third-party dependencies
Installation
The recommended installation takes place in a dedicated conda environment, which can be created as follows:
conda create -n flamsEnv
In this environment, the correct Python environment can be set up, and other dependencies can be installed through pip:
conda activate flamsEnv
conda install -c conda-forge python=3.10
pip install -r requirements.txt
Please make sure that BLAST is installed locally, and available in the PATH.
Usage
Download the project:
git@github.com:hannelorelongin/flams-v1.git
cd flams
Run the tool:
python FLAMS [-h] (--in inputFilePath | --id UniProtID) -p position [-m modification [modification ...]] [--range errorRange] [-t threadsBLAST] [-o outputFilePath]
Required arguments:
- one of:
inputFilePath
is the path to a .fasta file with the protein you wish to query against (has to contain only 1 protein)UniProtID
is the UniProt ID of the protein you wish to query against
position
is the position of a lysine in the protein, which you want to query against
Optional arguments:
modification
is one or a combination (seperated by spaces) of: ubiquitination, sumoylation, pupylation, neddylation, acetylation, succinylation, crotonylation, malonylation, 2-hydroxyisobutyrylation, beta-hydroxybutyrylation, butyrylation, propionylation, glutarylation, lactylation, formylation, benzoylation, hmgylation, mgcylation, mgylation, methylation, glycation, hydroxylation, phosphoglycerylation, carboxymethylation, lipoylation, carboxylation, dietylphosphorylation, biotinylation, carboxyethylation. We also provide aggregated combinations: 'All','Ubs','Acylations' and'Others', in analogy to the CPLM database. [default: Acylations]"errorRange
is an number of positions before and afterpos
to also search for modifications. [default: 0]threadsBLAST
is a BLAST parameter, allows you to speed up the search by multithreading. [default: 1]outputFilePath
is the path to where the result will be saved (in a .tsv file format). [default: out.tsv]
Example:
python FLAMS --in P57703.fa -p 306 --range 5 -o results.tsv -m acetylation succinylation
python FLAMS --id P57703 738 -m acetylation propionylation
Output
The output file is a tsv containing one row per modification that matched the query, i.e., aligning (within the user-specified range) to the query lysine, in a protein similar to the query protein. TSV output file contains five columns:
- UniProt ID: UniProt identifier of matched protein
- Species: the textual description of the species of the matched protein
- Modification: the type of modification found in the matched protein
- Lysine location: the location of this matched modification in the matched protein
- Lysine window: the local sequence containing the conserved lysine modification (window of five amino acids before and after°)
°: window can be smaller than the [+5;-5] window if the sequence alignment ends sooner, which can happen for modified lysines near the start/end of the protein
Contact
Laboratory of Computational Systems Biology, KU Leuven.
References
Zhang, W., Tan, X., Lin, S., Gou, Y., Han, C., Zhang, C., Ning, W., Wang, C. & Xue, Y. (2021) "CPLM 4.0: an updated database with rich annotations for protein lysine modifications." Nucleic Acids Research. 44(5):243–250.
Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. (1990) "Basic local alignment search tool." J. Mol. Biol. 215:403-410.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file flams-0.0.3.tar.gz
.
File metadata
- Download URL: flams-0.0.3.tar.gz
- Upload date:
- Size: 17.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/6.0.0 pkginfo/1.9.6 requests/2.29.0 requests-toolbelt/0.9.1 tqdm/4.65.0 CPython/3.11.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 902d444c478338c87ebfab20a837cc9f5622a7a9724612169b60faf913496ff3 |
|
MD5 | be94b5e5298bc434c020c8fa8b410552 |
|
BLAKE2b-256 | 11d870680c5a84483fb408dd6f30fa66504090cdd4fb041110c23f9c2cc34a95 |
File details
Details for the file flams-0.0.3-py3-none-any.whl
.
File metadata
- Download URL: flams-0.0.3-py3-none-any.whl
- Upload date:
- Size: 18.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/6.0.0 pkginfo/1.9.6 requests/2.29.0 requests-toolbelt/0.9.1 tqdm/4.65.0 CPython/3.11.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1ff66bdde1b1c6c8abc75b69185e102658490f79a7963eabebf8742a1faac9f3 |
|
MD5 | 00718c9105eb66021d265ebbdb9a1143 |
|
BLAKE2b-256 | ad25ef202fb8f78a7db3c72af4849caba8dccdaad09d6a581ece86057510bd41 |