Skip to main content

Find Lysine Acylation & other Modification Sites

Reason this release was yanked:

package does not work

Project description

FLAMS: Find Lysine Acylations & other Modification Sites

A bioinformatics tool to analyze the conservation of lysine modifications, by means of a position-based search against the Compendium of Protein Lysine Modifications (CPLM database) v.4. FLAMS is available as command-line tool and as a web service.

Table of contents

  1. Introduction
  2. System requirements
    1. General dependencies
    2. Third-party dependencies
  3. Installation
  4. Usage
    1. Example use case
  5. Output
  6. Contact
  7. References
  8. License

Introduction

FLAMS is a bioinformatics tool to analyze the conservation of lysine modifications, by means of a position-based search against the CPLM database v.4 (Zhang, W. et al. Nucleic Acids Research. 2021, 44 (5): 243–250.). FLAMS can be used (i) to quickly verify whether modifications in a specific protein have been reported before, (ii) to assess whether findings in one species might translate to other species, and (iii) to systematically assess the novelty and conservation of all reported lysine modification sites.

The tool takes as input a protein (identifier or sequence) and the position of a lysine. This repository contains the command-line tool FLAMS, which obtains an overview of the previously reported lysine modifications matching your query, by using the following scripts:

  • input.py: processing the user-provided input
  • cplm4.py and setup.py: downloading and preparing the modification-specific databases
  • run_blast.py: searching your query against the databases of proteins with lysine modifications
  • display.py: formatting the list of conserved lysine modifications to a tab delimited output file

FLAMS is also available as a web service at https://www.biw.kuleuven.be/m2s/cmpg/research/CSB/tools/flams/ .

System requirements

Linux 64-bit, Windows and Mac OS supported.

General dependencies

  • Python3 (>=3.10)

Third-party dependencies

Installation

The recommended installation for Mac OS and Linux is through conda:

conda install -c bioconda flams

It is also possible to install FLAMS through pip (recommended installation for Windows):

pip install flams

Please note that the pip install requires users to have BLAST+ installed locally and available in PATH. For more information on how to install BLAST+ on Windows, click here .

Usage

Run the tool:

FLAMS [-h] (--in inputFilePath | --id UniProtID) -p position [--range errorRange] [-t threadsBLAST] [-o outputFilePath] [-d dataDir] [-m modification [modification ...]]

Required arguments:

  • one of:
    • inputFilePath is the path to a .fasta file with the protein you wish to query against (has to contain only 1 protein)
    • UniProtID is the UniProt ID of the protein you wish to query against
  • position is the position of a lysine in the protein, which you want to query against

Optional arguments:

  • errorRange is an number of positions before and after pos to also search for modifications. [default: 0]
  • threadsBLAST is a BLAST parameter, allows you to speed up the search by multithreading. [default: 1]
  • outputFilePath is the path to where the result will be saved (in a .tsv file format). [default: out.tsv]
  • dataDir is the path to directory where intermediate files (the UniProt sequence files) are stored. [default: $PWD/data]"
  • modification is one or a combination (seperated by spaces) of: ubiquitination, sumoylation, pupylation, neddylation, acetylation, succinylation, crotonylation, malonylation, 2-hydroxyisobutyrylation, beta-hydroxybutyrylation, butyrylation, propionylation, glutarylation, lactylation, formylation, benzoylation, hmgylation, mgcylation, mgylation, methylation, glycation, hydroxylation, phosphoglycerylation, carboxymethylation, lipoylation, carboxylation, dietylphosphorylation, biotinylation, carboxyethylation. We also provide aggregated combinations: 'All','Ubs','Acylations' and'Others', in analogy to the CPLM database. [default: Acylations]"

Example use case

We provide two example use cases for FLAMS:

With the following command, you search whether the TatA (UniProt ID: A0A916NWA0) acetylation on K66 in Dehalococcoide mccartyi strain CBDB1, as described by Greiner-Haas (2021), had been previously detected.

FLAMS --in A0A916NWA0.fa -p 66 -m acetylation -o tatA.tsv

With the following command, you search whether the Mycobabcterium smegmatis' FadD2 (UniProt ID: A0QQ22) K537 is known to carry any modifications of the 'acylations' category, similar to what was reported by Xu (2020).

FLAMS --id A0QQ22 -p 537 -m Acylations -o FadD2.tsv

You can find the example input and output data in the folder test_data.

For more example use cases, see the Supplementary information of the paper.

Output

The output file is a .tsv containing one row per modification that matched the query, i.e., aligning (within the user-specified range) to the query lysine, in a protein similar to the query protein. The output file contains five columns:

  • UniProt ID: UniProt identifier of matched protein
  • Modification: the type of modification found in the matched protein
  • Lysine location: the location of this matched modification in the matched protein
  • Lysine window: the local sequence containing the conserved lysine modification (window of five amino acids before and after°)
  • Species: the textual description of the species of the matched protein

°: window can be smaller than the [-5;+5] window if the sequence alignment ends sooner, which can happen for modified lysines near the start/end of the protein

Contact

Laboratory of Computational Systems Biology, KU Leuven.

References

If you use FLAMS in your work, please cite us.

In addition, FLAMS relies on third-party software & database:

Zhang, W., Tan, X., Lin, S., Gou, Y., Han, C., Zhang, C., Ning, W., Wang, C. & Xue, Y. (2021) "CPLM 4.0: an updated database with rich annotations for protein lysine modifications." Nucleic Acids Research. 44(5):243–250.

Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. (1990) "Basic local alignment search tool." J. Mol. Biol. 215:403-410.

License

FLAMS is freely available under an MIT license.

Use of the third-party software, libraries or code referred to in the References section above may be governed by separate terms and conditions or license provisions. Your use of the third-party software, libraries or code is subject to any such terms and you should check that you can comply with any applicable restrictions or terms and conditions before use.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

flams-0.0.7.tar.gz (18.7 kB view details)

Uploaded Source

Built Distribution

flams-0.0.7-py3-none-any.whl (5.3 kB view details)

Uploaded Python 3

File details

Details for the file flams-0.0.7.tar.gz.

File metadata

  • Download URL: flams-0.0.7.tar.gz
  • Upload date:
  • Size: 18.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/6.0.0 pkginfo/1.9.6 requests/2.29.0 requests-toolbelt/0.9.1 tqdm/4.65.0 CPython/3.11.0

File hashes

Hashes for flams-0.0.7.tar.gz
Algorithm Hash digest
SHA256 5df4051b19d0dcdc566d148af013cf5d35d847bea6b98865be09ee5466c0c009
MD5 b1056b76ad539b83d313609a73e60bc4
BLAKE2b-256 029a5450c30d82444b70fbd5ea2a1ae196243ddd49f991611337a52d90700d09

See more details on using hashes here.

File details

Details for the file flams-0.0.7-py3-none-any.whl.

File metadata

  • Download URL: flams-0.0.7-py3-none-any.whl
  • Upload date:
  • Size: 5.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/6.0.0 pkginfo/1.9.6 requests/2.29.0 requests-toolbelt/0.9.1 tqdm/4.65.0 CPython/3.11.0

File hashes

Hashes for flams-0.0.7-py3-none-any.whl
Algorithm Hash digest
SHA256 99dfa6753f8966542dc41d3a657d1fa1f72540f2c0ac12781c64e19f3918054a
MD5 baa6d510a501510d0312dfe11fbfdea8
BLAKE2b-256 3ef97b5a9621eccf0be2c277996ee89a576eb46e47bf03d6e250376a88adba15

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page