Skip to main content

Pipeline for comprehensive analysis of tiling mutagenesis data

Project description

License: GUN

ProTiler-Mut

Introduction of ProTiler-Mut

Hi, this is ProTiler-Mut, a computational pipeline designed for comprehensive analysis of tiling mutagenesis screen data

I have three major functions: 1). Clustering and Categorization of functional mutations from tiling mutagenesis screens
                              2). "3D-RRA" module for robust identification of functional substructures from identified mutation clusters
                              3). PPI-mapping for specific mutation or substructure to identify mutaiton-associated PPIs


Hope you enjoy playing with me ^o^!
                                 
Any questions or bugs, please contact hwkobe.1027@gmail.com or whe3@mdanderson.org

Installation

If Anaconda (or miniconda) is not installed with Python 3, it is highly recommended to download and install Python3 Anaconda from here: https://www.anaconda.com/download/

Dependencies

Python Packages: Following are the specific versions used when developing the tool, other versions should also be OK, if it is not,please install the corresponding version instead

 biopython==1.79, matplotlib==3.5.3, mygene==3.2.2, numpy==1.21.6, 
 pandas==1.3.5, Requests==2.32.3, rich==14.0.0, scikit_learn==0.20.0, scipy==1.7.3, 
 seaborn==0.13.2, setuptools==68.0.0, statsmodels==0.13.5, umap_learn==0.5.3

Pymol is required for ProTiler-Mut, install it using following command:

conda install -c conda-forge pymol-open-source

On macOS, you need this command in addition:

pip install PyQt5

There are three ways to install ProTiler-Mut

Install ProTiler-Mut through pip

pip install ProTiler-Mut

OR you can install ProTiler-Mut through git clone

git clone https://github.com/MDhewei/ProTiler-Mut.git
cd ProTiler-Mut
pip install -r requirements.txt .

OR you can install ProTiler-Mut through Docker

With Docker no installation is required, the only dependence is Docker itself. Users will completely get rid of all the installation and configuration issues. Docker will do all the dirty work for you!

Docker can be downloaded freely from here: https://store.docker.com/search?offering=community&type=edition

To get an image of ProTiler-Mut, simply execute the following command:

$ docker pull MDhewei/ProTiler-Mut

How to use ProTiler-Mut

1. ProTiler-Mut cluster: Perform the clustering and categorization of functional mutations

   usage: protiler-mut.py cluster [-h] -i INPUTFILE -g GENE_ID -s SAMPLES -c CONTROL [-p PDB] [-n N_CLUSTERS] [-m METHOD]
                                  [-d METRIC] [--pdf-report PDF_REPORT] [-o OUTPUT_FOLDER]

   optional arguments:
   -h, --help            show this help message and exit

   Required arguments for clustering.:

   -i INPUTFILE, --inputfile INPUTFILE
                    The inputfile contains information of tiling mutagenesis screens including symbol of target
                    gene(s),targeted residue position, mutation types and phenotypic scores. Accept .txt, .cvs or
                    .xlsx fileformats. 
   -g GENE_ID, --gene_id GENE_ID
                    The symbol of targeted protein-coding gene, for example: ERCC2
   -s SAMPLES, --samples SAMPLES
                    Comma-separated sample column names.eg., "CISP,OLAP,DOX,CPT"
   -c CONTROL, --control CONTROL
                    Comma-separated control column names.eg., T0

   Optional arguments for clustering.:

   -p PDB, --pdb PDB     File path to the PDB of targeted protein structure.
   -n N_CLUSTERS, --n-clusters N_CLUSTERS
                    Number of clusters for clustering analysis.
   -m METHOD, --method METHOD
                    Clustering linkage method (default: average).
   -d METRIC, --metric METRIC
                    Clustering metric (default: euclidean).
   --pdf-report PDF_REPORT
                    Generate pdf report of clustering, visualization and annotation.
    -o OUTPUT_FOLDER, --output-folder OUTPUT_FOLDER
                    Output folder for saving the results.

2. ProTiler-Mut 3d-rra: Perform "3D-RRA" to call significant substructures in specific mutation clusters

   usage: protiler-mut.py 3d-rra [-h] -g GENE_ID -i INPUTFILE -p PDB -n N [-r NUM_PERMUTATIONS] [-t1 DISTANCE_THRESHOLD1]
                          [-t2 DISTANCE_THRESHOLD2] [-o OUTPUT_FOLDER]

   optional arguments:
   -h, --help            show this help message and exit

   Required arguments for 3D-RRA.:

   -g GENE_ID, --gene_id GENE_ID
                    The symbol of targeted protein-coding gene, for example: ERCC2
   -i INPUTFILE, --inputfile INPUTFILE
                    Path output tables file generated in cluster module which annotat the significant mutations, their
                    cluster assignment and residue position
   -p PDB, --pdb PDB     File path to the PDB of targeted protein structure
   -n N, --n N           Number of mutation samples for RRA analysis

   Optional arguments for 3D-RRA.:

   -r NUM_PERMUTATIONS, --num-permutations NUM_PERMUTATIONS
                    Number of permutations (default: 10000).
   -t1 DISTANCE_THRESHOLD1, --distance-threshold1 DISTANCE_THRESHOLD1
                    Distance threshold to identify clusters of seed mutations on 3D structure(default: 10.0 Å).
   -t2 DISTANCE_THRESHOLD2, --distance-threshold2 DISTANCE_THRESHOLD2
                    Distance threshold to identify surrounding signals near identified seed mutations(default: 5.0 Å).
   -o OUTPUT_FOLDER, --output-folder OUTPUT_FOLDER
                     Output folder for results.

3. ProTiler-Mut ppi-mapping: Perform PPI-mapping for specific mutation or substructure to identify mutaiton-associated PPIs

   usage: protiler-mut.py ppi-mapping [-h] -g GENE_ID -i INPUTFILE -f PDB_FILES -b CHAINS [-t DISTANCE_THRESHOLD]
                               [-o OUTPUT_FOLDER]

   optional arguments:
   -h, --help            show this help message and exit

   Required arguments for PPI-mapping.:

   -g GENE_ID, --gene_id GENE_ID
                    The symbol of targeted protein-coding gene, for example: ERCC2
   -i INPUTFILE, --inputfile INPUTFILE
                    Path output tables file generated in cluster module which annotat the significant mutations, their
                    cluster assignment and residue position, See example file
   -f PDB_FILES, --pdb-files PDB_FILES
                    Comma-separated list of paths of protein complex PDB files involving the target protein.
   -b CHAINS, --chains CHAINS
                    Comma-separated list of corresponding chain IDs of the target protein(e.g., A,B,A).

   Optional arguments for PPI mapping.:

   -t DISTANCE_THRESHOLD, --distance-threshold DISTANCE_THRESHOLD
                    Distance threshold to determine whether two residues interact between among different
                    chains(default: 5.0 Å).
   -o OUTPUT_FOLDER, --output-folder OUTPUT_FOLDER
                    Output folder for results.

For a detailed tutorial to run ProTiler-Mut, please refer to the video at Youtube https://www.youtube.com/@bioinforbricker

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

protiler-mut-0.1.0.tar.gz (36.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

protiler_mut-0.1.0-py3-none-any.whl (37.9 kB view details)

Uploaded Python 3

File details

Details for the file protiler-mut-0.1.0.tar.gz.

File metadata

  • Download URL: protiler-mut-0.1.0.tar.gz
  • Upload date:
  • Size: 36.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/6.7.0 pkginfo/1.5.0.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.7.12

File hashes

Hashes for protiler-mut-0.1.0.tar.gz
Algorithm Hash digest
SHA256 2c73b3834690ae6040adaaca0c7ed1d202c92e5a9f454c792686c67e0723a870
MD5 642128550c854a8a0153cb71d3e36eea
BLAKE2b-256 764fe21af2deb74d85c07271f34ce7245ed8effbf85f90517f257e536701ed5c

See more details on using hashes here.

File details

Details for the file protiler_mut-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: protiler_mut-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 37.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/6.7.0 pkginfo/1.5.0.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.7.12

File hashes

Hashes for protiler_mut-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 16a0c54e7a700105b6355cc6e1cafd83f5343832ae245f71cfbf07b6fd4b77e2
MD5 a106b179c15f5e86c81119fb0e837ea2
BLAKE2b-256 0142686b540336195ba8e810c1f487b05df0cdc14b69fbd329df698a02e4cd4d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page