Pipeline for comprehensive analysis of tiling mutagenesis data
ProTiler-Mut
Introduction of ProTiler-Mut
Hi, this is ProTiler-Mut, a computational pipeline designed for comprehensive analysis of tiling mutagenesis screen data.
I have three major functions: 1) Clustering and categorization of functional mutations from tiling mutagenesis screens
2) "3D-RRA" module for robust identification of functional substructures from identified mutation clusters
3) PPI-mapping for a specific mutation or substructure to identify mutation-associated PPIs
Hope you enjoy playing with me ^o^!
For any questions or bug reports, please contact hwkobe.1027@gmail.com or whe3@mdanderson.org
Installation
If Anaconda (or miniconda) is not installed with Python 3, it is highly recommended to download and install Python3 Anaconda from here: https://www.anaconda.com/download/
Dependencies
Python packages: the following are the specific versions used when developing the tool. Other versions should also work; if they do not, please install the listed version instead.
biopython==1.79, matplotlib==3.5.3, mygene==3.2.2, numpy==1.21.6,
pandas==1.3.5, Requests==2.32.3, rich==14.0.0, scikit_learn==0.20.0, scipy==1.7.3,
seaborn==0.13.2, setuptools==68.0.0, statsmodels==0.13.5, umap_learn==0.5.3
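To see how an existing environment compares with the pinned versions above, a small check along the following lines may help. This is a minimal sketch, assuming Python 3.8 or newer (for `importlib.metadata`); the helper names `installed_version` and `compare_versions` are hypothetical and not part of ProTiler-Mut, and only a subset of the pins is shown.

```python
from importlib import metadata

# Pinned versions copied from the dependency list above (subset shown).
PINNED = {
    "biopython": "1.79",
    "numpy": "1.21.6",
    "pandas": "1.3.5",
    "scipy": "1.7.3",
}


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def compare_versions(pinned, lookup=installed_version):
    """Map each package name to (pinned, installed, status)."""
    report = {}
    for name, wanted in pinned.items():
        found = lookup(name)
        if found is None:
            status = "missing"
        elif found == wanted:
            status = "match"
        else:
            status = "differs"
        report[name] = (wanted, found, status)
    return report


if __name__ == "__main__":
    for name, (wanted, found, status) in compare_versions(PINNED).items():
        print(f"{name}: pinned {wanted}, installed {found} ({status})")
```

A status of "differs" is not necessarily a problem, since other versions are expected to work in most cases.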
PyMOL is required for ProTiler-Mut; install it using the following command:
conda install -c conda-forge pymol-open-source
On macOS, you need this command in addition:
pip install PyQt5
There are three ways to install ProTiler-Mut
Install ProTiler-Mut through pip
pip install ProTiler-Mut
OR you can install ProTiler-Mut through git clone
git clone https://github.com/MDhewei/ProTiler-Mut.git
cd ProTiler-Mut
pip install -r requirements.txt .
OR you can install ProTiler-Mut through Docker
With Docker, no installation is required; the only dependency is Docker itself. You avoid the installation and configuration issues entirely. Docker will do all the dirty work for you!
Docker can be downloaded freely from here: https://store.docker.com/search?offering=community&type=edition
To get an image of ProTiler-Mut, simply execute the following command:
docker pull mdhewei/protiler-mut
How to use ProTiler-Mut
1. ProTiler-Mut cluster: Perform the clustering and categorization of functional mutations
usage: protiler-mut.py cluster [-h] -i INPUTFILE -g GENE_ID -s SAMPLES -c CONTROL [-p PDB] [-n N_CLUSTERS] [-m METHOD]
[-d METRIC] [--pdf-report PDF_REPORT] [-o OUTPUT_FOLDER]
optional arguments:
-h, --help show this help message and exit
Required arguments for clustering.:
-i INPUTFILE, --inputfile INPUTFILE
The input file contains information on tiling mutagenesis screens, including the symbol of the target
gene(s), targeted residue position, mutation types and phenotypic scores. Accepts .txt, .csv or
.xlsx file formats.
-g GENE_ID, --gene_id GENE_ID
The symbol of targeted protein-coding gene, for example: ERCC2
-s SAMPLES, --samples SAMPLES
Comma-separated sample column names, e.g., "CISP,OLAP,DOX,CPT"
-c CONTROL, --control CONTROL
Comma-separated control column names, e.g., T0
Optional arguments for clustering.:
-p PDB, --pdb PDB File path to the PDB of targeted protein structure.
-n N_CLUSTERS, --n-clusters N_CLUSTERS
Number of clusters for clustering analysis.
-m METHOD, --method METHOD
Clustering linkage method (default: average).
-d METRIC, --metric METRIC
Clustering metric (default: euclidean).
--pdf-report PDF_REPORT
Generate pdf report of clustering, visualization and annotation.
-o OUTPUT_FOLDER, --output-folder OUTPUT_FOLDER
Output folder for saving the results.
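As an illustration of how the options above fit together, the following sketch assembles a `cluster` command line from Python. It uses only flags listed in the usage text; the input file name, cluster count and output folder are hypothetical placeholders, the helper `build_cluster_command` is not part of ProTiler-Mut, and it assumes `protiler-mut.py` is on your PATH after installation (Python 3.8+ for `shlex.join`).

```python
import shlex


def build_cluster_command(inputfile, gene_id, samples, control,
                          n_clusters=None, output_folder=None):
    """Assemble argv for `protiler-mut.py cluster` from flags in the usage text."""
    cmd = [
        "protiler-mut.py", "cluster",
        "-i", inputfile,
        "-g", gene_id,
        "-s", ",".join(samples),   # sample columns are passed comma-separated
        "-c", ",".join(control),   # control columns are passed comma-separated
    ]
    if n_clusters is not None:
        cmd += ["-n", str(n_clusters)]
    if output_folder is not None:
        cmd += ["-o", output_folder]
    return cmd


cmd = build_cluster_command(
    inputfile="screen_data.txt",       # hypothetical file name
    gene_id="ERCC2",
    samples=["CISP", "OLAP", "DOX", "CPT"],
    control=["T0"],
    n_clusters=3,                      # hypothetical value
    output_folder="cluster_results",   # hypothetical folder
)
print(shlex.join(cmd))
# To actually run it: import subprocess; subprocess.run(cmd, check=True)
```

Passing the arguments as a list (rather than one shell string) avoids quoting problems if a path contains spaces.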
2. ProTiler-Mut 3d-rra: Perform "3D-RRA" to call significant substructures in specific mutation clusters
usage: protiler-mut.py 3d-rra [-h] -g GENE_ID -i INPUTFILE -p PDB -n N [-r NUM_PERMUTATIONS] [-t1 DISTANCE_THRESHOLD1]
[-t2 DISTANCE_THRESHOLD2] [-o OUTPUT_FOLDER]
optional arguments:
-h, --help show this help message and exit
Required arguments for 3D-RRA.:
-g GENE_ID, --gene_id GENE_ID
The symbol of targeted protein-coding gene, for example: ERCC2
-i INPUTFILE, --inputfile INPUTFILE
Path to the output table file generated by the cluster module, which annotates the significant mutations, their
cluster assignment and residue position
-p PDB, --pdb PDB File path to the PDB of targeted protein structure
-n N, --n N Number of mutation samples for RRA analysis
Optional arguments for 3D-RRA.:
-r NUM_PERMUTATIONS, --num-permutations NUM_PERMUTATIONS
Number of permutations (default: 10000).
-t1 DISTANCE_THRESHOLD1, --distance-threshold1 DISTANCE_THRESHOLD1
Distance threshold to identify clusters of seed mutations on 3D structure (default: 10.0 Å).
-t2 DISTANCE_THRESHOLD2, --distance-threshold2 DISTANCE_THRESHOLD2
Distance threshold to identify surrounding signals near identified seed mutations (default: 5.0 Å).
-o OUTPUT_FOLDER, --output-folder OUTPUT_FOLDER
Output folder for results.
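To give a feel for what a distance threshold such as `-t1` means on a 3D structure, the sketch below groups residues whose coordinates lie within a cutoff of each other. This is only an illustration under stated assumptions: it is not the 3D-RRA algorithm itself, the single-linkage grouping is a swapped-in simplification, the function `group_by_distance` is hypothetical, and the coordinates are made-up toy values (Python 3.8+ for `math.dist`).

```python
import math


def group_by_distance(coords, threshold=10.0):
    """Group residues into connected components in which linked residues
    lie within `threshold` angstroms of each other (single linkage)."""
    residues = list(coords)
    parent = {r: r for r in residues}

    def find(r):
        # Follow parent pointers to the component root, shortening the path.
        while parent[r] != r:
            parent[r] = parent[parent[r]]
            r = parent[r]
        return r

    for i, a in enumerate(residues):
        for b in residues[i + 1:]:
            if math.dist(coords[a], coords[b]) <= threshold:
                parent[find(a)] = find(b)  # merge the two components

    groups = {}
    for r in residues:
        groups.setdefault(find(r), []).append(r)
    return sorted(sorted(g) for g in groups.values())


# Hypothetical residue positions mapped to toy coordinates in angstroms.
toy_coords = {
    10: (0.0, 0.0, 0.0),
    11: (3.8, 0.0, 0.0),
    45: (6.0, 5.0, 0.0),
    200: (40.0, 40.0, 40.0),
}
print(group_by_distance(toy_coords, threshold=10.0))
```

Lowering the threshold splits groups apart, which is why the choice of cutoff affects how many candidate substructures are found.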
3. ProTiler-Mut ppi-mapping: Perform PPI-mapping for a specific mutation or substructure to identify mutation-associated PPIs
usage: protiler-mut.py ppi-mapping [-h] -g GENE_ID -i INPUTFILE -f PDB_FILES -b CHAINS [-t DISTANCE_THRESHOLD]
[-o OUTPUT_FOLDER]
optional arguments:
-h, --help show this help message and exit
Required arguments for PPI-mapping.:
-g GENE_ID, --gene_id GENE_ID
The symbol of targeted protein-coding gene, for example: ERCC2
-i INPUTFILE, --inputfile INPUTFILE
Path to the output table file generated by the cluster module, which annotates the significant mutations, their
cluster assignment and residue position. See example file
-f PDB_FILES, --pdb-files PDB_FILES
Comma-separated list of paths of protein complex PDB files involving the target protein.
-b CHAINS, --chains CHAINS
Comma-separated list of corresponding chain IDs of the target protein (e.g., A,B,A).
Optional arguments for PPI mapping.:
-t DISTANCE_THRESHOLD, --distance-threshold DISTANCE_THRESHOLD
Distance threshold to determine whether two residues on different chains
interact (default: 5.0 Å).
-o OUTPUT_FOLDER, --output-folder OUTPUT_FOLDER
Output folder for results.
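The distance criterion behind the `-t` option can be pictured with a small contact search between two chains. The sketch below is a minimal illustration, not ProTiler-Mut's implementation: the function `interface_residues` is hypothetical, atoms are given as plain `(residue_number, (x, y, z))` tuples rather than parsed from a PDB file, and all coordinates are toy values (Python 3.8+ for `math.dist`).

```python
import math


def interface_residues(target_atoms, partner_atoms, threshold=5.0):
    """Map each target-chain residue to the partner-chain residues that have
    at least one atom within `threshold` angstroms of it."""
    contacts = {}
    for t_res, t_xyz in target_atoms:
        for p_res, p_xyz in partner_atoms:
            if math.dist(t_xyz, p_xyz) <= threshold:
                contacts.setdefault(t_res, set()).add(p_res)
    return contacts


# Toy atoms: (residue number, coordinates in angstroms).
target_chain = [
    (100, (0.0, 0.0, 0.0)),
    (100, (1.5, 0.0, 0.0)),
    (101, (12.0, 0.0, 0.0)),
]
partner_chain = [
    (7, (4.0, 0.0, 0.0)),
    (8, (30.0, 0.0, 0.0)),
]

contacts = interface_residues(target_chain, partner_chain, threshold=5.0)

# Hypothetical mutated positions; keep those that sit at the interface.
mutated_positions = {100, 101}
at_interface = {pos for pos in mutated_positions if pos in contacts}
print(contacts, at_interface)
```

The all-against-all loop is fine for a toy example; for full complexes a spatial index (for example a neighbor search from a structure library) would be the more practical choice.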
For a detailed tutorial on running ProTiler-Mut, please refer to the video on YouTube: https://www.youtube.com/@bioinforbricker