CPred: A deep learning framework for predicting the charge state distribution in modified and unmodified peptides in ESI
CPred: Charge State Prediction for Modified and Unmodified Peptides in Electrospray Ionization
Introduction
CPred is a neural network that predicts the charge state distribution of modified and unmodified peptides in electrospray ionisation (ESI). Because modifications are summarised by their mass and atomic composition rather than by name, the model can generalise to modifications that were not seen during training.
The model is available as a Python package, installable through PyPI and conda, and can also be used from the command-line interface.
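To illustrate what summarising a modification by mass and atomic composition can look like, the sketch below derives a monoisotopic mass shift from an elemental composition. The compositions for Carbamidomethyl and Oxidation are the standard ones; the function name and dictionary layout are hypothetical and are not part of the CPred API, and the features CPred actually computes may differ.

```python
# Illustrative only: this is not CPred's internal feature code.
# Monoisotopic masses (in Da) of the elements used below.
MONOISOTOPIC_MASS = {
    "H": 1.00782503207,
    "C": 12.0,
    "N": 14.0030740048,
    "O": 15.99491461956,
}

# Atoms added to the peptide by each modification.
MODIFICATION_COMPOSITION = {
    "Carbamidomethyl": {"C": 2, "H": 3, "N": 1, "O": 1},
    "Oxidation": {"O": 1},
}


def summarise_modification(name):
    """Return the atomic composition and monoisotopic mass shift of a modification."""
    composition = MODIFICATION_COMPOSITION[name]
    mass_shift = sum(MONOISOTOPIC_MASS[element] * count
                     for element, count in composition.items())
    return {"composition": composition, "mass_shift": mass_shift}


summary = summarise_modification("Carbamidomethyl")
print(summary["composition"], round(summary["mass_shift"], 4))
```

A representation like this depends only on the atoms a modification adds, so a modification with an unfamiliar name can still be described in the same terms.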
Installation
CPred is currently available only on PyPI; a Bioconda release is planned.
Once the Bioconda package is released, install with conda, using the bioconda and conda-forge channels:
conda install -c bioconda -c conda-forge CPred
Or install with pip:
pip install CPred
How to use
Python module
A reproducible example is shown in the tests folder.
from CPred import FeatureEngineering
from CPred import CPred_NN
import pandas as pd
test_dictionary = {
"Peptide_sequence": ["PEPTIDE", "EDITPEP"],
"Modifications": ["1|Carbamidomethyl", "2|Oxidation"]
}
# Turn dictionary into a Pandas dataframe for feature engineering
test_df = pd.DataFrame(test_dictionary)
# Do feature engineering
test_features = FeatureEngineering.feature_engineering(test_df)
# Save the features to Parquet (the output folder must already exist)
test_features.to_parquet("tests/tests_input/test.parquet", index=False)
# Neural network predictions
input_model = "tests/tests_input/test.parquet"
model_directory = "CPred/Data/Models/CPred_model_v1.keras"
output_directory = "tests/tests_output/"
CPred_NN.prediction_model(input_model, model_directory, output_directory)
The feature_engineering function returns a pandas DataFrame with the generated features. Because the CPred neural network reads its input in Parquet format, the features are first saved to a Parquet file, which is then passed to prediction_model.
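The Modifications column in the example above appears to follow a `position|name` pattern. As a minimal sketch under that assumption, the helper below splits one such entry into its two parts so that input can be checked before feature engineering. The function `parse_modification` is hypothetical and not part of CPred, and how CPred encodes multiple or absent modifications is not covered here.

```python
def parse_modification(entry):
    """Split a 'position|name' string into an (int position, str name) tuple.

    Assumes exactly one modification per entry, as in the example above.
    """
    position, separator, name = entry.partition("|")
    if not separator or not position.isdigit() or not name:
        raise ValueError(f"Expected 'position|name', got {entry!r}")
    return int(position), name


sequences = ["PEPTIDE", "EDITPEP"]
modifications = ["1|Carbamidomethyl", "2|Oxidation"]

# Report the parsed position and name next to each peptide sequence.
for sequence, entry in zip(sequences, modifications):
    position, name = parse_modification(entry)
    print(sequence, position, name)
```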
Command-line interface
To use CPred from the command-line interface, run CPred_main.py with one of its subcommands:
Documentation
python CPred_main.py FeatureEngineering --help
python CPred_main.py prediction --help
python CPred_main.py retraining --help
Running
To run CPred, specify the arguments as documented in the help output, for example:
python CPred_main.py FeatureEngineering -i ..\tests\tests_input\test.xlsx
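The example path above uses Windows-style backslashes. As a hedged sketch, the same call can be assembled from Python with `pathlib`, which produces the right separators on each platform. Only the `FeatureEngineering` subcommand and the `-i` flag shown above are used; the helper name `build_feature_engineering_command` is hypothetical, and the script location is assumed to be the current working directory.

```python
import sys
from pathlib import Path


def build_feature_engineering_command(input_file, script="CPred_main.py"):
    """Assemble the argument list for the FeatureEngineering subcommand."""
    return [sys.executable, script, "FeatureEngineering", "-i", str(Path(input_file))]


input_file = Path("..") / "tests" / "tests_input" / "test.xlsx"
command = build_feature_engineering_command(input_file)
print(command)

# To actually run it (requires CPred_main.py and the input file to exist):
# import subprocess
# subprocess.run(command, check=True)
```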
Citation
When using CPred for your research, please cite: Vilenne, F., Agten, A., Appeltans, S., Ertaylan, G., & Valkenborg, D. (2024). CPred: Charge State Prediction for Modified and Unmodified Peptides in Electrospray Ionization. Analytical Chemistry. https://doi.org/10.1021/acs.analchem.4c01107