Skip to main content

CPred: A deep learning framework for predicting the charge state distribution in modified and unmodified peptides in ESI

Project description



PyPI License

CPred: Charge State Prediction for Modified and Unmodified Peptides in Electrospray Ionization



Introduction

CPred is a neural network capable of predicting the charge state distribution for modified and unmodified peptides in electrospray ionisation. By summarising the modifications as measures of mass and atomic compositions, the model is capable of generalising unseen modifications during training.

The model is available as a Python package, installable through Pypi and conda. This also makes it possible to use from the command-line-interface.

Installation

install with bioconda install with pip

Currently, CPred is solely available in Pypi. Bioconda will be released soon. Install with conda, using the bioconda and conda-forge channels: conda install -c bioconda -c conda-forge CPred

Or install with pip: pip install CPred

How to use

Python module

A reproducible example is shown in the tests folder.

from CPred import FeatureEngineering
from CPred import CPred_NN
import pandas as pd

test_dictionary = {
    "Peptide_sequence": ["PEPTIDE", "EDITPEP"],
    "Modifications": ["1|Carbamidomethyl", "2|Oxidation"]
}

# Turn dictionary into a Pandas dataframe for feature engineering
test_df = pd.DataFrame(test_dictionary)

# Do feature engineering
test_features = FeatureEngineering.feature_engineering(test_df)

# Saving to parquet
test_features.to_parquet(f"tests/tests_input/test.parquet", index=False)

# Neural network predictions
input_model = "tests/tests_input/test.parquet"
model_directory = "CPred/Data/Models/CPred_model_v1.keras"
output_directory = "tests/tests_output/"
CPred_NN.prediction_model(input_model, model_directory, output_directory)

The feature_engineering function returns a pandas dataframe with the generated features. As the CPred neural network requires the data in Parquet format, it is firstly saved.

Command-line interface

In order to use CPred from the command-line interface, run:

Documentation

python CPred_main.py FeatureEngineering --help
python CPred_main.py prediction --help
python CPred_main.py retraining --help

Running

In order to run CPred, all arguments may be specificied as documented.

python CPred_main.py FeatureEngineering -i ..\tests\tests_input\test.xlsx

Citation

When using CPred for your research, please cite: Vilenne, F., Agten, A., Appeltans, S., Ertaylan, G., & Valkenborg, D. (2024). CPred: Charge State Prediction for Modified and Unmodified Peptides in Electrospray Ionization. Analytical Chemistry. https://doi.org/10.1021/acs.analchem.4c01107

Q&A

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cpred-0.1.4.tar.gz (39.3 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

CPred-0.1.4-py3-none-any.whl (12.9 MB view details)

Uploaded Python 3

File details

Details for the file cpred-0.1.4.tar.gz.

File metadata

  • Download URL: cpred-0.1.4.tar.gz
  • Upload date:
  • Size: 39.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.13

File hashes

Hashes for cpred-0.1.4.tar.gz
Algorithm Hash digest
SHA256 e02316537031cd604513b0ea4155e365bd40942fad2e16ac91299a28969120be
MD5 77576ede28971e27cd4d2fc0254de5cd
BLAKE2b-256 84c02883b35455955e569cffa1ce342d6bc7cf84b6fa50be0a1e7250e44479b6

See more details on using hashes here.

File details

Details for the file CPred-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: CPred-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 12.9 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.13

File hashes

Hashes for CPred-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 3333bb80b5d745367fa1a55c76feed88387d09ad42bcdd9e8f55c270740f9fe1
MD5 1709e4d13cc6bebfaa422ab726abf15d
BLAKE2b-256 2b5872ffecfaa9ec06eff0bd58c6babc8eaa67ad9cea14c2fe4b54a5d793a9ce

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page