Skip to main content

Prediction of allergenic and non-allergenic peptides

Project description

AlgPred3

A computational framework for predicting and designing allergenic and non-allergenic peptides using machine learning and composition-based features.


📌 Introduction

AlgPred3 is developed to identify allergenic potential of peptides based on their primary sequence. It integrates feature-based machine learning approaches with curated allergen datasets to provide robust predictions. 🔗 Visit the web server for more information: Algpred3

The tool supports:

  • Prediction of allergenic peptides
  • Protein scanning for allergenic regions
  • Design of peptide mutants

It uses compositional and physicochemical descriptors such as:

  • AAC (Amino Acid Composition)
  • DPC (Dipeptide Composition)
  • PAAC, APAAC
  • CeTD, DDR, BTC

📚 Reference

AlgPred3 Raghava Group, IIIT-Delhi


🧪 Quick Start for Reproducibility

# 1. Clone the repository
git clone https://github.com/raghavagps/algpred3.git
cd algpred3
# 2. Set up the environment (conda recommended)
conda env create -f environment.yml
conda activate algpred3

# 3. Run help
python algpred3.py -h

# 4. Run prediction
python algpred3.py -i example.fasta -o output.csv -j pred

🛠️ Installation Options

🧰 Pip Installation

pip install algpred3

Check options:

algpred3 -h

🔹 Standalone Installation

AlgPred3 is written in Python 3 and requires:

✅ Required Libraries

python=3.8+

📦 Install Dependencies

pip install scikit-learn
pip install pandas
pip install joblib

🔹 Automatic Dependency Setup

AlgPred3 automatically downloads required files during runtime:

  • Model file (algpred3_model.sav)
  • Feature column file (columns.csv)
  • pfeature binary

This behavior is implemented directly in the script.


⚠️ Important Notes

  • First run requires internet connection for auto-download.
  • Input sequences must contain only valid amino acids: ACDEFGHIKLMNPQRSTVWY
  • Invalid sequences are removed and logged in:
stand_error.log

🔬 Classification

AlgPred3 classifies peptides into:

  • Allergen
  • Non-Allergen

based on machine learning predictions using extracted sequence features.


🚀 Usage

🔹 Minimum Usage

python algpred3.py -h

🔹 Full Usage

usage: algpred3.py [-h]
                  -i INPUT
                  [-o OUTPUT]
                  -j {pred,scan,des}
                  [-t THRESHOLD]
                  [-l LENGTH]
                  [-s STEP]

📌 Arguments

Argument Description
-i INPUT Input FASTA or sequence file
-o OUTPUT Output CSV file (default: final_predictions.csv)
-j Job type: pred, scan, des
-t Threshold (default: 0.5)
-l Window length (scan mode only)
-s Step size (default: 1)

📂 Input & Output Files

✅ Input Formats

  1. FASTA format
  2. Plain text format (one sequence per line)

✅ Output File

  • Results are saved in CSV format

  • Includes:

    • Sequence ID
    • Prediction score
    • Allergen / Non-Allergen status

🔍 Jobs & Features

🔹 Job Types

Job Description
🔹 pred Predict allergenic potential of sequences
🔹 scan Identify allergenic regions in proteins
🔹 des Generate and evaluate all possible mutants

🔹 Scan Mode

  • Uses sliding window approach
  • Generates peptide fragments
  • Predicts allergenic regions

🔹 Design Mode

  • Generates all single amino acid mutants

  • Predicts allergenicity of each mutant

  • Useful for:

    • Peptide optimization
    • Allergenicity reduction

⚙️ Workflow

  1. Input sequence validation
  2. Cleaning invalid sequences
  3. Feature extraction (pfeature)
  4. Feature selection
  5. Model prediction
  6. Output generation

📑 Package Contents

File Description
algpred3.py Main prediction script
columns.csv Feature selection file
algpred3_model.sav Trained ML model
pfeature_comp Feature extraction binary
example.fasta Sample input

📦 PIP Installation (Again for Reference)

pip install algpred3

🚀 Start predicting allergenic peptides with AlgPred3 today! 🔗 Visit: Algpred3

Project details


Release history Release notifications | RSS feed

This version

1.0

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

algpred3-1.0.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

algpred3-1.0-py3-none-any.whl (7.0 kB view details)

Uploaded Python 3

File details

Details for the file algpred3-1.0.tar.gz.

File metadata

  • Download URL: algpred3-1.0.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for algpred3-1.0.tar.gz
Algorithm Hash digest
SHA256 54c4cec0327bce1de98408ad3b359e832577e3abdc27fa4c62a8d4d834861a30
MD5 7d8e99ba727a5f6e704a8b63ed51eb54
BLAKE2b-256 03da3c85b47164173933d9dee7f99a88f335b8b58c9968f5e9e83c63d15794b3

See more details on using hashes here.

File details

Details for the file algpred3-1.0-py3-none-any.whl.

File metadata

  • Download URL: algpred3-1.0-py3-none-any.whl
  • Upload date:
  • Size: 7.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for algpred3-1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d88d6bb23de1d2d3fc10fd878103711f9fe0b3d4c3d345754aba411976e9b8d2
MD5 915d2eabcd5931c5aef5352aea1f8c83
BLAKE2b-256 35078a1f103430b259a1fccdcedc909dcd97d09a7d850d82608c6f4a7962e6f5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page