Skip to main content

Your package description

Project description

Machine Learning-Guided Mapping Sleep-Promoting Volatiles in Aromatic Plants

Project Description

Project Description: This repository presents an advanced machine learning pipeline for identifying sleep-promoting volatile organic compounds (VOCs) from aromatic plants. image

Installation

Dependency

The code has been tested in the following environment:

Package Version
Python 3.8.16
conda 23.5.0
RDKit 2023.3.1

How to Use

Installation

# Use the yaml environment file on the GitHub homepage to directly copy the current environment
conda env create -f environment.yaml -n sleep_model
conda activate sleep_model
conda install jupyter notebook
conda install ipykernel
python -m ipykernel install --user --name sleep_model --display-name   "sleep_model"
jupyter notebook

File Structure

├── data/                   # Input data files
├── data_analysis/          # Data processing and analysis
├── models/                 # Pretrained base model files for Stacking model training
│   ├── RF/
│   │   ├── rf_MACCSkeys_random_0.ipynb
│   │   ├── rf_RDkit_random_0.ipynb
│   ├── SVM/
│   │   ├── svm_MACCSkeys_random_3.ipynb
│   ├── XGB/
│   │   ├── xgb_MACCSkeys_random_0.ipynb
│   │── stacking_predict.ipynb
├── predict_smiles.py 
└── README.md

These four models are the base models that we use to train the final stacking model.

Predicting

python predict_smiles.py --smiles "CC(=O)OC1=CC=CC=C1C(=O)O"

Batch prediction from CSV

Predict for a CSV file containing a SMILES column (default column name: SMILES):

python predict_smiles.py --csv path/to/input.csv --out path/to/output.csv

Customize the SMILES column name and file encoding if needed:

python predict_smiles.py \
  --csv path/to/input.csv \
  --out path/to/output.csv \
  --smiles-column SMILES \
  --input-encoding utf-8

The output CSV includes: smiles, prediction, probability, and base-model probabilities (rf_maccs_p0/p1, rf_rdkit_p0/p1, svm_maccs_p0/p1, xgb_maccs_p0/p1).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sleep_model-0.1.0.tar.gz (102.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sleep_model-0.1.0-py3-none-any.whl (102.3 MB view details)

Uploaded Python 3

File details

Details for the file sleep_model-0.1.0.tar.gz.

File metadata

  • Download URL: sleep_model-0.1.0.tar.gz
  • Upload date:
  • Size: 102.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.8.16

File hashes

Hashes for sleep_model-0.1.0.tar.gz
Algorithm Hash digest
SHA256 37068d1f643b11f4be0b6f61183b4e586fbc0dd60d3544f6543cbcd941586c23
MD5 610fc637be14d29b3cb31f39d37019c5
BLAKE2b-256 4f49d8a7cc60f0667d6ddd907169441901d4ff862e4c666c45a4ec74c127ec39

See more details on using hashes here.

File details

Details for the file sleep_model-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: sleep_model-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 102.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.8.16

File hashes

Hashes for sleep_model-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e7f4aff7f6d5299f698fe39ee82465f5a141a621f278878fc29d1162d42b8ce4
MD5 734a0dd3b481408dc30c9d218de179b0
BLAKE2b-256 bb0a7da3e6466014c55c8bd32d84c35ef2b12b20afead680370f4230bcd07075

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page