Your package description
Project description
Machine Learning-Guided Mapping Sleep-Promoting Volatiles in Aromatic Plants
Project Description
Project Description:
This repository presents an advanced machine learning pipeline for identifying sleep-promoting volatile organic compounds (VOCs) from aromatic plants.
Installation
Dependency
The code has been tested in the following environment:
| Package | Version |
|---|---|
| Python | 3.8.16 |
| conda | 23.5.0 |
| RDKit | 2023.3.1 |
How to Use
Installation
# Use the yaml environment file on the GitHub homepage to directly copy the current environment
conda env create -f environment.yaml -n sleep_model
conda activate sleep_model
conda install jupyter notebook
conda install ipykernel
python -m ipykernel install --user --name sleep_model --display-name "sleep_model"
jupyter notebook
File Structure
├── data/ # Input data files
├── data_analysis/ # Data processing and analysis
├── models/ # Pretrained base model files for Stacking model training
│ ├── RF/
│ │ ├── rf_MACCSkeys_random_0.ipynb
│ │ ├── rf_RDkit_random_0.ipynb
│ ├── SVM/
│ │ ├── svm_MACCSkeys_random_3.ipynb
│ ├── XGB/
│ │ ├── xgb_MACCSkeys_random_0.ipynb
│ │── stacking_predict.ipynb
├── predict_smiles.py
└── README.md
These four models are the base models that we use to train the final stacking model.
Predicting
python predict_smiles.py --smiles "CC(=O)OC1=CC=CC=C1C(=O)O"
Batch prediction from CSV
Predict for a CSV file containing a SMILES column (default column name: SMILES):
python predict_smiles.py --csv path/to/input.csv --out path/to/output.csv
Customize the SMILES column name and file encoding if needed:
python predict_smiles.py \
--csv path/to/input.csv \
--out path/to/output.csv \
--smiles-column SMILES \
--input-encoding utf-8
The output CSV includes: smiles, prediction, probability, and base-model probabilities (rf_maccs_p0/p1, rf_rdkit_p0/p1, svm_maccs_p0/p1, xgb_maccs_p0/p1).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sleep_model-0.1.0.tar.gz.
File metadata
- Download URL: sleep_model-0.1.0.tar.gz
- Upload date:
- Size: 102.0 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.8.16
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
37068d1f643b11f4be0b6f61183b4e586fbc0dd60d3544f6543cbcd941586c23
|
|
| MD5 |
610fc637be14d29b3cb31f39d37019c5
|
|
| BLAKE2b-256 |
4f49d8a7cc60f0667d6ddd907169441901d4ff862e4c666c45a4ec74c127ec39
|
File details
Details for the file sleep_model-0.1.0-py3-none-any.whl.
File metadata
- Download URL: sleep_model-0.1.0-py3-none-any.whl
- Upload date:
- Size: 102.3 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.8.16
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e7f4aff7f6d5299f698fe39ee82465f5a141a621f278878fc29d1162d42b8ce4
|
|
| MD5 |
734a0dd3b481408dc30c9d218de179b0
|
|
| BLAKE2b-256 |
bb0a7da3e6466014c55c8bd32d84c35ef2b12b20afead680370f4230bcd07075
|