Skip to main content

Complete pipelines and helper tools for data science and machine learning projects.

Project description

dragon-ml-toolbox

A collection of machine learning pipelines and utilities, structured as modular packages for easy reuse and installation. This package has no base dependencies, allowing for lightweight and customized virtual environments.

Features:

  • Modular scripts for data science workflows, including data exploration, ETL, model training, evaluation, and inference.
  • Support for PyTorch-based models, ensemble learning (XGBoost, LightGBM), and MICE imputation.

Installation

Python 3.12

Via PyPI

Install the latest stable release from PyPI:

Using pip:

pip install dragon-ml-toolbox

Using UV:

uv add dragon-ml-toolbox

Via conda-forge (Deprecated)

Install from the conda-forge channel:

conda install -c conda-forge dragon-ml-toolbox

Modular Installation

This toolbox is designed as a collection of mutually exclusive environments due to conflicting core dependencies, except APP bundlers (PyInstaller/Nuitka).

  • Rule: Create a fresh virtual environment for each module to use.

📦 Core Machine Learning Toolbox [ML]

Installs a comprehensive set of tools for typical data science workflows, including data manipulation, modeling, and evaluation using PyTorch.

➡️ Prerequisite: PyTorch required. Follow the official instructions: PyTorch website

pip install "dragon-ml-toolbox[ML]"

Modules:

constants
custom_logger
data_exploration
ETL_cleaning
ETL_engineering
math_utilities
ML_callbacks
ML_chaining_inference
ML_chaining_utilities
ML_configuration
ML_datasetmaster
ML_evaluation_captum
ML_evaluation_multi
ML_evaluation
ML_inference
ML_models
ML_models_advanced
ML_optimization_pareto
ML_optimization
ML_scaler
ML_sequence_datasetmaster
ML_sequence_evaluation
ML_sequence_inference
ML_sequence_models
ML_trainer
ML_utilities
ML_vision_datasetmaster
ML_vision_evaluation
ML_vision_inference
ML_vision_models
ML_vision_transformers
optimization_tools
path_manager
schema
serde
SQL
utilities

Wrappers for some pytorch_tabular models are available:

pip install "dragon-ml-toolbox[ML,py-tab]"

# Extra Modules:
ML_models_pytab
ML_configuration_pytab

🌳 Ensemble Learning [ensemble]

Comprehensive set of tools for typical data science workflows focused on XGBoost and LightGBM.

pip install "dragon-ml-toolbox[ensemble]"

Modules:

constants
custom_logger
data_exploration
ensemble_evaluation
ensemble_inference
ensemble_learning
ETL_cleaning
ETL_engineering
math_utilities
optimization_tools
path_manager
PSO_optimization
schema
serde
SQL
utilities

🔬 MICE Imputation and Variance Inflation Factor [mice]

Utilities for advanced data cleaning and statistical checks. Features Multiple Imputation by Chained Equations (MICE) for handling missing data and Variance Inflation Factor (VIF) analysis to detect multicollinearity in features.

pip install "dragon-ml-toolbox[mice]"

Modules:

constants
custom_logger
math_utilities
MICE_imputation
serde
VIF_factor
path_manager
utilities

📋 Excel File Handling [excel]

Installs dependencies required to process and handle .xlsx or .xls files.

pip install "dragon-ml-toolbox[excel]"

Modules:

custom_logger
excel_handler
path_manager

🎰 GUI for Boosting Algorithms (XGBoost, LightGBM) [gui-boost]

GUI tools compatible with XGBoost and LightGBM models used for inference.

pip install "dragon-ml-toolbox[gui-boost]"

Modules:

constants
custom_logger
GUI_tools
ensemble_inference
path_manager
schema
serde

🤖 GUI for PyTorch Models [gui-torch]

GUI tools compatible with PyTorch models used for inference.

pip install "dragon-ml-toolbox[gui-torch]"

Modules:

constants
custom_logger
GUI_tools
ML_models
ML_models_advanced
ML_sequence_models
ML_vision_models # Requires: torchvision and Pillow
ML_inference
ML_sequence_inference
ML_vision_inference # Requires: torchvision and Pillow
ML_vision_transformers # Requires: torchvision and Pillow
ML_scaler
path_manager
schema

⚒️ APP bundlers

Dependencies required to compile applications, inference scripts, or GUIs into standalone executables (.exe or binary) for distribution. Choose your preferred backend:

pip install "dragon-ml-toolbox[pyinstaller]"
pip install "dragon-ml-toolbox[nuitka]"

Usage

After installation, import modules like this:

from ml_tools.serde import serialize_object, deserialize_object
from ml_tools.custom_logger import train_logger

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dragon_ml_toolbox-19.5.0.tar.gz (280.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dragon_ml_toolbox-19.5.0-py3-none-any.whl (324.7 kB view details)

Uploaded Python 3

File details

Details for the file dragon_ml_toolbox-19.5.0.tar.gz.

File metadata

  • Download URL: dragon_ml_toolbox-19.5.0.tar.gz
  • Upload date:
  • Size: 280.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.17 {"installer":{"name":"uv","version":"0.9.17","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for dragon_ml_toolbox-19.5.0.tar.gz
Algorithm Hash digest
SHA256 965a50b1182d2568be5fbd600fed4828fea8beeac437395a05ad59a4d105a000
MD5 38f5a104f71ae4cd02d2d8d5763136ae
BLAKE2b-256 5222e13f7ec8c9d7079aec3b872162770ad8775418811fc28348206cc6970729

See more details on using hashes here.

File details

Details for the file dragon_ml_toolbox-19.5.0-py3-none-any.whl.

File metadata

  • Download URL: dragon_ml_toolbox-19.5.0-py3-none-any.whl
  • Upload date:
  • Size: 324.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.17 {"installer":{"name":"uv","version":"0.9.17","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for dragon_ml_toolbox-19.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ea3bbdea1fa0f74442dd7383654c949a8bdf8a314f57fd345093237052b88531
MD5 4237a367cdccc87af667a46958e2a433
BLAKE2b-256 e701fec2a0d3a4fee7f5c6207b916e6f8e96fee9e04a445576b1df7af1d60878

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page