GitHub Pages
PyPI

ml_pipeline

A centralized pattern for creating Machine Learning pipelines


Installation

pip install kabbes_ml_pipeline


Description

Contains a host of tools for standardizing the format of machine learning pipelines. Provides methods for querying databases, cleaning data, preprocessing data, model operations, exporting results, and more. Each machine learning project is a "child" of this template, with the ability to overwrite any of the default class attributes/methods.
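
The "child" relationship described above can be pictured with ordinary Python inheritance. The sketch below is illustrative only: `PipelineTemplate`, `ChurnPipeline`, `target_column`, `clean`, and `run` are hypothetical names chosen for this example and are not taken from the package's actual API.

```python
class PipelineTemplate:
    """Hypothetical stand-in for a template class; not the real ml_pipeline API."""

    # Default class attribute that a child project may overwrite
    target_column = "target"

    def clean(self, rows):
        """Default cleaning step: drop rows that are missing the target value."""
        return [row for row in rows if row.get(self.target_column) is not None]

    def run(self, rows):
        """Run the toy pipeline: clean the rows and report how many remain."""
        return len(self.clean(rows))


class ChurnPipeline(PipelineTemplate):
    """A project-specific child that overwrites one attribute and one method."""

    target_column = "churned"

    def clean(self, rows):
        # Reuse the default behavior, then apply a project-specific rule
        rows = super().clean(rows)
        return [row for row in rows if row.get("age", 0) >= 18]


rows = [
    {"churned": 1, "age": 30},
    {"churned": None, "age": 45},  # dropped by the default step
    {"churned": 0, "age": 16},     # dropped by the child's extra rule
]
remaining = ChurnPipeline().run(rows)
print(remaining)
```

The child only spells out what differs from the template; everything else falls through to the defaults.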

Code Overview

Reading this high-level overview is necessary to understand how the package operates.

Code Hierarchy
Models → Model → Input_Files → Input_File → Features → Feature

Each point in the hierarchy has certain methods and attributes associated with it. These methods give you the functionality for operating the pipeline.
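
As a rough mental model of the hierarchy, the nesting can be sketched with dataclasses. This is an assumption-laden simplification: the class and attribute names are hypothetical, and the collection levels (Input_Files, Features) are flattened into plain lists here, whereas the package treats them as their own levels.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Feature:
    name: str


@dataclass
class InputFile:
    name: str
    # Stands in for the "Features" level of the hierarchy
    features: List[Feature] = field(default_factory=list)


@dataclass
class Model:
    name: str
    # Stands in for the "Input_Files" level of the hierarchy
    input_files: List[InputFile] = field(default_factory=list)


@dataclass
class Models:
    models: List[Model] = field(default_factory=list)

    def feature_paths(self):
        """Walk the tree top-down and list every Model/Input_File/Feature path."""
        paths = []
        for model in self.models:
            for input_file in model.input_files:
                for feature in input_file.features:
                    paths.append(f"{model.name}/{input_file.name}/{feature.name}")
        return paths


tree = Models([
    Model("churn", [InputFile("customers", [Feature("age"), Feature("tenure")])]),
    Model("sales", [InputFile("orders", [Feature("amount")])]),
])
paths = tree.feature_paths()
print(paths)
```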

Usage

For more in-depth documentation, read the information provided on the project's GitHub Pages site. Or better yet, read the source code.

Initializing a Repo for ml_pipeline

  1. Navigate to a directory in command prompt
cd C:/Path/to/Repo
  2. Call the package's main script
python -m ml_pipeline

Navigating the Menu

  1. Run python main_XYZ.py. This opens the Models options screen, which lists the options for the "Models" class instance.
  2. To navigate one level down to a Model, select the option "Open Model"
  3. Select a Model from the list.
  4. Now in the Model options screen, press enter to navigate back up to Models.
  5. You can navigate from Models → Model → Input_Files → Input_File → Features → Feature and all the way back up.
  6. Option 1 shows "Open XXXXX" to navigate to the next level down in the tree.
  7. Press enter to exit back up to the previous level.
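
The up-and-down movement described in the steps above can be sketched as a small state machine. This is a simplified illustration, not the package's menu code: `navigate` is a hypothetical function, the key `"1"` stands for option 1 ("Open XXXXX"), an empty string stands for pressing enter, and the step of picking a specific item from a list is left out.

```python
LEVELS = ["Models", "Model", "Input_Files", "Input_File", "Features", "Feature"]


def navigate(keystrokes):
    """Replay keystrokes and return the level shown after each one.

    "1" moves one level down the tree; "" (enter) moves one level up.
    Any other key, or a move past either end of the tree, leaves the
    level unchanged in this sketch (the real menu may behave differently).
    """
    depth = 0  # start at the Models options screen
    visited = []
    for key in keystrokes:
        if key == "1" and depth < len(LEVELS) - 1:
            depth += 1
        elif key == "" and depth > 0:
            depth -= 1
        visited.append(LEVELS[depth])
    return visited


visited_levels = navigate(["1", "1", "", "", ""])
print(visited_levels)
```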

Query new Raw Data

  1. Run python main_XYZ.py
  2. At the Models options screen, select "Open Model".
  3. Select any Model from the list (this selection does not matter)
  4. In the Model options screen, select "Open Input Files".
  5. Select the first option.
  6. In the Input Files options screen, select "Open Input File"
  7. Select the Input File for which you would like to query new data.
  8. In the Input File options screen, select "Query from Source Database"

Move Query Staged data to Raw Data

  1. Follow steps (1-7) from "Query new Raw Data" to get to the Input File of interest.
  2. In the Input File options screen, select "Move from Query Staging"

Clean a Raw Dataset

  1. Follow steps (1-7) from "Query new Raw Data" to get to the Input File of interest.
  2. In the Input File options screen, select "Clean Raw Dataset"
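
To make the data stages concrete, here is a minimal sketch of moving a queried file out of a staging folder into a raw folder and then cleaning it. Everything in it is an assumption made for illustration: the folder names, the function names, and the cleaning rule (dropping blank lines) are hypothetical placeholders, not the package's actual behavior.

```python
import shutil
import tempfile
from pathlib import Path


def move_from_query_staging(staging_dir, raw_dir, filename):
    """Move a staged query result into the raw data folder (hypothetical layout)."""
    raw_dir.mkdir(parents=True, exist_ok=True)
    destination = raw_dir / filename
    shutil.move(str(staging_dir / filename), str(destination))
    return destination


def clean_raw_dataset(raw_path, cleaned_path):
    """Placeholder cleaning step: keep only non-blank lines."""
    kept = [line for line in raw_path.read_text().splitlines() if line.strip()]
    cleaned_path.parent.mkdir(parents=True, exist_ok=True)
    cleaned_path.write_text("\n".join(kept) + "\n")
    return len(kept)


# Demonstrate the flow inside a throwaway directory
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    staging = root / "query_staging"
    staging.mkdir()
    (staging / "customers.csv").write_text("id,age\n1,30\n\n2,41\n")

    raw_path = move_from_query_staging(staging, root / "raw", "customers.csv")
    staged_file_remains = (staging / "customers.csv").exists()
    kept_count = clean_raw_dataset(raw_path, root / "cleaned" / "customers.csv")
    cleaned_text = (root / "cleaned" / "customers.csv").read_text()
```

Keeping staging separate from raw data means a fresh query result can be inspected before it replaces the dataset the pipeline reads.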

Running one Model

  1. Run python main_XYZ.py
  2. At the Models options screen, select the option for "Open Model"
  3. Select the Model you would like to run
  4. In the Model options screen, select the option for "Run Pipeline"

Running all Models

  1. Run python main_XYZ.py
  2. At the Models options screen, select the option for "Run Models"
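
The difference between "Run Pipeline" on one Model and "Run Models" on the collection can be sketched as follows. The classes and the `run_pipeline` and `run_models` methods are hypothetical names that mirror the menu labels; they are not confirmed to match the package's API.

```python
class Model:
    """Hypothetical single model with a toy pipeline."""

    def __init__(self, name):
        self.name = name

    def run_pipeline(self):
        # A real pipeline would load, preprocess, fit, and export here
        return f"{self.name}: done"


class Models:
    """Hypothetical collection that runs every contained Model in order."""

    def __init__(self, models):
        self.models = list(models)

    def run_models(self):
        return [model.run_pipeline() for model in self.models]


all_models = Models([Model("churn"), Model("sales")])

single_result = all_models.models[0].run_pipeline()  # "Running one Model"
all_results = all_models.run_models()                # "Running all Models"
print(single_result)
print(all_results)
```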

Author(s)

James Kabbes

