A package containing classes to be used in the data pipeline for the PSUSANNX project.

psusannx_pipeline

A package that contains the custom classes and the sklearn preprocessing pipeline object used in the PSUSANNX project. Packaging the custom classes lets them be imported into any script and used like standard sklearn preprocessors. In particular, once the pipeline has been fit to data, the fitted pipeline can be pickled to a file and loaded in another script that also imports this package.

This package was created to be used as a subpackage in a wider project - PSUSANNX.

Custom Transformers

  • CorrectNegPoints
  • BucketFormations
  • GetLogRatios
  • GetPercentagesAndPerGame
  • GetDifferences
  • CatboostEncodeFormations
  • DropFeatures
  • CustomScaler
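
The package's own implementations are not reproduced here. As a rough, stdlib-only sketch of the fit/transform interface that sklearn-style transformers expose, the hypothetical `ToyDropFeatures` class below (not the package's `DropFeatures`) removes named keys from a list of row dictionaries. The class name, the `features` parameter, the field names, and the row format are all assumptions made for illustration.

```python
class ToyDropFeatures:
    """Hypothetical stand-in, NOT the package's DropFeatures class."""

    def __init__(self, features):
        # Names of the fields to remove from every row (assumed parameter)
        self.features = list(features)

    def fit(self, rows, y=None):
        # Nothing to learn for this transformer; return self so calls can be chained
        return self

    def transform(self, rows):
        # Build new row dicts without the dropped fields; the input is left untouched
        return [
            {key: value for key, value in row.items() if key not in self.features}
            for row in rows
        ]

    def fit_transform(self, rows, y=None):
        # Convenience method mirroring the sklearn-style interface
        return self.fit(rows, y).transform(rows)


# Made-up example row; the field names are illustrative only
rows = [{"home_points": 3, "away_points": 0, "match_id": 17}]
cleaned = ToyDropFeatures(features=["match_id"]).fit_transform(rows)
print(cleaned)
```

Because every step exposes the same `fit` and `transform` methods, steps like this can be chained, which is what allows a whole pipeline to be fit once and reused.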

Installation

pip install psusannx_pipeline

Usage

This pipeline is designed to be applied to the specific dataframe used in the PSUSANNX project.

# Import all the custom classes & pipeline object
from psusannx_pipeline.pipeline import *

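# Fit the preprocessing pipeline to the data, transform it, and drop rows with missing values
# (`data` is the project dataframe, loaded beforehand)
data_preprocessed = preprocessing_pipeline.fit_transform(data).dropna()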

Now save the fitted preprocessing pipeline out to a pkl file so it can be used in other scripts.

# Import pickle for serialization
import pickle

# Save the fitted preprocessing pipeline so it can be used to process new predictions
# (the with-block closes the file handle once the dump is complete)
with open("preprocessing_pipeline.pkl", "wb") as f:
    pickle.dump(preprocessing_pipeline, f)
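
Loading works the same way in reverse. The sketch below is a stdlib-only illustration of the round trip, using a hypothetical `ToyScaler` class in place of the real pipeline and an in-memory buffer in place of `preprocessing_pipeline.pkl`; none of these names come from the package. It shows why the loading script must also be able to import the class definitions: pickle stores a reference to the class plus the fitted state, not the class code itself.

```python
import io
import pickle


class ToyScaler:
    """Hypothetical transformer used only for this illustration."""

    def fit(self, values):
        # Learn the state that has to survive serialization
        self.min_ = min(values)
        self.range_ = max(values) - self.min_
        return self

    def transform(self, values):
        # Rescale using the state learned in fit()
        return [(v - self.min_) / self.range_ for v in values]


fitted = ToyScaler().fit([10, 20, 30])

# Serialize to an in-memory buffer (a stand-in for a .pkl file opened with "wb")
buffer = io.BytesIO()
pickle.dump(fitted, buffer)

# Deserialize; this only works where the ToyScaler class is defined or importable
buffer.seek(0)
restored = pickle.load(buffer)

# The restored object carries the fitted state and can transform new values
print(restored.transform([15, 30]))
```

With the real package, the equivalent would be to open `preprocessing_pipeline.pkl` in `"rb"` mode, call `pickle.load` on it, and keep `from psusannx_pipeline.pipeline import *` at the top of the loading script so the custom classes can be resolved.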

Notes

  • The package is deliberately narrow in scope: it only needs to do what the parent project requires, so little further development is expected.
