Machine learning tools for running repeated nested leave-one-dataset-out validation and more.

Generalize

Author: Ludvig R. Olsen (r-pkgs@ludvigolsen.dk)

The ultimate goal of training machine learning models is to generalize to new, unseen data. This package contains tools for measuring model performance across multiple datasets via cross-dataset validation (also known as leave-one-dataset-out validation).
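
The idea behind leave-one-dataset-out validation can be sketched in plain Python: each dataset takes a turn as the held-out test set while all other datasets form the training set. This is an illustration only, not this package's API; the helper name `leave_one_dataset_out` and the sample labels are hypothetical. In practice, scikit-learn's `LeaveOneGroupOut` splitter implements the same splitting scheme.

```python
from collections import defaultdict


def leave_one_dataset_out(dataset_ids):
    """Yield (held_out, train_idx, test_idx) for each unique dataset ID.

    `dataset_ids` holds one dataset label per sample.
    Hypothetical helper for illustration; not part of `generalize`.
    """
    # Group the sample indices by the dataset they came from
    indices_by_dataset = defaultdict(list)
    for idx, ds in enumerate(dataset_ids):
        indices_by_dataset[ds].append(idx)

    for held_out in sorted(indices_by_dataset):
        # Test on the held-out dataset, train on every other dataset
        test_idx = indices_by_dataset[held_out]
        train_idx = [
            idx
            for ds, idxs in indices_by_dataset.items()
            if ds != held_out
            for idx in idxs
        ]
        yield held_out, sorted(train_idx), test_idx


# Six samples collected in three datasets (made-up labels)
dataset_ids = ["A", "A", "B", "B", "B", "C"]
splits = list(leave_one_dataset_out(dataset_ids))
for held_out, train_idx, test_idx in splits:
    print(f"test on {held_out}: train={train_idx} test={test_idx}")
```

Because no sample from the held-out dataset is seen during training, the test score reflects how well the model transfers to a dataset collected separately, rather than to new samples from a familiar one.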

Under development!

  • Not generalized enough for general usage (ironic, I know)
  • Relies on an old version of scikit-learn and needs updating
  • Linear regression is not currently implemented
  • Help strings are likely not up to date

Main functions and classes

| Function | Description |
| --- | --- |
| nested_cross_validate() | Run (repeated) nested cross-validation. |
| train_full_model() | Train model on all data and save to disk. |
| evaluate_univariate_models() | Evaluate prediction potential of every predictor separately. |
| PipelineDesigner | Design a scikit-learn pipeline for use in cross-validation. |
| ROCCurve, ROCCurves | ROC curve containers with various utility methods. |
| select_samples() | Utility for selecting samples based on (collapsed) labels. |
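
To make the term "nested" concrete, the following is a minimal sketch of nested leave-one-dataset-out validation in plain Python. It does not call `nested_cross_validate()`, whose signature is not documented here; the toy threshold classifier, the `offset` hyperparameter, all function names, and the data are assumptions made up for illustration. The inner loop picks a hyperparameter using only the outer training datasets, and the outer loop scores that choice on a dataset that played no part in the selection.

```python
from statistics import mean


def fit(samples, offset):
    """Toy classifier: threshold halfway between the class means, shifted by `offset`."""
    mean_0 = mean(x for _, x, y in samples if y == 0)
    mean_1 = mean(x for _, x, y in samples if y == 1)
    return (mean_0 + mean_1) / 2 + offset


def accuracy(threshold, samples):
    """Fraction of samples where (x > threshold) matches the label."""
    return mean(int((x > threshold) == bool(y)) for _, x, y in samples)


def nested_lodo(samples, offsets):
    """Return {outer test dataset: (selected offset, test accuracy)}."""
    datasets = sorted({ds for ds, _, _ in samples})
    results = {}
    for outer in datasets:
        outer_train = [s for s in samples if s[0] != outer]
        outer_test = [s for s in samples if s[0] == outer]
        inner_datasets = [ds for ds in datasets if ds != outer]

        def inner_score(offset):
            # Inner loop: leave one of the remaining datasets out for validation
            scores = []
            for inner in inner_datasets:
                inner_train = [s for s in outer_train if s[0] != inner]
                inner_val = [s for s in outer_train if s[0] == inner]
                scores.append(accuracy(fit(inner_train, offset), inner_val))
            return mean(scores)

        # Select the hyperparameter without touching the outer test dataset
        best = max(offsets, key=inner_score)
        results[outer] = (best, accuracy(fit(outer_train, best), outer_test))
    return results


# Made-up samples: (dataset ID, feature value, binary label)
samples = [
    ("A", 1.0, 0), ("A", 2.0, 0), ("A", 4.0, 1), ("A", 5.0, 1),
    ("B", 1.5, 0), ("B", 2.5, 0), ("B", 4.5, 1), ("B", 5.5, 1),
    ("C", 0.5, 0), ("C", 1.5, 0), ("C", 3.5, 1), ("C", 4.5, 1),
]
offsets = [-1.0, 0.0, 1.0]
results = nested_lodo(samples, offsets)
for dataset, (offset, acc) in results.items():
    print(f"held out {dataset}: selected offset={offset}, accuracy={acc}")
```

Repeating the procedure, as the "(repeated)" in the table suggests, would typically mean rerunning it with different random seeds; this deterministic toy has no randomness, so a single run suffices.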

Source Distribution

generalize-0.2.0.tar.gz (121.3 kB view details)

Uploaded Source

Built Distribution

generalize-0.2.0-py3-none-any.whl (149.8 kB view details)

Uploaded Python 3

File details

Details for the file generalize-0.2.0.tar.gz.

File metadata

  • Download URL: generalize-0.2.0.tar.gz
  • Upload date:
  • Size: 121.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.0 CPython/3.9.6 Darwin/23.6.0

File hashes

Hashes for generalize-0.2.0.tar.gz

| Algorithm | Hash digest |
| --- | --- |
| SHA256 | 4b4ab7b7a9c5d380f63a19c6129136d811499bf4a01e50b63288f0323cdd6f42 |
| MD5 | d4e6ce848c96ae3088e3764e29c84e5e |
| BLAKE2b-256 | de94909f120043c9d2795835b37bf490844d9c652bf4334457c89a2941ec1c1e |

File details

Details for the file generalize-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: generalize-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 149.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.0 CPython/3.9.6 Darwin/23.6.0

File hashes

Hashes for generalize-0.2.0-py3-none-any.whl

| Algorithm | Hash digest |
| --- | --- |
| SHA256 | 110f58994cffcb17838131feba469d8ee26ed6dec8ea1d1935f0d2463d5378a2 |
| MD5 | 20d582994d845702a1fff129931016cd |
| BLAKE2b-256 | c31ce4f9eeac0088e7607ad07c6aba4aa2b35f4dc24dc92f56cfa92872580bb8 |
