Skip to main content

An Auto-ML framework optimized for small datasets

Project description

Octopus

Octopus is a lightweight AutoML framework specifically designed for small datasets (<1k samples) and with high dimensionality (number of features). The goal of Octopus is to speed up machine learning projects and to increase the reliability of results in the context of small datasets.

What distinguishes Octopus from others

  • Nested cross-validation (CV)
  • Performance on small datasets
  • No information leakage
  • No data split mistakes
  • Constrained regularization
  • Ensembling, optimized for (nested) CV
  • Simplicity
  • Time to event
  • Testing system (branching workflows)
  • Reporting based on nested CV
  • Test predictions over all samples

Hardware

For maximum speed it is recommended to run Octopus on a compute node with $n\times m$ CPUS for a $n \times m$ nested cross validation. Octopus development is done, for example, on a c5.9xlarge EC2 instance.

Installation

Package Installation works via pip or any other standard Python package manager:

# Install with recommended dependencies (includes optional packages such as AutoGluon)
pip install "octopus-automl[recommended]"

# Explicitly specify optional dependencies
pip install "octopus-automl[autogluon]"     # AutoGluon
pip install "octopus-automl[boruta]"        # Boruta feature selection
pip install "octopus-automl[survival]"      # Support time-to-event / survival analysis
pip install "octopus-automl[examples]"      # Dependencies for running examples

# Install with more than one extras, e.g.
pip install "octopus-automl[autogluon,examples]"

For contributors / octopus developers, a specific dependency group exists. It contains code sanitization and quality tools.

pip install "octopus-automl[dev]"

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

octopus_automl-0.5.2.tar.gz (1.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

octopus_automl-0.5.2-py3-none-any.whl (229.3 kB view details)

Uploaded Python 3

File details

Details for the file octopus_automl-0.5.2.tar.gz.

File metadata

  • Download URL: octopus_automl-0.5.2.tar.gz
  • Upload date:
  • Size: 1.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for octopus_automl-0.5.2.tar.gz
Algorithm Hash digest
SHA256 25c3792650125a711146c9b6945a243d07eeb7804e4c81c054c7225f48cc905c
MD5 fd9079a3553bd21e7696f5d116108043
BLAKE2b-256 dae85a68980d840cccea519fbd3cc216456337d6151e1e069ea82ce02fa66c9b

See more details on using hashes here.

Provenance

The following attestation bundles were made for octopus_automl-0.5.2.tar.gz:

Publisher: release.yml on emdgroup/octopus-automl

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file octopus_automl-0.5.2-py3-none-any.whl.

File metadata

  • Download URL: octopus_automl-0.5.2-py3-none-any.whl
  • Upload date:
  • Size: 229.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for octopus_automl-0.5.2-py3-none-any.whl
Algorithm Hash digest
SHA256 93188ca97256e5d7fdcfee9230b6e456c03fc5f35d8f4dd2b762b6bfbfca7138
MD5 a3a93a62c4c81bcdd0318601408a91cf
BLAKE2b-256 8d2ac8c440c15333cf452e17fcc3aabc998c309a68a8d9c3f4bfe94da8f80c24

See more details on using hashes here.

Provenance

The following attestation bundles were made for octopus_automl-0.5.2-py3-none-any.whl:

Publisher: release.yml on emdgroup/octopus-automl

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page