
An AutoML framework optimized for small datasets

Octopus

Octopus is a lightweight AutoML framework designed for small datasets (fewer than 1,000 samples) with high dimensionality (a large number of features). Its goal is to speed up machine learning projects and to make their results more reliable when only little data is available.

What distinguishes Octopus from other AutoML frameworks

  • Nested cross-validation (CV)
  • Performance on small datasets
  • No information leakage
  • No data split mistakes
  • Constrained regularization
  • Ensembling, optimized for (nested) CV
  • Simplicity
  • Time-to-event (survival) analysis
  • Testing system (branching workflows)
  • Reporting based on nested CV
  • Test predictions over all samples
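To make the nested CV idea in the list above concrete, here is a minimal sketch of how nested splits can be generated with the standard library only. It is not Octopus code: the helper names `k_fold_indices` and `nested_cv_splits` are hypothetical and chosen for illustration. The sketch shows why each sample lands in exactly one outer test fold (so test predictions cover all samples) and why the outer test fold never overlaps with the inner training or validation data.

```python
import random


def k_fold_indices(indices, k, seed=0):
    """Shuffle `indices` and split them into k folds of near-equal size."""
    shuffled = list(indices)
    random.Random(seed).shuffle(shuffled)
    return [shuffled[i::k] for i in range(k)]


def nested_cv_splits(n_samples, n_outer=5, n_inner=5, seed=0):
    """Yield (outer_id, inner_id, train, val, test) lists of sample indices."""
    outer_folds = k_fold_indices(range(n_samples), n_outer, seed)
    for outer_id, test in enumerate(outer_folds):
        # Everything outside the outer test fold is available for development.
        dev = [i for fold_id, fold in enumerate(outer_folds)
               if fold_id != outer_id for i in fold]
        inner_folds = k_fold_indices(dev, n_inner, seed + outer_id + 1)
        for inner_id, val in enumerate(inner_folds):
            val_set = set(val)
            train = [i for i in dev if i not in val_set]
            yield outer_id, inner_id, train, val, test


splits = list(nested_cv_splits(n_samples=100, n_outer=5, n_inner=4))
print(len(splits))  # 20 training tasks: 5 outer folds x 4 inner folds
```

Model selection and tuning would use only `train` and `val`; the outer `test` indices are touched once, for the final evaluation of that outer fold.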

Hardware

For maximum speed, run Octopus on a compute node with $n \times m$ CPUs for an $n \times m$ nested cross-validation ($n$ outer folds, each with $m$ inner folds), so that all $n \times m$ inner trainings can run in parallel. Octopus is developed, for example, on a c5.9xlarge EC2 instance (36 vCPUs).
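The sizing rule can be checked with a few lines of Python. This is a rough sketch, not part of Octopus: the function name `cpu_report` is hypothetical, and it assumes one single-threaded training task per CPU, which may not match how Octopus actually schedules work.

```python
import os


def cpu_report(n_outer, n_inner, available=None):
    """Compare the n x m tasks of a nested CV with the available CPUs."""
    if available is None:
        available = os.cpu_count() or 1  # cpu_count() may return None
    needed = n_outer * n_inner
    # Number of sequential "waves" if each CPU runs one task at a time.
    waves = -(-needed // available)  # ceiling division
    return {"needed": needed, "available": available, "waves": waves}


print(cpu_report(5, 5, available=36))
# {'needed': 25, 'available': 36, 'waves': 1}
```

Under these assumptions a 5 x 5 or 6 x 6 nested CV fits on 36 CPUs in a single wave, while a 7 x 7 layout (49 tasks) would need two.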

Installation

Install the package with pip or any other standard Python package manager:

# Install with recommended dependencies (includes optional packages such as AutoGluon)
pip install "octopus-automl[recommended]"

# Explicitly specify optional dependencies
pip install "octopus-automl[autogluon]"     # AutoGluon
pip install "octopus-automl[boruta]"        # Boruta feature selection
pip install "octopus-automl[survival]"      # Support time-to-event / survival analysis
pip install "octopus-automl[examples]"      # Dependencies for running examples

# Install several extras at once, e.g.
pip install "octopus-automl[autogluon,examples]"

Contributors and Octopus developers can install a dedicated dependency group that contains code sanitization and quality tools:

pip install "octopus-automl[dev]"
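After installing, one quick way to confirm that the distribution is visible to the current interpreter is to query its metadata. The sketch below uses only the standard library; the helper name `installed_version` is hypothetical, and `octopus-automl` is the distribution name taken from the pip commands above.

```python
from importlib import metadata


def installed_version(dist_name="octopus-automl"):
    """Return the installed version string, or None if it is not installed."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


print(installed_version())
```

A result of `None` usually means the package was installed into a different environment than the one running the script.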

Download files

Source Distribution

octopus_automl-0.0.7.tar.gz (1.0 MB)

Uploaded Source

Built Distribution

octopus_automl-0.0.7-py3-none-any.whl (229.3 kB)

Uploaded Python 3

File details

Details for the file octopus_automl-0.0.7.tar.gz.

File metadata

  • Download URL: octopus_automl-0.0.7.tar.gz
  • Size: 1.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for octopus_automl-0.0.7.tar.gz
Algorithm Hash digest
SHA256 dd5d964fc43d6d5072ac43f1cc3cbf85de9280e1f2c1956f0b6c7dc10482179d
MD5 39aeab87463cd8d97ab6e848d4d479d1
BLAKE2b-256 93ce13365fc56e6c2276f4bf7d68c6ea5dc910ad739de7f64624206e298ddca0
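A downloaded file can be checked against the SHA256 digest listed above with the standard `hashlib` module. This is a minimal sketch: the function names `sha256_of_stream` and `verify_file` are hypothetical, and the local file path passed to `verify_file` is assumed to point at the archive you downloaded.

```python
import hashlib
import io


def sha256_of_stream(stream, chunk_size=1 << 20):
    """Hash a binary stream in chunks so large files need little memory."""
    digest = hashlib.sha256()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def verify_file(path, expected_hex):
    """Return True if the file at `path` has the expected SHA256 digest."""
    with open(path, "rb") as fh:
        return sha256_of_stream(fh) == expected_hex.strip().lower()


# Self-check on in-memory data; no download is required for this line.
print(sha256_of_stream(io.BytesIO(b"abc")) == hashlib.sha256(b"abc").hexdigest())  # True
```

For the source distribution, the call would be `verify_file("octopus_automl-0.0.7.tar.gz", "dd5d964fc43d6d5072ac43f1cc3cbf85de9280e1f2c1956f0b6c7dc10482179d")`, assuming the archive is in the current directory.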

Provenance

The following attestation bundles were made for octopus_automl-0.0.7.tar.gz:

Publisher: release.yml on emdgroup/octopus-automl

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file octopus_automl-0.0.7-py3-none-any.whl.

File metadata

  • Download URL: octopus_automl-0.0.7-py3-none-any.whl
  • Size: 229.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for octopus_automl-0.0.7-py3-none-any.whl
Algorithm Hash digest
SHA256 107c20fbeb8e818174aa9cd9aa7304d2b2a8886e12c8e4f7668f9e6be6b5cd72
MD5 9ceb155eb81d373eace07fb8f2eef469
BLAKE2b-256 6d08499d46e6ef27d0210db819bf6a98d3c2b4c9e87202caa5c009ed1302adc5

Provenance

The following attestation bundles were made for octopus_automl-0.0.7-py3-none-any.whl:

Publisher: release.yml on emdgroup/octopus-automl

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
