Skip to main content

JoinBoost: In-Database Tree-Models over Many Tables

Project description

JoinBoost: In-Database Factorized Tree-Models

License

JoinBoost the first In-DB factorized learning system for tree-based models.

The technical report is in the /technical.

Reproducibility

We note that some feataures discussed in the paper (e.g., inter-query parallelism, DP) are not implemented in the main codes for reliability concerns. To reproduce the experiment results from the paper, we include the prototype codes for JoinBoost under /proto folder, which includes the JoinBoost codes and Jupyer Notebook to train models over Favorita. The Favorita dataset is too large to store in Github. They can be found in dropbox.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

joinboost-0.0.11.tar.gz (16.3 kB view hashes)

Uploaded Source

Built Distribution

joinboost-0.0.11-py3-none-any.whl (18.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page