Skip to main content

LightGBM distributed training on Dask

Project description

Dask-LightGBM - DEPRECATED

THIS REPOSITORY IS DEPRECATED

This repository is deprecated and it is no longer maintained. The code was migrated into LightGBM package - https://github.com/microsoft/LightGBM.

Build Status

Distributed training with LightGBM and Dask.distributed

This repository enables you to perform distributed training with LightGBM on Dask.Array and Dask.DataFrame collections. It is based on dask-xgboost package.

Usage

Load your data into distributed data-structure, which can be either Dask.Array or Dask.DataFrame. Connect to a Dask cluster using Dask.distributed.Client. Let dask-lightgbm train a model or make predictions for you. See system tests for a sample code: https://github.com/dask/dask-lightgbm/blob/main/system_tests/test_fit_predict.py

How this works

Dask is used mainly for accessing the cluster and managing data. The library assures that both features and a label for each sample are located on the same worker. It also lets each worker to know addresses and available ports of all other workers. The distributed training is performed by LightGBM library itself using sockets. See more details on distributed training in LightGBM here: https://github.com/microsoft/LightGBM/blob/main/docs/Parallel-Learning-Guide.rst

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dask-lightgbm-0.2.0.tar.gz (5.8 kB view details)

Uploaded Source

Built Distribution

dask_lightgbm-0.2.0-py3-none-any.whl (5.8 kB view details)

Uploaded Python 3

File details

Details for the file dask-lightgbm-0.2.0.tar.gz.

File metadata

  • Download URL: dask-lightgbm-0.2.0.tar.gz
  • Upload date:
  • Size: 5.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.3 CPython/3.8.5 Darwin/20.5.0

File hashes

Hashes for dask-lightgbm-0.2.0.tar.gz
Algorithm Hash digest
SHA256 2a771d63d5326ec7954f826fc9c3a5a249f387190c031a1bbc79c3963590b24d
MD5 f4a2b11409c0b3fe6da993bfa5c2b83d
BLAKE2b-256 62637d857963b6f0fde00c4ca4f5f18f822d0f1a26297a396d3ca103318c55fa

See more details on using hashes here.

File details

Details for the file dask_lightgbm-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: dask_lightgbm-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 5.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.3 CPython/3.8.5 Darwin/20.5.0

File hashes

Hashes for dask_lightgbm-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 88de6cc3312cc54ddf172dd4314ad4579aa5cb25be0a83eb2e2a86b76f73db06
MD5 8bf4b182ec987ca65784402aa9e2f11b
BLAKE2b-256 5b5d88388c8d618f0a7ae3fcff45dd51f53c7719f041618ab692ab70bbc543a6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page