Skip to main content

Spark-based distribution version of fast and customizable framework for automatic ML model creation (AutoML)

Project description

SLAMA: LightAutoML on Spark

SLAMA is a version of LightAutoML library modified to run in distributed mode with Apache Spark framework.

It requires:

  1. Python 3.9
  2. PySpark 3.2+ (installed as a dependency)
  3. Synapse ML library (It will be downloaded by Spark automatically)

Currently, only tabular Preset is supported. See demo with spark-based tabular automl preset in examples/spark/tabular-preset-automl.py. For further information check docs in the root of the project containing dedicated SLAMA section.

License

This project is licensed under the Apache License, Version 2.0. See LICENSE file for more details.

Installation

First of all you need to install git and poetry.

# Load LAMA source code
git clone https://github.com/fonhorst/LightAutoML_Spark.git

cd LightAutoML/

# !!!Choose only one item!!!

# 1. Global installation: Don't create virtual environment
poetry config virtualenvs.create false --local

# 2. Recommended: Create virtual environment inside your project directory
poetry config virtualenvs.in-project true

# For more information read poetry docs

# Install LAMA
poetry lock
poetry install

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

SparkLightAutoML_DEV-0.3.0.tar.gz (113.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

SparkLightAutoML_DEV-0.3.0-py3-none-any.whl (141.3 kB view details)

Uploaded Python 3

File details

Details for the file SparkLightAutoML_DEV-0.3.0.tar.gz.

File metadata

  • Download URL: SparkLightAutoML_DEV-0.3.0.tar.gz
  • Upload date:
  • Size: 113.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.15 CPython/3.9.12 Linux/5.14.18-100.fc33.x86_64

File hashes

Hashes for SparkLightAutoML_DEV-0.3.0.tar.gz
Algorithm Hash digest
SHA256 ffd6b319fe33b52c284c6fb93e45a60c63f838e4bff37e7e0d7ba752dc018d55
MD5 ace61ffe476ecf90f122d1afcd893852
BLAKE2b-256 70609758f55e3098babdf3328503481a8ee096aa44276fcc210ea2af8aa0a77c

See more details on using hashes here.

File details

Details for the file SparkLightAutoML_DEV-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: SparkLightAutoML_DEV-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 141.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.15 CPython/3.9.12 Linux/5.14.18-100.fc33.x86_64

File hashes

Hashes for SparkLightAutoML_DEV-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 8e6aba48ad084a4dd6bbb8b0670f18705b57e79415b2dbc4690849193ac31edb
MD5 ec2117190b1f45b14fd4b94d5aad37ba
BLAKE2b-256 abe4250e32ed60dce1a1c9c9ed873a3b267845f688ac72684ec692e779a8bd7f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page