Skip to main content

Spark-based distribution version of fast and customizable framework for automatic ML model creation (AutoML)

Project description

SLAMA: LightAutoML on Spark

SLAMA is a version of LightAutoML library modified to run in distributed mode with Apache Spark framework.

It requires:

  1. Python 3.9
  2. PySpark 3.2+ (installed as a dependency)
  3. Synapse ML library (It will be downloaded by Spark automatically)

Currently, only tabular Preset is supported. See demo with spark-based tabular automl preset in examples/spark/tabular-preset-automl.py. For further information check docs in the root of the project containing dedicated SLAMA section.

License

This project is licensed under the Apache License, Version 2.0. See LICENSE file for more details.

Installation

First of all you need to install git and poetry.

# Load LAMA source code
git clone https://github.com/fonhorst/LightAutoML_Spark.git

cd LightAutoML/

# !!!Choose only one item!!!

# 1. Global installation: Don't create virtual environment
poetry config virtualenvs.create false --local

# 2. Recommended: Create virtual environment inside your project directory
poetry config virtualenvs.in-project true

# For more information read poetry docs

# Install LAMA
poetry lock
poetry install

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

sparklightautoml-0.4.1-py3-none-any.whl (159.7 kB view details)

Uploaded Python 3

File details

Details for the file sparklightautoml-0.4.1-py3-none-any.whl.

File metadata

  • Download URL: sparklightautoml-0.4.1-py3-none-any.whl
  • Upload date:
  • Size: 159.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.13 CPython/3.8.10 Linux/5.4.0-117-generic

File hashes

Hashes for sparklightautoml-0.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ebfaf3ce75df6bdfc5790ef494f2766e3323a7db0f8f8a8026e0a2abacfd387d
MD5 d09f0ecb895ccdf894034bda14600e70
BLAKE2b-256 06f397789a30c317cb22db960afa3f9f5903e594103e09138e989937bc1cd6f5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page