Spark-based distribution version of fast and customizable framework for automatic ML model creation (AutoML)
Project description
SLAMA: LightAutoML on Spark
SLAMA is a version of LightAutoML library modified to run in distributed mode with Apache Spark framework.
It requires:
- Python 3.9
- PySpark 3.2+ (installed as a dependency)
- Synapse ML library (It will be downloaded by Spark automatically)
Currently, only tabular Preset is supported. See demo with spark-based tabular automl preset in examples/spark/tabular-preset-automl.py. For further information check docs in the root of the project containing dedicated SLAMA section.
License
This project is licensed under the Apache License, Version 2.0. See LICENSE file for more details.
Installation
First of all you need to install git and poetry.
# Load LAMA source code
git clone https://github.com/fonhorst/LightAutoML_Spark.git
cd LightAutoML/
# !!!Choose only one item!!!
# 1. Global installation: Don't create virtual environment
poetry config virtualenvs.create false --local
# 2. Recommended: Create virtual environment inside your project directory
poetry config virtualenvs.in-project true
# For more information read poetry docs
# Install LAMA
poetry lock
poetry install
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
File details
Details for the file sparklightautoml-0.4.1-py3-none-any.whl
.
File metadata
- Download URL: sparklightautoml-0.4.1-py3-none-any.whl
- Upload date:
- Size: 159.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.13 CPython/3.8.10 Linux/5.4.0-117-generic
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ebfaf3ce75df6bdfc5790ef494f2766e3323a7db0f8f8a8026e0a2abacfd387d |
|
MD5 | d09f0ecb895ccdf894034bda14600e70 |
|
BLAKE2b-256 | 06f397789a30c317cb22db960afa3f9f5903e594103e09138e989937bc1cd6f5 |