Skip to main content

Apache SystemDS - An open source ML system for the end-to-end data science lifecycle

Project description

This package provides a Pythonic interface for working with Apache SystemDS.

Apache SystemDS is an open source ML system for the end-to-end data science lifecycle from data integration, cleaning, and feature engineering, over efficient, local and distributed ML model training, to deployment and serving. To facilitate this, bindings from different languages and different system abstractions provide help for:

  1. The different tasks of the data-science lifecycle, and
  2. users with different expertise.

These high-level scripts are compiled into hybrid execution plans of local, in-memory CPU and GPU operations, as well as distributed operations on Apache Spark. In contrast to existing systems - that either provide homogeneous tensors or 2D Datasets - and in order to serve the entire data science lifecycle, the underlying data model are DataTensors, i.e., tensors (multi-dimensional arrays) whose first dimension may have a heterogeneous and nested schema.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

systemds-3.1.0.tar.gz (69.8 MB view hashes)

Uploaded Source

Built Distribution

systemds-3.1.0-py3-none-any.whl (70.1 MB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page