Skip to main content

Delta reader for the Ray open-source toolkit for building ML applications

Project description

deltaray

License

The deltaray library provides a Delta Lake table reader for the Ray open-source ML toolkit.

Quickstart

Install from PyPI:

pip install deltaray

Install from GitHub:

pip install git+https://github.com/delta-incubator/deltaray.git

Basic use, check notebooks for more detailed example:

# Standard Libraries
import pathlib

# External Libraries
import deltaray
import deltalake as dl
import pandas as pd


# Creating a Delta Table
cwd = pathlib.Path().resolve()
table_uri = f'{cwd}/tmp/delta-table'
df = pd.DataFrame({'id': [0, 1, 2, 3, 4, ], })
dl.write_deltalake(table_uri, df)

# Reading our Delta Table
ds = deltaray.read_delta(table_uri)
ds.show()

Running Test Matrix

note: you can add -s flag to print to stderr/stdout during pytest-ing

tox

Building Distribution

Building Wheel:

python setup.py bdist_wheel sdist

Installing Wheel:

pip install /path/to/wheel/..

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deltaray-0.1.1.tar.gz (7.2 kB view hashes)

Uploaded Source

Built Distribution

deltaray-0.1.1-py3-none-any.whl (7.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page