Skip to main content

A scalable, fast, ACID-compliant Data Catalog powered by Ray.

Project description

DeltaCAT

DeltaCAT is a Pythonic Data Catalog powered by Ray.

Its data storage model allows you to define and manage fast, scalable, ACID-compliant data catalogs through git-like stage/commit APIs, and has been used to successfully host exabyte-scale enterprise data lakes.

DeltaCAT uses the Ray distributed compute framework together with Apache Arrow for common table management tasks, including petabyte-scale change-data-capture, data consistency checks, and table repair.

Getting Started

Install

pip install deltacat

Running Tests

pip3 install virtualenv
virtualenv test_env
source test_env/bin/activate
pip3 install -r requirements.txt

pytest

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deltacat-0.1.18b17.tar.gz (153.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

deltacat-0.1.18b17-py3-none-any.whl (226.4 kB view details)

Uploaded Python 3

File details

Details for the file deltacat-0.1.18b17.tar.gz.

File metadata

  • Download URL: deltacat-0.1.18b17.tar.gz
  • Upload date:
  • Size: 153.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for deltacat-0.1.18b17.tar.gz
Algorithm Hash digest
SHA256 d4c978c7ee86deb86542fc139dfc636ba06c7813e8d94a1f0e738d1b500d6e4b
MD5 f296bc5c3584f263670e0813e5f4d972
BLAKE2b-256 741a8ad39766a5d3fe4b5d67749b738aaec7ddc2e549051b588eeef8ef232c83

See more details on using hashes here.

File details

Details for the file deltacat-0.1.18b17-py3-none-any.whl.

File metadata

  • Download URL: deltacat-0.1.18b17-py3-none-any.whl
  • Upload date:
  • Size: 226.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for deltacat-0.1.18b17-py3-none-any.whl
Algorithm Hash digest
SHA256 9d3cef6cc2e2bd832be79098dc33173ac000151faf506e4812d8332e4f729215
MD5 1f861fe8b266e540771aca5f2ae106f3
BLAKE2b-256 ea8714da8f77a8bc744c632e5bcb1a6f12e1a18309fcd96fb7c8742b1bad8984

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page