Skip to main content

A scalable, fast, ACID-compliant Data Catalog powered by Ray.

Project description

DeltaCAT

DeltaCAT is a Pythonic Data Catalog powered by Ray.

Its data storage model allows you to define and manage fast, scalable, ACID-compliant data catalogs through git-like stage/commit APIs, and has been used to successfully host exabyte-scale enterprise data lakes.

DeltaCAT uses the Ray distributed compute framework together with Apache Arrow for common table management tasks, including petabyte-scale change-data-capture, data consistency checks, and table repair.

Getting Started

Install

pip install deltacat

Running Tests

pip3 install virtualenv
virtualenv test_env
source test_env/bin/activate
pip3 install -r requirements.txt

pytest

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deltacat-1.1.27.tar.gz (217.3 kB view details)

Uploaded Source

Built Distribution

deltacat-1.1.27-py3-none-any.whl (302.6 kB view details)

Uploaded Python 3

File details

Details for the file deltacat-1.1.27.tar.gz.

File metadata

  • Download URL: deltacat-1.1.27.tar.gz
  • Upload date:
  • Size: 217.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for deltacat-1.1.27.tar.gz
Algorithm Hash digest
SHA256 fd1cd27def3ba5fc0d13e2ee427c9b5e7dffcf7fe3f0502821c9c7f3f34ea301
MD5 4ae042f76844d1ab78d2cabfdf2967d5
BLAKE2b-256 918c005ba9e707810e5349c74705c05181ec75177455419f28f14cf8549878b7

See more details on using hashes here.

File details

Details for the file deltacat-1.1.27-py3-none-any.whl.

File metadata

  • Download URL: deltacat-1.1.27-py3-none-any.whl
  • Upload date:
  • Size: 302.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for deltacat-1.1.27-py3-none-any.whl
Algorithm Hash digest
SHA256 1497b43547ff7e502ec4f903665d350103bbf25227e7427bf189f0a27a738768
MD5 6389c55d9cb71d5ea3694906363c662d
BLAKE2b-256 e1ca525423e0f669edf9991de5fa60f2e1ae30ad5db7bae845c431ace53b5122

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page