Skip to main content

A scalable, fast, ACID-compliant Data Catalog powered by Ray.

Project description

DeltaCAT

DeltaCAT is a Pythonic Data Catalog powered by Ray.

Its data storage model allows you to define and manage fast, scalable, ACID-compliant data catalogs through git-like stage/commit APIs, and has been used to successfully host exabyte-scale enterprise data lakes.

DeltaCAT uses the Ray distributed compute framework together with Apache Arrow for common table management tasks, including petabyte-scale change-data-capture, data consistency checks, and table repair.

Getting Started

Install

pip install deltacat

Running Tests

pip3 install virtualenv
virtualenv test_env
source test_env/bin/activate
pip3 install -r requirements.txt

pytest

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deltacat-0.1.20.tar.gz (155.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

deltacat-0.1.20-py3-none-any.whl (228.7 kB view details)

Uploaded Python 3

File details

Details for the file deltacat-0.1.20.tar.gz.

File metadata

  • Download URL: deltacat-0.1.20.tar.gz
  • Upload date:
  • Size: 155.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for deltacat-0.1.20.tar.gz
Algorithm Hash digest
SHA256 1b74d902416dc941fc04a7b916c040099f4b28db1337dcce829d433fec6360be
MD5 67628385f280565033c01b2be94c4911
BLAKE2b-256 49b444d3f7ede9498eb6e7f66177460777295e418c6bb3f0f131e4166d354572

See more details on using hashes here.

File details

Details for the file deltacat-0.1.20-py3-none-any.whl.

File metadata

  • Download URL: deltacat-0.1.20-py3-none-any.whl
  • Upload date:
  • Size: 228.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for deltacat-0.1.20-py3-none-any.whl
Algorithm Hash digest
SHA256 f5961e70731b1c51ff0e268daf796e9639bfd009f023601992a63b35488ec698
MD5 843032da5e09a4d352a3a4b2b017be7f
BLAKE2b-256 29a3d7ba30b6806a68ebbf87db2250362f0683cdaec28be845f253a0a7f3f6ca

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page