A scalable, fast, ACID-compliant Data Catalog powered by Ray.
Project description
DeltaCAT
DeltaCAT is a Pythonic Data Catalog powered by Ray.
Its data storage model allows you to define and manage fast, scalable, ACID-compliant data catalogs through git-like stage/commit APIs, and has been used to successfully host exabyte-scale enterprise data lakes.
DeltaCAT uses the Ray distributed compute framework together with Apache Arrow for common table management tasks, including petabyte-scale change-data-capture, data consistency checks, and table repair.
Getting Started
Install
pip install deltacat
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file deltacat-fork-0.1.14.tar.gz
.
File metadata
- Download URL: deltacat-fork-0.1.14.tar.gz
- Upload date:
- Size: 87.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9f138cae5a3d60930ecc3ecd64e096d20fb5c9ded00910d28c28971e9b4677c3 |
|
MD5 | 0eb740f5ce2fce2bd2073bf411954297 |
|
BLAKE2b-256 | 888779ff167cb3bf5d7431f7b6a893fccd7b263ad356218aa27bdea04e876a61 |
File details
Details for the file deltacat_fork-0.1.14-py3-none-any.whl
.
File metadata
- Download URL: deltacat_fork-0.1.14-py3-none-any.whl
- Upload date:
- Size: 133.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 30114d214557e5a94c0553d3f74cd98da4da41e718a29449c485aead577891b2 |
|
MD5 | 28e39fcb15763b5dd6966ee447aa36d1 |
|
BLAKE2b-256 | bc2d28b97c3a390a4e4cad3fc5db8f31f2638f383b127a730482941c58868781 |