Skip to main content

A portable, scalable, and fast AI Data Lakehouse.

Project description

deltacat-header-logo

DeltaCAT is a portable Multimodal Data Lakehouse powered by Ray. It lets you define and manage fast, scalable, ACID-compliant Multimodal data lakes, and has been used to successfully manage exabyte-scale enterprise data lakes.

It uses the Ray distributed compute framework together with Apache Arrow and Daft to efficiently scale common table management tasks, like petabyte-scale merge-on-read and copy-on-write operations.

DeltaCAT provides four high-level components:

  1. Catalog: High-level APIs to create, discover, organize, and manage datasets.
  2. Compute: Distributed data management jobs to read, write, and optimize datasets.
  3. Storage: In-memory and on-disk multimodal dataset formats.
  4. Sync: Synchronize DeltaCAT datasets with other data warehouses and table formats.

Getting Started

DeltaCAT is rapidly evolving. Usage instructions will be posted here soon!

For now, feel free to peruse some of our examples:

Project details


Release history Release notifications | RSS feed

This version

0.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cpcat-0.1.tar.gz (334.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cpcat-0.1-py3-none-any.whl (456.9 kB view details)

Uploaded Python 3

File details

Details for the file cpcat-0.1.tar.gz.

File metadata

  • Download URL: cpcat-0.1.tar.gz
  • Upload date:
  • Size: 334.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for cpcat-0.1.tar.gz
Algorithm Hash digest
SHA256 1a37821affb0de0f4cf546744466dba5f251c37fd033f3ff0534f67b28b92e32
MD5 4042cb3235b3983653356b00b3674464
BLAKE2b-256 4355a15fd141eff6f3565a4f81e0b129eca6de82e5841a782339847508b7e78e

See more details on using hashes here.

File details

Details for the file cpcat-0.1-py3-none-any.whl.

File metadata

  • Download URL: cpcat-0.1-py3-none-any.whl
  • Upload date:
  • Size: 456.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for cpcat-0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ae75e03ff0cabf0291ccbfa2541d2740fff66441fae7f69dfd14ba1cca3a2c32
MD5 ac5c90a7d11d859ec6d2248c9da098e3
BLAKE2b-256 7a68bd419bb1fc9d56817eb96656e43df6f2c595aad0464973c4047047802f3d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page