Skip to main content

Centralized data repository for cross-process data filtering in Python.

Project description

TableVault

TableVault is a Python package for storing and querying workflow data with lineage tracking across scripts and notebooks.

It uses ArangoDB as the backend and gives you a single API (Vault) to:

  • Store typed data lists (file, document, embedding, record)
  • Track upstream/downstream dependencies between items
  • Search by text, code provenance, and embedding similarity
  • Coordinate long-running processes with safe pause/stop checkpoints

Documentation

You can find the full documentation at tablevault.org.

Installation

Install from PyPI:

pip install tablevault

Citation

If you use TableVault in research, cite:

  • Zhao, J. and Krishnan, S. (2025). TableVault: Managing Dynamic Data Collections for LLM-Augmented Workflows. NOVAS @ SIGMOD.
    ArXiv: https://arxiv.org/abs/2506.18257

License

MIT License. See LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tablevault-0.2.2.post1.tar.gz (60.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tablevault-0.2.2.post1-py3-none-any.whl (32.9 kB view details)

Uploaded Python 3

File details

Details for the file tablevault-0.2.2.post1.tar.gz.

File metadata

  • Download URL: tablevault-0.2.2.post1.tar.gz
  • Upload date:
  • Size: 60.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.1

File hashes

Hashes for tablevault-0.2.2.post1.tar.gz
Algorithm Hash digest
SHA256 af12e01e92038e1f04e80260f74823c6d0ebe1082502ad651703203b9484f3fd
MD5 bf904acfef48ecf78976e554f46a1eef
BLAKE2b-256 d25da5fd532d2082756201a0ea678f6204ffa733a82078cd4827ce2113492c78

See more details on using hashes here.

File details

Details for the file tablevault-0.2.2.post1-py3-none-any.whl.

File metadata

File hashes

Hashes for tablevault-0.2.2.post1-py3-none-any.whl
Algorithm Hash digest
SHA256 a473d3f25abb199c933542dec302da9a4705fe1ad4c20ac1d36fe510886f62b7
MD5 0049c0b1b628b53dd03d5dc8343c2367
BLAKE2b-256 5b04cac69d89c1c6fd880165e73dc1804d369c1f286e6e1acfc42bd716e22f57

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page