Pixeltable: The Multimodal AI Data Plane

Pixeltable is a Python library that lets AI engineers and data scientists focus on exploration, modeling, and app development without having to deal with the customary data plumbing.

Pixeltable redefines data infrastructure and workflow orchestration for AI development. It brings together data storage, versioning, and indexing with orchestration and model versioning under a declarative table interface, with transformations, model inference, and custom logic represented as computed columns.

Installation

Pixeltable works with Python 3.9, 3.10, or 3.11 running on Linux or macOS.

pip install pixeltable

To verify that it's working:

import pixeltable as pxt
cl = pxt.Client()
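If the import fails, it can help to rule out an unsupported interpreter or platform first. The following is a minimal sketch using only the Python standard library; the helper names `environment_problems` and `installed_version` are hypothetical and are not part of Pixeltable. It encodes the supported versions and platforms stated above and looks up the installed package version through `importlib.metadata`.

```python
import platform
import sys
from importlib import metadata

# Requirements as stated in the installation notes above.
SUPPORTED_PYTHONS = {(3, 9), (3, 10), (3, 11)}
SUPPORTED_SYSTEMS = {"Linux", "Darwin"}  # platform.system() reports macOS as "Darwin"


def environment_problems(version=None, system=None):
    """Return a list of human-readable problems; an empty list means the environment looks fine.

    Hypothetical helper, not part of Pixeltable. Arguments default to the running interpreter.
    """
    version = tuple(sys.version_info[:2]) if version is None else tuple(version)
    system = platform.system() if system is None else system
    problems = []
    if version not in SUPPORTED_PYTHONS:
        problems.append(f"unsupported Python {version[0]}.{version[1]}")
    if system not in SUPPORTED_SYSTEMS:
        problems.append(f"unsupported operating system {system}")
    return problems


def installed_version(package="pixeltable"):
    """Return the installed version string of a package, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    for problem in environment_problems():
        print("warning:", problem)
    print("pixeltable version:", installed_version() or "not installed")
```

Run this with the same interpreter you used for `pip install`, so that the check covers the environment in which the package was actually installed.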

For more detailed installation instructions, see the Getting Started with Pixeltable guide. Then, check out the Pixeltable Basics tutorial for a tour of its most important features.

What problems does Pixeltable solve?

Today’s solutions for AI app development require extensive custom coding and infrastructure plumbing, and tracking lineage and versions across data transformations, models, and deployments is cumbersome. Pixeltable replaces that plumbing with a unified plane for data, models, and orchestration, removing the overhead of building and productionizing AI applications.

Why should you use Pixeltable?

  • It gives you transparency and reproducibility
    • All generated data is automatically recorded and versioned
    • You will never need to re-run a workload because you lost track of the input data
  • It saves you money
    • All data changes are automatically incremental
    • You never need to re-run pipelines from scratch just because you’re adding data
  • It integrates with any existing Python code or libraries
    • Bring your ever-changing code and workloads
    • You choose the models, tools, and AI practices (e.g., your embedding model for a vector index); Pixeltable orchestrates the data
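To make the "incremental" claim above concrete, here is a toy, pure-Python sketch of a computed column that is evaluated once per row: existing rows are backfilled when the column is added, and later inserts compute values only for the new rows. This is not Pixeltable's API or implementation; `ToyTable`, `add_computed_column`, and `insert` are hypothetical names invented for this illustration.

```python
from typing import Any, Callable, Dict, List

Row = Dict[str, Any]


class ToyTable:
    """Hypothetical in-memory table used only to illustrate incremental updates."""

    def __init__(self) -> None:
        self.rows: List[Row] = []
        self.computed: Dict[str, Callable[[Row], Any]] = {}
        self.calls = 0  # how many times any computed-column function has run

    def add_computed_column(self, name: str, fn: Callable[[Row], Any]) -> None:
        """Register a computed column and backfill it for rows that already exist."""
        self.computed[name] = fn
        for row in self.rows:
            row[name] = self._run(fn, row)

    def insert(self, new_rows: List[Row]) -> None:
        """Append rows, evaluating computed columns for the new rows only."""
        for values in new_rows:
            row = dict(values)
            for name, fn in self.computed.items():
                row[name] = self._run(fn, row)
            self.rows.append(row)

    def _run(self, fn: Callable[[Row], Any], row: Row) -> Any:
        self.calls += 1
        return fn(row)


table = ToyTable()
table.insert([{"text": "a cat"}, {"text": "two dogs"}])

# Adding the column computes it once for each of the two existing rows.
table.add_computed_column("n_words", lambda r: len(r["text"].split()))
calls_after_backfill = table.calls

# Inserting one more row runs the function once; earlier rows are not recomputed.
table.insert([{"text": "three small birds"}])
calls_after_insert = table.calls
```

The call counter stands in for compute cost: it grows with the number of new rows, not with the size of the table, which is the property the bullets above describe.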

Example Use Cases

  • Interact with video data at the frame level without having to think about frame extraction, intermediate file storage, or storage space explosion.
  • Augment your data incrementally and interactively with built-in functions and UDFs, such as image transformations, model inference, and visualizations, without having to think about data pipelines, incremental updates, or capturing function output.
  • Interact with all the data relevant to your AI application (video, images, documents, audio, structured data, JSON) through a simple dataframe-style API directly in Python. This includes:
    • similarity search on embeddings, supported by high-dimensional vector indexing
    • path expressions and transformations on JSON data
    • PIL and OpenCV image operations
    • assembling frames into videos
  • Perform keyword and image similarity search at the video frame level without having to worry about frame storage.
  • Access all Pixeltable-resident data directly as a PyTorch dataset in your training scripts.
  • Understand the compute and storage costs of your data at the granularity of individual augmentations and get cost projections before adding new data and new augmentations.
  • Rely on Pixeltable's automatic versioning and snapshot functionality to protect against regressions and to ensure reproducibility.

Project details

This version: 0.2.1

