Skip to main content

An embedded streaming OLAP data pipeline with LanceDB

Project description

lancedb-tables

lancedb-tables is a python package wrapper over LanceDB that makes it easy to create and update LanceDB tables into embedded streaming OLAP data pipeline designs.

Since Lance is designed to be mutable, it is possible to create an embedded streaming pipeline using the same data source. The main advantage of this streaming approach is that it doesn't require any parquet glob file management. This reduces the complexity of setting up streaming to the same as batch processing. The other main benefit is that LanceDB leverages the Apache Arrow Standard which makes integrations into ETL pipelines using Polars and DuckDB simple.

Install with pip

pip install lancedb-tables

Install from source

  1. Clone the repository
  2. This repository uses rye to manage dependencies and the virtual environment. To install rye, refer to this link for instructions here.
  3. Once rye is installed, run rye sync to install dependencies and setup the virtual environment, which has a default name of .venv.
  4. Activate the virtual environment with the command source .venv/bin/activate.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lancedb_tables-0.1.1.tar.gz (2.8 kB view details)

Uploaded Source

Built Distribution

lancedb_tables-0.1.1-py3-none-any.whl (3.2 kB view details)

Uploaded Python 3

File details

Details for the file lancedb_tables-0.1.1.tar.gz.

File metadata

  • Download URL: lancedb_tables-0.1.1.tar.gz
  • Upload date:
  • Size: 2.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.12.2

File hashes

Hashes for lancedb_tables-0.1.1.tar.gz
Algorithm Hash digest
SHA256 aed5dc61b27bdad13393c3c2a9e5539fd20a72cede0cc6c5342de758abc3c96e
MD5 0c825ecdddc11625862495a9119750ea
BLAKE2b-256 de8051617d98419435899df71ba978fd7d2cfa26aff2d2fbb98f6cbbb36e5fc4

See more details on using hashes here.

File details

Details for the file lancedb_tables-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for lancedb_tables-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 46a1bc8d68c2a8edee4a77f13d638b750978f3fc7f90c825651533e5cfc4b725
MD5 4798713adb5552ba0583107238559759
BLAKE2b-256 3fee6b21741a62c7a634844210512cb14da8d89e13244a3ae53b62006d7e13ec

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page