Skip to main content

SageFlow - Vector-native stream processing engine for incremental semantic state snapshots

Project description

sageFlow

sageFlow is a cutting-edge, vector-native stream processing engine designed specifically to maintain and materialize semantic state snapshots for real-time, LLM-based generation tasks. The engine offers a declarative API to compose stateful vector operations within temporal windows, enabling fast and efficient updates to semantic context for dynamically changing datasets.

Features

  • Vector-Native Stream Processing: At its core, sageFlow is built to handle high-dimensional vector streams efficiently.
  • Declarative API: Easily compose complex, stateful vector operations such as TopK, Filter, and Join within defined temporal windows.
  • Incremental Low-Latency Updates: Optimized for incremental computations, ensuring semantic states are updated with minimal delay.
  • Optimized Three-Phase Pipeline: Abstracts stream processing into three distinct phases—ingestion, state materialization, and snapshot exposure—unlocking significant optimization opportunities.
  • Stateful and Windowed Operations: Natively supports windowing to create time-bound semantic snapshots from continuous data streams.

Key Use Cases

  • Real-time LLM Generation: Provide large language models with fresh, stateful context snapshots for more accurate and timely responses.
  • Dynamic Context Maintenance: Ideal for conversational AI or interactive applications where the context evolves rapidly over time.
  • Streaming Data Analytics: Serve high-velocity data analysis use cases that require complex, stateful semantic queries on vector data.
  • Adaptive Recommendation Systems: Build systems that can update recommendations in real-time based on the most recent user interactions and streaming events.

Setup

To setup sageFlow and it's dependencies, begin by making sure that you have docker installed, or any Linux release version that contains apt, such as Ubuntu or Debian

We suggest first begin with docker before you are familiar with sageFlow.

Quick Installation (Ubuntu/Debian)

For a quick one-click installation of all dependencies including DiskANN support, run:

cd <PATH_TO_REPO>
sudo ./scripts/install-deps.sh

This script will install:

  • Build essentials (gcc, g++, cmake, etc.)
  • DiskANN dependencies (libaio, boost, etc.)
  • Intel MKL (Math Kernel Library)
  • Environment configuration

After installation, reload your environment:

source /etc/profile.d/mkl.sh

Docker

make sure you have installed Docker and Docker is running

Windows

cd <PATH_TO_REPO>/setup
./start_win.bat

Linux

cd <PATH_TO_REPO>/setup
./start.sh

Linux with apt

check the dependencies in <PATH_TO_REPO>/setup/Dockerfile, and build your env

sageFlow: examples

run the following commands to generate examples

cmake -B build
cmake --build build -j $(nproc)

This will generate the examples in the build/bin directory. You can run the examples with:

./build/bin/itopk

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

isage_flow-0.1.1-cp311-cp311-manylinux_2_34_x86_64.whl (1.0 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.34+ x86-64

File details

Details for the file isage_flow-0.1.1-cp311-cp311-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for isage_flow-0.1.1-cp311-cp311-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 ea90fda5b2b3a3e13d0b827dcd3269c4761f04e6b892621cfe0d832e16f93a39
MD5 19b5a89a39af8ace15a1efa02617ebc8
BLAKE2b-256 44801050427d8f71e1528da23e391a5d8a771d821766dfaac0545e55b8312c5b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page