Skip to main content

High-performance key-value storage engine with Python bindings

Project description

PegaFlow Python Package

High-performance key-value storage engine with Python bindings, built with Rust and PyO3.

Features

  • PegaEngine: Fast Rust-based key-value storage with Python bindings
  • PegaKVConnector: vLLM KV connector for distributed inference with KV cache transfer

Installation

From Source

# Install maturin if you haven't already
pip install maturin

# Build and install in development mode
cd python
maturin develop

# Or build a wheel
maturin build --release

From PyPI (coming soon)

pip install pegaflow

Usage

Basic KV Storage

from pegaflow import PegaEngine

# Create a new engine
engine = PegaEngine()

# Store key-value pairs
engine.put("name", "PegaFlow")
engine.put("version", "0.1.0")

# Retrieve values
name = engine.get("name")  # Returns "PegaFlow"
missing = engine.get("nonexistent")  # Returns None

# Remove keys
removed = engine.remove("name")  # Returns "PegaFlow"

vLLM KV Connector

from vllm import LLM
from vllm.distributed.kv_transfer.kv_transfer_agent import KVTransferConfig

# Configure vLLM to use PegaKVConnector
kv_transfer_config = KVTransferConfig(
    kv_connector="PegaKVConnector",
    kv_role="kv_both",
    kv_connector_module_path="pegaflow.connector",
)

# Create LLM with KV transfer enabled
llm = LLM(
    model="gpt2",
    kv_transfer_config=kv_transfer_config,
)

Development

See the examples directory for more usage examples.

Testing

Running Unit Tests

The test suite includes integration tests that verify the EngineRpcClient can correctly communicate with a running pegaflow-server instance.

Prerequisites

  1. Build the Rust extension:

    cd python
    maturin develop --release
    
  2. Build the server binary:

    cd ..
    cargo build --release --bin pegaflow-server
    
  3. Ensure CUDA is available (tests require GPU):

    python -c "import torch; assert torch.cuda.is_available()"
    

Running Tests

cd python

# Run all tests
pytest tests/ -v

# Run specific test file
pytest tests/test_engine_client.py -v

# Run with coverage
pytest tests/ --cov=pegaflow --cov-report=html

Test Structure

  • tests/conftest.py: Contains pytest fixtures for:

    • pega_server: Automatically starts/stops pegaflow-server for integration tests
    • engine_client: Creates an EngineRpcClient connected to the test server
    • client_context: Provides a ClientContext representing a vLLM instance with GPU KV cache tensors
    • registered_instance: Provides a registered instance ID for query tests
  • tests/test_engine_client.py: Integration tests for:

    • Server connectivity
    • Query operations with various inputs

Test Fixtures

The ClientContext class abstracts a vLLM instance and provides:

  • register_kv_caches(): Register GPU KV cache tensors with the server
  • query(block_hashes): Query available blocks
  • unregister_context(): Unregister context from server

Example test usage:

def test_query(client_context):
    """Test query operation."""
    result = client_context.query([])
    assert result is not None

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

pegaflow_llm-0.22.2-cp314-cp314-manylinux_2_34_x86_64.whl (8.5 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.34+ x86-64

pegaflow_llm-0.22.2-cp313-cp313-manylinux_2_34_x86_64.whl (8.5 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.34+ x86-64

pegaflow_llm-0.22.2-cp312-cp312-manylinux_2_34_x86_64.whl (8.5 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.34+ x86-64

pegaflow_llm-0.22.2-cp311-cp311-manylinux_2_34_x86_64.whl (8.5 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.34+ x86-64

pegaflow_llm-0.22.2-cp310-cp310-manylinux_2_34_x86_64.whl (8.5 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.34+ x86-64

File details

Details for the file pegaflow_llm-0.22.2-cp314-cp314-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for pegaflow_llm-0.22.2-cp314-cp314-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 03b80fb0a4f6f21f856c7b16211e63c9522f46ba998fa324627f0fcc4cc64d9f
MD5 f4b34d3079254efca0da657f2a918918
BLAKE2b-256 8db3f7422d33a8ce5d721cbe3ebe922a9c1a3e5e888182ee3f99b156d7fe51b0

See more details on using hashes here.

File details

Details for the file pegaflow_llm-0.22.2-cp313-cp313-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for pegaflow_llm-0.22.2-cp313-cp313-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 2b981c25e249d4e423a2772acc51ffe75ca692753fab8fafb8f24247aea1a5fd
MD5 22ef9dbebeaf8de5939aec627ac2ff87
BLAKE2b-256 3bfe1cbbe0f076fae3bdd9861f35910791bea9ed5bcedf9469da91715b2f2bdb

See more details on using hashes here.

File details

Details for the file pegaflow_llm-0.22.2-cp312-cp312-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for pegaflow_llm-0.22.2-cp312-cp312-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 142be845ef75a2a57c52b31b35b3364b533864222ec3cd9218b1f3f1babe868f
MD5 1255d01c1a342645ecfc4a3159ca1584
BLAKE2b-256 5821dbb45b5c74d7b7717661f4b9c7afd11f254fce32c95c086f932fb9c6bbf7

See more details on using hashes here.

File details

Details for the file pegaflow_llm-0.22.2-cp311-cp311-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for pegaflow_llm-0.22.2-cp311-cp311-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 07ac8af82fbc108bc2ec1398bc00c9eb5406868c10b874922e653e22333d1b77
MD5 49d4af69ce17fabd6ca5a8808549df47
BLAKE2b-256 7c573fe37a8905b054a14495556d09c08aa503ebd59f94180a9b67bc47331688

See more details on using hashes here.

File details

Details for the file pegaflow_llm-0.22.2-cp310-cp310-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for pegaflow_llm-0.22.2-cp310-cp310-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 b800a9c58ad4b77cf4ca5392f2172e1a1a69f922ec89d602242b2983a3b7bd2c
MD5 6a155770dcb20dda49416708cbb5a4bc
BLAKE2b-256 d2300b9b579175ae200ffb7d09c2a1b9fd914d57c0c6ab9931e2fdb9e5503f25

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page