Skip to main content

A little client for applications driven by numpy

This project has been archived.

The maintainers of this project have marked this project as archived. No new releases are expected.

Project description

Numpy with Apache Arrow Flight

PyPI version License: MIT CI Coverage Status Created with qCradle

Open in GitHub Codespaces

A Problem

We provide

  • An abstract base class for an Apache flight server
  • A client class to communicate with such servers

We efficiently transfer NumPy arrays over Apache Arrow Flight using a custom Client. The client provides a simple interface for sending NumPy arrays, performing computations, and retrieving results, all while handling the serialization and deserialization automatically in the background.

To create a server we expect the user to overload a function performing the calcutation based on a dictionary of numpy arrays.

Features

  • Seamless conversion between NumPy arrays and Arrow Tables
  • Simple interface for data transfer operations
  • Support for batch computations
  • Automatic resource management
  • Type-safe operations with proper error handling

Installation

You can install this client via

pip install numpy-flight

Usage

Basic Setup

import numpy as np
import pyarrow.flight as fl
from np.flight import Client

# Initialize the Flight client
flight_client = fl.FlightClient('grpc://localhost:8815')
client = Client(flight_client)

Sending Data

# Prepare your NumPy arrays
data = {
    'values': np.array([1, 2, 3, 4, 5]),
    'labels': np.array(['a', 'b', 'c', 'd', 'e'])
}

# Send data to the server
client.write('store_data', data)

Retrieving Data

# Get data from the server
result_table = client.get('retrieve_data')

Computing with Data

# Send data and get results in one operation
input_data = {
    'x': np.array([1, 2, 3]),
    'y': np.array([4, 5, 6])
}
results = client.compute('multiply_arrays', input_data)

API Reference

Client

__init__(client: fl.FlightClient)

Initialize the client with a Flight client instance.

write(command: str, data: Dict[str, np.ndarray])

Write NumPy arrays to the Flight server.

  • command: String identifying the operation
  • data: Dictionary mapping column names to NumPy arrays

get(command: str) -> pa.Table

Retrieve data from the Flight server.

  • command: String identifying the data to retrieve
  • Returns: PyArrow Table containing the retrieved data

compute(command: str, data: Dict[str, np.ndarray]) -> Dict[str, np.ndarray]

Perform a computation on the server and retrieve results.

  • command: String identifying the computation
  • data: Input data as dictionary of NumPy arrays
  • Returns: Dictionary of NumPy arrays containing results

Error Handling

The client includes proper error handling for common scenarios:

  • FlightError: Raised for Flight protocol communication errors
  • ValueError: Raised for data conversion errors
  • Resource cleanup is handled automatically, even in error cases

Best Practices

  • Always close the Flight client when done:
client.close()
  • Use context managers when possible to ensure proper cleanup:
with flight.FlightClient('grpc://localhost:8815') as flight_client:
    client = Client(flight_client)
    # ... perform operations
  • Handle large datasets in chunks to manage memory usage effectively.

Set Up Environment

make install

This installs/updates uv, creates your virtual environment and installs dependencies.

For adding or removing packages:

uv add/remove requests  # for main dependencies
uv add/remove requests --dev  # for dev dependencies

Contributing

  • Fork the repository
  • Create your feature branch (git checkout -b feature/amazing-feature)
  • Commit your changes (git commit -m 'Add some amazing feature')
  • Push to the branch (git push origin feature/amazing-feature)
  • Open a Pull Request

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

numpy_flight-0.0.6.tar.gz (8.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

numpy_flight-0.0.6-py3-none-any.whl (8.5 kB view details)

Uploaded Python 3

File details

Details for the file numpy_flight-0.0.6.tar.gz.

File metadata

  • Download URL: numpy_flight-0.0.6.tar.gz
  • Upload date:
  • Size: 8.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for numpy_flight-0.0.6.tar.gz
Algorithm Hash digest
SHA256 fc65df20a37c5ee9fa677499367d749f5392be08721b0e20f9e8f2bd4f406057
MD5 7fa9d66b40ca7d68f5074889934dffd7
BLAKE2b-256 70ebfe3fcc05447d3fe837c26fb7255e7072e98cc52097f743e7217245199b16

See more details on using hashes here.

Provenance

The following attestation bundles were made for numpy_flight-0.0.6.tar.gz:

Publisher: release.yml on tschm/numpy-flight

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file numpy_flight-0.0.6-py3-none-any.whl.

File metadata

  • Download URL: numpy_flight-0.0.6-py3-none-any.whl
  • Upload date:
  • Size: 8.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for numpy_flight-0.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 cdd17465989b06c2a48e166f64378eda795bd1e5b7fa89551bf0892c34d0fb43
MD5 9791f5fc57227848ba607ee82317e280
BLAKE2b-256 f0703f00f257c841e21609b525c60630e01164dd57920d0dfe8fd5e99e8a2b96

See more details on using hashes here.

Provenance

The following attestation bundles were made for numpy_flight-0.0.6-py3-none-any.whl:

Publisher: release.yml on tschm/numpy-flight

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page