Skip to main content

A little client for applications driven by numpy

This project has been archived.

The maintainers of this project have marked this project as archived. No new releases are expected.

Project description

Numpy with Apache Arrow Flight

PyPI version License: MIT CI Coverage Status Created with qCradle

Open in GitHub Codespaces

A Problem

We provide

  • An abstract base class for an Apache flight server
  • A client class to communicate with such servers

We efficiently transfer NumPy arrays over Apache Arrow Flight using a custom Client. The client provides a simple interface for sending NumPy arrays, performing computations, and retrieving results, all while handling the serialization and deserialization automatically in the background.

To create a server we expect the user to overload a function performing the calcutation based on a dictionary of numpy arrays.

Features

  • Seamless conversion between NumPy arrays and Arrow Tables
  • Simple interface for data transfer operations
  • Support for batch computations
  • Automatic resource management
  • Type-safe operations with proper error handling

Installation

You can install this client via

pip install numpy-flight

Usage

Basic Setup

import numpy as np
import pyarrow.flight as fl
from np.flight import Client

# Initialize the Flight client
flight_client = fl.FlightClient('grpc://localhost:8815')
client = Client(flight_client)

Sending Data

# Prepare your NumPy arrays
data = {
    'values': np.array([1, 2, 3, 4, 5]),
    'labels': np.array(['a', 'b', 'c', 'd', 'e'])
}

# Send data to the server
client.write('store_data', data)

Retrieving Data

# Get data from the server
result_table = client.get('retrieve_data')

Computing with Data

# Send data and get results in one operation
input_data = {
    'x': np.array([1, 2, 3]),
    'y': np.array([4, 5, 6])
}
results = client.compute('multiply_arrays', input_data)

API Reference

Client

__init__(client: fl.FlightClient)

Initialize the client with a Flight client instance.

write(command: str, data: Dict[str, np.ndarray])

Write NumPy arrays to the Flight server.

  • command: String identifying the operation
  • data: Dictionary mapping column names to NumPy arrays

get(command: str) -> pa.Table

Retrieve data from the Flight server.

  • command: String identifying the data to retrieve
  • Returns: PyArrow Table containing the retrieved data

compute(command: str, data: Dict[str, np.ndarray]) -> Dict[str, np.ndarray]

Perform a computation on the server and retrieve results.

  • command: String identifying the computation
  • data: Input data as dictionary of NumPy arrays
  • Returns: Dictionary of NumPy arrays containing results

Error Handling

The client includes proper error handling for common scenarios:

  • FlightError: Raised for Flight protocol communication errors
  • ValueError: Raised for data conversion errors
  • Resource cleanup is handled automatically, even in error cases

Best Practices

  • Always close the Flight client when done:
client.close()
  • Use context managers when possible to ensure proper cleanup:
with flight.FlightClient('grpc://localhost:8815') as flight_client:
    client = Client(flight_client)
    # ... perform operations
  • Handle large datasets in chunks to manage memory usage effectively.

Set Up Environment

make install

This installs/updates uv, creates your virtual environment and installs dependencies.

For adding or removing packages:

uv add/remove requests  # for main dependencies
uv add/remove requests --dev  # for dev dependencies

Contributing

  • Fork the repository
  • Create your feature branch (git checkout -b feature/amazing-feature)
  • Commit your changes (git commit -m 'Add some amazing feature')
  • Push to the branch (git push origin feature/amazing-feature)
  • Open a Pull Request

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

numpy_flight-0.0.5.tar.gz (8.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

numpy_flight-0.0.5-py3-none-any.whl (8.5 kB view details)

Uploaded Python 3

File details

Details for the file numpy_flight-0.0.5.tar.gz.

File metadata

  • Download URL: numpy_flight-0.0.5.tar.gz
  • Upload date:
  • Size: 8.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for numpy_flight-0.0.5.tar.gz
Algorithm Hash digest
SHA256 3bc0adec160e78cae8b5c9144498c9daeb782d197234c303b16721e94491110b
MD5 13e150303d4b6d088edb4e9ef2fe0f3c
BLAKE2b-256 612e688df179fc00b83e7bd8fbc80e53b8e2fbe47350f2d3885f77cdfff01944

See more details on using hashes here.

Provenance

The following attestation bundles were made for numpy_flight-0.0.5.tar.gz:

Publisher: release.yml on tschm/numpy-flight

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file numpy_flight-0.0.5-py3-none-any.whl.

File metadata

  • Download URL: numpy_flight-0.0.5-py3-none-any.whl
  • Upload date:
  • Size: 8.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for numpy_flight-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 a94429f261ca275cd9293dc5c9b1025bf040cb430d315a0a03c93b07e6ed75ac
MD5 82c0cb3fc82f3b2d648ce6379702c50c
BLAKE2b-256 34578c6c570a134170d71b6ac61d641d1a7db53e21ee7efa6c8250ae565de8fb

See more details on using hashes here.

Provenance

The following attestation bundles were made for numpy_flight-0.0.5-py3-none-any.whl:

Publisher: release.yml on tschm/numpy-flight

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page