Skip to main content

Nexus SDK library for Python.

Project description

Introduction

Nexus SDK Py is a Python development kit for Nexus client applications. It builds upon Golang Client via cgo. Python SDK does not use any Python-level HTTP middleware for Nexus interactions, but authentication might require it.

SDK is tested against a Nexus stack in a docker-compose deployment, backed by kind Kubernetes clusters.

Quickstart

Install CGO library from Go SDK by running:

chmod +x ./sdk-installer.sh
./sdk-installer.sh

Setting up the development environment

To set up the development environment, follow these steps:

To start a local Nexus stack for testing, run:

just up

The command will start the Kind cluster and deploy Nexus stack on it. You can then run tests against this local deployment.

The Nexus Scheduler API will be available at http://localhost:5555/scheduler and the Nexus Receiver API will be available at http://localhost:8080/receiver. Note that the cluster's ingress configured to rewrite URL paths to reduce number of ports you need to interact with and reduce probability of errors related to ports collision. For example, when you send a request to http://localhost:5555/scheduler/api/something/something, it will be automatically rewritten to http://scheduler-pod:8080/api/something/something and forwarded to the Scheduler API.

The Scylla DB will be available at localhost:9042 and you can connect to it using any CQL client. The default credentials are cassandra/cassandra.

The MinIO s3 API will be available at localhost:9000 and you can connect to it using any S3 client. The default credentials are minioadmin/minioadmin. The minio console is not exposed, but you can use the mc client to interact with it.

In case you are testing changes for Go SDK, clone branch you are testing and compile the .so file from source:

go build -v -buildmode=c-shared -o nexus_sdk.so main.go

Afterwards, copy the nexus_sdk.so under nexus_client_sdk/.extensions/nexus_sdk.so.

Initialize a client and retrieve results for a tagged submission:

from nexus_client_sdk.models.access_token import AccessToken
from nexus_client_sdk.models.scheduler import SdkCustomRunConfiguration
from nexus_client_sdk.clients.nexus_scheduler_client import NexusSchedulerClient

token = "..."
client = NexusSchedulerClient.create("https://localhost:8080", lambda: AccessToken.empty())

alg_params = {"field1": {"field2": 1, "field3": "abc"}, "field4": "cde"}

# create a run
new_run = client.create_run(
    algorithm_parameters=alg_params,
    algorithm_name="test-algorithm",
    custom_configuration=SdkCustomRunConfiguration.create(version="v1.2.3"),
    tag="test-py-sdk",
    payload_valid_for="6h",
)

print(f"Run id: {new_run}")

for result in client.get_run_results("abc"):
    print(result)

Nexus Development Framework

Apart from API clients for Nexus, SDK ships a development framework under nexus subpackage. It allows to create production-grade, asyncio-native ML/AI solutions that use a unified structure and are compose of objects and object relations, rather than methods. Nexus turns ML/AI apps into standard Python applications and removes the common noise found in notebook- type code, such as variable reassignment, frequent data copying due to lack of reusable code, copy-paste of code etc.

Nexus's design makes life even easier when using AI code generation, as it is essentially a framework an AI agent can follow to generate a working data science pipeline. Nexus takes care of result accounting, error handling, logging, metric reporting and, most importantly, execution flow. A key feature in Nexus is automatic resolution of execution graph via dependency injection. In essence, a developer just needs to specify which inputs are required for an algorithm to run, and provide class implementations for this, and Nexus will take care of the rest. This also implies that whether an IO operation happens, such as a database read or a file load, Nexus will utilize asyncio coroutines to run multiple IO ops in parallel, significantly increasing the execution speed, without any need for a developer to understand async programming.

For a example of how to use Nexus, take a look at a Sample Algorithm and a corresponding test configuration and a test itself.

Execution tree

Nexus provides a set of utilities that allow viewing and inspecting the execution tree:

from nexus_client_sdk.nexus.execution.trees import get_tree
from tests.sample_algorithm.sample_main import TestAlgorithm

print(get_tree(TestAlgorithm).serialize())

# graph TB
# TESTALGORITHM["TestAlgorithm"] --> XYPROCESSOR["XYProcessor"] --> XYREADER["XYReader"]
# TESTALGORITHM["TestAlgorithm"] --> ZPROCESSOR["ZProcessor"] --> ZREADER["ZReader"]

Handling Compressed Payloads

Nexus supports reading compressed payloads for efficient data transfer. When a payload is compressed, it must include both the compressed content and a reference to the decompression function.

Payload Structure

A compressed payload should be a json with the following keys:

  • content: The compressed data (as a base64-encoded string).
  • decompressor_import_path: The Python import path to the decompression function.

Example:

{
    "content": "SGVsbG8gd29ybGQ=",  # base64-encoded string of compressed bytes
    "decompressor_import_path": "my_module.my_decompress"
}

When Nexus receives such a payload, it will:

  1. Base64-decode the content field to obtain the compressed bytes.
  2. Dynamically import and call the function specified by decompressor_import_path to decompress the payload.
  3. Use the decompressed data as the actual payload for the algorithm.

This mechanism allows for flexible, pluggable decompression logic, as long as the function path is importable and callable in the runtime environment.

Automatic Payload Compression

Nexus can automatically compress and decompress payloads when using RemoteAlgorithm. To use this feature, you must first configure it with environment variables and then explicitly enable it in your RemoteAlgorithm implementation.

Step 1: Configuration (Environment Variables)

First, you need to provide the Python import paths for your compression and decompression logic. Setting these environment variables allows Nexus to create an injectable Compressor service.

  • NEXUS__REMOTE_ALGORITHM__COMPRESSION_IMPORT_PATH: The import path to your compression function (e.g., my_module.my_compress).
  • NEXUS__REMOTE_ALGORITHM__DECOMPRESSION_IMPORT_PATH: The import path to your decompression function (e.g., my_module.my_decompress).

Step 2: Enabling Compression in Your Algorithm

Once the environment variables are set, you can activate compression on a RemoteAlgorithm instance by providing two arguments during its initialization:

  1. compress_payload=True: This boolean flag signals your intent to use compression for this remote algorithm.
  2. compressor=<injected_compressor_instance>: You must inject the Compressor service that Nexus creates from your environment variables.

Important Requirement

For compression to work, both conditions must be met. The application will raise an error if compress_payload is set to True but a valid Compressor instance is not injected. Ensure that the required environment variables are set so the Compressor service can be created and injected successfully.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

nexus_client_sdk-1.6.0-cp311-cp311-manylinux_2_28_x86_64.whl (25.1 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

nexus_client_sdk-1.6.0-cp311-cp311-manylinux_2_28_aarch64.whl (23.0 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

nexus_client_sdk-1.6.0-cp311-cp311-macosx_13_0_arm64.whl (11.4 MB view details)

Uploaded CPython 3.11macOS 13.0+ ARM64

File details

Details for the file nexus_client_sdk-1.6.0-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nexus_client_sdk-1.6.0-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0926ff4ba57cb47bc0e6b9860f4f3c2960d8467dba409c09c694d80b333f550b
MD5 5fe13b2dc8f8bbc58bff5b69ce2986b6
BLAKE2b-256 8c34a89398693fee3a3f7e0e00d0fca21e92f3a6d2efc970d57c059ed55525c4

See more details on using hashes here.

File details

Details for the file nexus_client_sdk-1.6.0-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nexus_client_sdk-1.6.0-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 a4a296bf980af3e05c5e9b125b6c5b74e80baf6fdfa3d732f85f75b3c8a83c68
MD5 81700c219ae543715482a3f690ee5189
BLAKE2b-256 1d2c1f3f81d527dbb849e738fe6d68e2f7cebbc3175746f79efa49cdc45acdcd

See more details on using hashes here.

File details

Details for the file nexus_client_sdk-1.6.0-cp311-cp311-macosx_13_0_arm64.whl.

File metadata

File hashes

Hashes for nexus_client_sdk-1.6.0-cp311-cp311-macosx_13_0_arm64.whl
Algorithm Hash digest
SHA256 21040846c349ff2ff2fca260087fbb5aa41d113349bd4619acf68d961e3661ab
MD5 27989af662afe458905a6b62e4da709c
BLAKE2b-256 12cff5639dc471e06b280209b21cb83df262943005b0afa511b997d4026e7ec1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page