pyraisdk

AML models are typically deployed to GPU instances to provide an inference service. If the code that operates the model drives the GPU separately for each request, the overall performance of the model will be quite inefficient. This SDK provides APIs that gather inference requests into batches and run them on the GPU in a separate thread, considerably improving GPU utilization and making the model more performant.

The SDK also collects telemetry data for each request, so the performance of the model can be evaluated and tracked, and it provides logging primitives that can be used to produce additional troubleshooting information.
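The batching idea described above can be sketched in plain Python. This is only an illustrative toy, not the SDK's actual implementation (the `TinyBatcher` class and `max_batch` parameter are made-up names): a single worker thread drains a queue of pending requests and runs the model once per collected batch, so concurrent callers share model invocations.

```python
import queue
import threading
from concurrent.futures import Future
from typing import Any, Callable, List, Tuple

class TinyBatcher:
    """Toy dynamic batcher: callers enqueue single items; one worker
    thread collects them into batches and runs the model per batch."""

    def __init__(self, predict_batch: Callable[[List[str]], List[Any]], max_batch: int = 8):
        self._predict_batch = predict_batch
        self._max_batch = max_batch
        self._queue: "queue.Queue[Tuple[str, Future]]" = queue.Queue()
        threading.Thread(target=self._worker, daemon=True).start()

    def predict_one(self, item: str) -> Any:
        fut: Future = Future()
        self._queue.put((item, fut))
        return fut.result()  # block until the batch containing this item finishes

    def _worker(self) -> None:
        while True:
            # Block for the first item, then greedily grab more to form a batch.
            batch = [self._queue.get()]
            while len(batch) < self._max_batch:
                try:
                    batch.append(self._queue.get_nowait())
                except queue.Empty:
                    break
            items = [item for item, _ in batch]
            results = self._predict_batch(items)  # one model call per batch
            for (_, fut), res in zip(batch, results):
                fut.set_result(res)

# Stand-in "model": returns the length of each input string.
batcher = TinyBatcher(lambda items: [len(s) for s in items])
print(batcher.predict_one('hello'))  # 5
```

The real SDK additionally handles preprocessing, batch-size/latency trade-offs, and error propagation; this sketch only shows why batching helps when many threads call `predict_one` concurrently.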

Dynamic Batching Support

To support batching of inference requests and get the best model performance, there are APIs you must implement in your model. These APIs allow the SDK to distribute load efficiently to the GPU instances. The APIs are:

  • preprocess: Modifies the input to the model, if necessary. For example, if your model needs the input in a special JSON format instead of as a list of strings, you can perform that conversion in the preprocess method.
  • predict: Executes the model inference for a list of input strings.

Usage Examples

Build a YourModel class that inherits from pyraisdk.dynbatch.BaseModel.

from typing import List
from pyraisdk.dynbatch import BaseModel

class YourModel(BaseModel):
    def predict(self, items: List[str]) -> List[int]:
        return [len(item) for item in items]

    def preprocess(self, items: List[str]) -> List[str]:
        return [f'[{item}]' for item in items]

Initialize a pyraisdk.dynbatch.DynamicBatchModel with a YourModel instance, and call predict / predict_one for inferencing.

from pyraisdk.dynbatch import DynamicBatchModel

# prepare model
simple_model = YourModel()
batch_model = DynamicBatchModel(simple_model)

# predict
items = ['abc', '123456', 'xyzcccffaffaaa']
predictions = batch_model.predict(items)
assert predictions == [5, 8, 16]

# predict_one
item = 'abc'
prediction = batch_model.predict_one(item)
assert prediction == 5

Concurrent requests to predict / predict_one can be made from different threads.

from threading import Thread
from pyraisdk.dynbatch import DynamicBatchModel

# prepare model
simple_model = YourModel()
batch_model = DynamicBatchModel(simple_model)

# thread run function
def run(name, num):
    for step in range(num):
        item = f'{name}-{step}'
        prediction = batch_model.predict_one(item)
        assert prediction == len(item) + 2

# start concurrent inference
threads = [Thread(target=run, args=(f'{tid}', 100)) for tid in range(20)]
for t in threads:
    t.start()
for t in threads:
    t.join()

Logging & Events

Description

This module is for logging and event tracing.

Interface

def initialize(
    eh_hostname: Optional[str] = None,
    client_id: Optional[str] = None,
    eh_conn_str: Optional[str] = None,
    eh_structured: Optional[str] = None,
    eh_unstructured: Optional[str] = None,
    role: Optional[str] = None,
    instance: Optional[str] = None,
)

Parameter description for initialize:

  • eh_hostname: Fully qualified namespace, i.e. the Event Hubs endpoint URL (*.servicebus.windows.net). Default: read from $EVENTHUB_NAMESPACE.
  • client_id: Client ID of the service principal. Default: read from $UAI_CLIENT_ID.
  • eh_conn_str: Connection string of the Event Hubs namespace. Default: read from $EVENTHUB_CONN_STRING.
  • eh_structured: Structured event hub name. Default: read from $EVENTHUB_AUX_STRUCTURED.
  • eh_unstructured: Unstructured event hub name. Default: read from $EVENTHUB_AUX_UNSTRUCTURED.
  • role: Role name. Default: RemoteModel_${ENDPOINT_NAME}.
  • instance: Instance name. Default: "${ENDPOINT_VERSION}|{os.uname()[1]}" or "${ENDPOINT_VERSION}|{_probably_unique_id()}".

def event(self, key: str, code: str, numeric: float, detail: str='', corr_id: str='', elem: int=-1)
def infof(self, format: str, *args: Any)
def infocf(self, corr_id: str, elem: int, format: str, *args: Any)
def warnf(self, format: str, *args: Any)
def warncf(self, corr_id: str, elem: int, format: str, *args: Any)
def errorf(self, format: str, *args: Any)
def errorcf(self, corr_id: str, elem: int, ex: Optional[Exception], format: str, *args: Any)
def fatalf(self, format: str, *args: Any)
def fatalcf(self, corr_id: str, elem: int, ex: Optional[Exception], format: str, *args: Any)

Examples

# export EVENTHUB_AUX_UNSTRUCTURED='ehunstruct'
# export EVENTHUB_AUX_STRUCTURED='ehstruct'
# export UAI_CLIENT_ID='xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx'
# export EVENTHUB_NAMESPACE='xxx.servicebus.windows.net'

from pyraisdk import rlog
rlog.initialize()

rlog.infof('this is an info message %s', 123)
rlog.event('LifetimeEvent', 'STOP_GRACEFUL_SIGNAL', 0, 'detail info')

# export EVENTHUB_AUX_UNSTRUCTURED='ehunstruct'
# export EVENTHUB_AUX_STRUCTURED='ehstruct'
# export EVENTHUB_CONN_STRING='<connection string>'

from pyraisdk import rlog
rlog.initialize()

rlog.infocf('corrid', -1, 'this is an info message: %s', 123)
rlog.event('RequestDuration', '200', 0.01, 'this is duration in seconds')

from pyraisdk import rlog
rlog.initialize(eh_structured='ehstruct', eh_unstructured='ehunstruct', eh_conn_str='<eventhub-conn-str>')

rlog.errorcf('corrid', -1, Exception('error msg'), 'error message: %s %s', 1, 2)
rlog.event('CpuUsage', '', 0.314, detail='cpu usage', corr_id='corrid', elem=-1)
