TPU Python API

Project description

IVA TPU Python API

Main entities

TPUDevice

TPUDevice is a device handle

TPUProgram

TPUProgram contains TPU instructions and weigths data

TPUProgramInfo

Object can be used to configure inference.

config = TPUProgramInfo()
config.max_tasks_count = 4 # configures depth of tasks queue in driver
config.disable_static_checker = true # disables static checker for program
program = TPUProgram("program.tpu", config)

TPUInference

TPUInference contains input/output data

Example

import asyncio
import numpy as np
from iva_tpu import TPUDevice, TPUProgram, TPUInference

from iva_applications.resnet50 import image_to_tensor
from iva_applications.imagenet import tpu_tensor_to_classes
from PIL import Image

image = Image.open('ILSVRC2012_val_00000045.JPEG')
tensor = image_to_tensor(image)

device = TPUDevice()
program = TPUProgram("resnet50.tpu")  # default TPUProgramInfo is totally fine
device.load_program(program)
inference = TPUInference(program)
inference.load([tensor])
status_future = device._load_inference(inference)  # device returns future for inference status
event_loop = asyncio.get_event_loop()
status = event_loop.run_until_complete(status_future)
assert status.is_success  # check that there is no errors during inference
output = inference.get()  # get results
tpu_tensor_to_classes(output[0], top=1)

TPU Dictionary interface

...
program = TPUProgram("resnet50.tpu")
inference = TPUInference(program)
inference.load({"Placeholder:0": tensor})
...
assert status.is_success
output = inference.get(as_dist=True)
tpu_tensor_to_classes(output["logits:0"], top=1)

TPU Blocking interface

status = device.load_inference_sync(inference) #would block until completion

TPU Raw buffer examples

import asyncio
from iva_tpu import TPUDevice, TPUProgram, TPUInference, ProcessingMode
program = TPUProgram("omega_program_dnn_quant_3.0.0.tpu")
device = TPUDevice()
device.load_program(program)
inference = TPUInference(program)

with open("f.bin", "rb") as f:
    buf=f.read()

inference.load([buf], mode=ProcessingMode.RAW)
asyncio.get_event_loop().run_until_complete(device.load_inference(inference))
outputs = inference.get(mode=ProcessingMode.RAW)

for i in range(3):
  o = outputs[i]
  with open(f"o{i}.bin", "wb") as f:
    f.write(o)

TPU Single inference statistics examples

result = device.load_inference_sync(inference)
result.timings # contains statistics about inference
result.timings["queue_timings"] # contains array of timings for 3 queues (QUEUE_TRANSFER_TO, QUEUE_EXECUTOR, QUEUE_TRANSFER_FROM)
result.timings["queue_timings"][%d] # contains tuple of 2 elements: idle time and actual work time
result.timings["queue_timings"][%d][%d] # contains tuple of 3 values: last, average, maximum through all inferences for the device object
result.timings["execution_timing"][%d] # same as before but with execution on tpu timings

TPU Global statistics examples

device = TPUDevice()
device.stats # returns object with global statistics about the current device
device.stats["mem"] # current usage of memory in the device

Project details

Release history Release notifications | RSS feed

This version

15.0.24

Nov 27, 2023

15.0.23

Nov 15, 2023

15.0.22

Nov 3, 2023

15.0.21

Oct 30, 2023

15.0.20

Oct 30, 2023

15.0.19

Oct 27, 2023

15.0.18

Oct 25, 2023

15.0.17

Oct 4, 2023

15.0.16

Sep 26, 2023

15.0.15

Sep 26, 2023

15.0.14

Sep 18, 2023

15.0.13

Sep 12, 2023

15.0.12

Aug 29, 2023

15.0.11

Aug 28, 2023

15.0.10

Aug 22, 2023

15.0.9

Aug 14, 2023

15.0.8

Aug 11, 2023

15.0.7

Jun 14, 2023

15.0.4

May 15, 2023

15.0.3

May 15, 2023

15.0.2

May 10, 2023

14.7.0

Apr 11, 2023

14.6.8

Apr 10, 2023

14.6.7

Mar 7, 2023

14.6.6

Mar 6, 2023

14.6.5

Feb 28, 2023

14.6.3

Feb 27, 2023

14.6.2

Jan 20, 2023

14.6.1

Nov 24, 2022

14.6.0

Nov 7, 2022

14.5.5

Nov 3, 2022

14.5.3

Oct 19, 2022

14.5.2

Oct 19, 2022

14.5.1

Oct 17, 2022

14.5.0

Oct 12, 2022

14.4.9

Oct 11, 2022

14.4.8

Oct 7, 2022

14.4.7

Oct 7, 2022

14.4.6

Oct 7, 2022

14.4.5

Oct 6, 2022

14.4.4

Oct 3, 2022

14.4.3

Sep 23, 2022

14.4.2

Sep 22, 2022

14.4.1

Sep 13, 2022

14.4.0

Sep 12, 2022

14.2.2

Sep 9, 2022

14.2.1

May 25, 2022

14.2.0

Apr 14, 2022

14.1.1

Mar 30, 2022

14.1.0

Mar 29, 2022

14.0.1

Jan 31, 2022

14.0.0

Jan 30, 2022

13.0.18

Jan 28, 2022

13.0.17

Dec 27, 2021

13.0.16

Dec 17, 2021

13.0.15

Dec 16, 2021

13.0.13

Dec 1, 2021

13.0.8

Oct 20, 2021

13.0.7

Oct 13, 2021

13.0.6

Oct 11, 2021

13.0.5

Oct 9, 2021

13.0.4

Oct 8, 2021

13.0.3

Oct 7, 2021

13.0.2

Oct 7, 2021

13.0.1

Oct 6, 2021

12.0.1

Oct 4, 2021

12.0.0

Sep 28, 2021

2.0.1

Apr 27, 2023

2.0.0

Apr 26, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pytpu-15.0.24.tar.gz (19.2 kB view hashes)

Uploaded Nov 27, 2023 Source

Hashes for pytpu-15.0.24.tar.gz

Hashes for pytpu-15.0.24.tar.gz
Algorithm	Hash digest
SHA256	`e63ec3c5011de2dc660adcfd321227e41006aac7d8382e330d4667e8356eb27f`
MD5	`af53b9f6ef9ed91182c54ac54734495c`
BLAKE2b-256	`bd9aee7cc0a40291dcb7ef399d30b80892909c3add5e3a0ebf1bc04a8c97f0ba`