Skip to main content

A simple framework for accelerating deep learning inference runtime.

Project description

dl_acceleration

Installation

Build from source

git clone https://gitlab.gnomondigital.com/fzyuan/dl_acceleration.git
DLACC_HOME = ./dl_acceleration
PYTHONPATH=$DLACC_HOME:${PYTHONPATH}
export TOKENIZERS_PARALLELISM=false

Install via pip

pip install dlacc

Python SDK

python3.9 setup.py sdist bdist_wheel
python3.9 -m twine upload dist/* --verbose

Features

  • Automatic Optimization
  • Benchmark with various metrics (mean inference time, improvement compare, ..)
  • Output optimized models
  • Save tuning log
  • Support pytorch and onnx models, for tensorflow models, see https://github.com/onnx/tensorflow-onnx

Usage

Command line

python3.9 main.py --config example1.json

Python script

View getting_started.ipynb

Supported Targets

['aocl', 'hybrid', 'nvptx', 'sdaccel', 'opencl', 'metal', 'hexagon', 'aocl_sw_emu', 'rocm', 'webgpu', 'llvm', 'cuda', 'vulkan', 'ext_dev', 'c']

Specifying the correct target can have a huge impact on the performance of the compiled module, as it can take advantage of hardware features available on the target. For more information, please refer to Auto-tuning a convolutional network for x86 CPU. We recommend identifying which CPU you are running, along with optional features, and set the target appropriately. For example, for some processors target = "llvm -mcpu=skylake", or target = "llvm -mcpu=skylake-avx512" for processors with the AVX-512 vector instruction set.

Notes:

Generally:

  • Use 'cuda' for GPU backend;
  • Use 'llvm' for CPU backend.

specify num_measure_trials=20000 for best performance tuning for optimum.run() method call.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dlacc-1.7.tar.gz (9.4 kB view details)

Uploaded Source

Built Distribution

dlacc-1.7-py3-none-any.whl (9.7 kB view details)

Uploaded Python 3

File details

Details for the file dlacc-1.7.tar.gz.

File metadata

  • Download URL: dlacc-1.7.tar.gz
  • Upload date:
  • Size: 9.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.9.12

File hashes

Hashes for dlacc-1.7.tar.gz
Algorithm Hash digest
SHA256 fe6f1e2c0b652a64a5170786707039fdd8d9cdc109eaf8e48ca4c9dd17eb375a
MD5 2b427c7ade58b9b64a5d605bef3d1490
BLAKE2b-256 ad43b5024b5d00a51cf6bb81cec7f8dbe846fba6cd726f91c57857fbabf58030

See more details on using hashes here.

File details

Details for the file dlacc-1.7-py3-none-any.whl.

File metadata

  • Download URL: dlacc-1.7-py3-none-any.whl
  • Upload date:
  • Size: 9.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.9.12

File hashes

Hashes for dlacc-1.7-py3-none-any.whl
Algorithm Hash digest
SHA256 d074da029508d14188f6b05c5a51dc241e2a9e573bf9e2c20d8ebd3ee251137a
MD5 0ea02c6f24424be0daf1f9d5d82323f9
BLAKE2b-256 e0f469ec4fa242339f6c30a062eec1484a6e55c821efa79cc0636166b02e2870

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page