A simple framework for accelerating deep learning inference runtime.
Project description
dl_acceleration
Installation
Build from source
git clone https://gitlab.gnomondigital.com/fzyuan/dl_acceleration.git
DLACC_HOME = ./dl_acceleration
PYTHONPATH=$DLACC_HOME:${PYTHONPATH}
export TOKENIZERS_PARALLELISM=false
Install via pip
pip install dlacc
Python SDK
python3.9 setup.py sdist bdist_wheel
python3.9 -m twine upload dist/* --verbose
Features
- Automatic Optimization
- Benchmark with various metrics (mean inference time, improvement compare, ..)
- Output optimized models
- Save tuning log
- Support pytorch and onnx models, for tensorflow models, see https://github.com/onnx/tensorflow-onnx
Usage
Command line
python3.9 main.py --config example1.json
Python script
View getting_started.ipynb
Supported Targets
['aocl', 'hybrid', 'nvptx', 'sdaccel', 'opencl', 'metal', 'hexagon', 'aocl_sw_emu', 'rocm', 'webgpu', 'llvm', 'cuda', 'vulkan', 'ext_dev', 'c']
Specifying the correct target can have a huge impact on the performance of the compiled module, as it can take advantage of hardware features available on the target. For more information, please refer to Auto-tuning a convolutional network for x86 CPU. We recommend identifying which CPU you are running, along with optional features, and set the target appropriately. For example, for some processors target = "llvm -mcpu=skylake", or target = "llvm -mcpu=skylake-avx512" for processors with the AVX-512 vector instruction set.
Notes:
Generally:
- Use 'cuda' for GPU backend;
- Use 'llvm' for CPU backend.
specify num_measure_trials=20000 for best performance tuning for optimum.run() method call.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file dlacc-1.7.tar.gz
.
File metadata
- Download URL: dlacc-1.7.tar.gz
- Upload date:
- Size: 9.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.9.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fe6f1e2c0b652a64a5170786707039fdd8d9cdc109eaf8e48ca4c9dd17eb375a |
|
MD5 | 2b427c7ade58b9b64a5d605bef3d1490 |
|
BLAKE2b-256 | ad43b5024b5d00a51cf6bb81cec7f8dbe846fba6cd726f91c57857fbabf58030 |
File details
Details for the file dlacc-1.7-py3-none-any.whl
.
File metadata
- Download URL: dlacc-1.7-py3-none-any.whl
- Upload date:
- Size: 9.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.9.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d074da029508d14188f6b05c5a51dc241e2a9e573bf9e2c20d8ebd3ee251137a |
|
MD5 | 0ea02c6f24424be0daf1f9d5d82323f9 |
|
BLAKE2b-256 | e0f469ec4fa242339f6c30a062eec1484a6e55c821efa79cc0636166b02e2870 |