Loop unrolling and batching for CUDA
Project description
unrollcuda
Loop unrolling and batching for CUDA
The core idea of this solution is to give a way to solve the following tasks:
- Use Loop unrolling to compute in CUDA any size and any count of dimensions array
- Use Batching to compute any size array, even if it s big that can't be fitted in GPU memory
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
unrollcuda-0.0.3.tar.gz
(4.0 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file unrollcuda-0.0.3.tar.gz.
File metadata
- Download URL: unrollcuda-0.0.3.tar.gz
- Upload date:
- Size: 4.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
abfef074d1803ea8b08db6df414f22399f03434695d2bc7416a0fb516b571fce
|
|
| MD5 |
dbc78246687351012d5d2762b3e81ccb
|
|
| BLAKE2b-256 |
4dcd20229c3ad28dea396ce0ae2028543c7f464de12faf60447333b4b8155b04
|
File details
Details for the file unrollcuda-0.0.3-py3-none-any.whl.
File metadata
- Download URL: unrollcuda-0.0.3-py3-none-any.whl
- Upload date:
- Size: 3.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4ab3f4a134182335048465e4917c6aaeef82a0bb66e74ffa114178bb45aef70c
|
|
| MD5 |
4145e2e7697766f56ff6fd555a1e7381
|
|
| BLAKE2b-256 |
cee3832e53c504593465cf645d683377799cd5751fe3c1fceb95d7ea63afa123
|