GPU-GEMM generator
Project description
GPU-GEMM generator for the Discontinuous Galerkin method
Installation
For users
pip install gemmforge
For developers
git clone https://github.com/ravil-mobile/gemmforge.git gemmforge
cd gemmforge
pip install -e .
Usage
from gemmforge import DenseMatrix, GemmGenerator, GenerationError
from gemmforge import arch
arch = arch.produce("nvidia", "sm_60")
mat_a = DenseMatrix(num_rows=56,
num_cols=9,
addressing="strided",
bbox=[0, 0, 55, 8],
transpose=False)
mat_b = DenseMatrix(num_rows=9,
num_cols=9,
addressing="strided",
bbox=[0, 0, 8, 8],
transpose=False)
mat_c = DenseMatrix(num_rows=56,
num_cols=9,
bbox=[0, 0, 55, 8],
addressing="strided",
transpose=False)
try:
gen = GemmGenerator(arch, "float")
gen.generate(mat_a, mat_b, mat_c, alpha=1.1, beta=1.1)
print(gen.get_kernel())
print(gen.get_launcher())
print(gen.get_launcher_header())
except GenerationError as err:
print("ERROR: {}".format(err))
raise err
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
Close
Hashes for gemmforge-0.0.200-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 80e50433a666abb18edb2e66fca0035d49f24d6aba328a7918932f7fef760930 |
|
MD5 | 7b3650ba61dfa63cf49246d75898b08b |
|
BLAKE2b-256 | aa281f625f7c4d7c9699c6141ebffaf0d50bb312e3977d74fbb01110c7ec2e26 |