GPU-GEMM generator
Project description
GPU-GEMM generator for the Discontinuous Galerkin method
Installation
For users
pip install gemmforge
For developers
git clone https://github.com/ravil-mobile/gemmforge.git gemmforge
cd gemmforge
pip install -e .
Usage
from gemmforge import DenseMatrix, GemmGenerator, GenerationError
from gemmforge import arch
arch = arch.produce("nvidia", "sm_60")
mat_a = DenseMatrix(num_rows=56,
num_cols=9,
addressing="strided",
bbox=[0, 0, 55, 8],
transpose=False)
mat_b = DenseMatrix(num_rows=9,
num_cols=9,
addressing="strided",
bbox=[0, 0, 8, 8],
transpose=False)
mat_c = DenseMatrix(num_rows=56,
num_cols=9,
bbox=[0, 0, 55, 8],
addressing="strided",
transpose=False)
try:
gen = GemmGenerator(arch, "float")
gen.generate(mat_a, mat_b, mat_c, alpha=1.1, beta=1.1)
print(gen.get_kernel())
print(gen.get_launcher())
print(gen.get_launcher_header())
except GenerationError as err:
print("ERROR: {}".format(err))
raise err
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
gemmforge-0.0.202.tar.gz
(19.7 kB
view hashes)
Built Distribution
Close
Hashes for gemmforge-0.0.202-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c4b2441d76654243306178815a2d81965b157f6babf22b188166eaa4ebe30cd4 |
|
MD5 | b946cd9fe0532cd018c82ae5adf9ca4f |
|
BLAKE2b-256 | f7797e42fb420bb0177a1d9341d723a870373af325e0603504c9508f7e3c72dc |