Loop Kernel Analysis and Performance Modeling Toolkit
Project description
kerncraft
Loop Kernel Analysis and Performance Modeling Toolkit
This tool allows automatic analysis of loop kernels using the Execution Cache Memory (ECM) model, the Roofline model and actual benchmarks. kerncraft provides a framework to investigate the data reuse and cache requirements by static code analysis. In combination with the Intel IACA tool kerncraft can give a good overview of both in-core and memory bottlenecks and use that data to apply performance models.
For a detailed documentation see this master’s thesis: pdf
Installation
Run: pip install kerncraft
- Additional requirements are:
Intel IACA tool, with (working) iaca.sh in PATH environment variable (used by ECM, ECMCPU and Roofline models)
likwid (used in Benchmark model and by likwid_bench_auto.py)
Usage
Get an example kernel and machine file from the examples directory
wget https://raw.githubusercontent.com/cod3monk/kerncraft/master/examples/machine-files/phinally.yaml
wget https://raw.githubusercontent.com/cod3monk/kerncraft/master/examples/kernels/2d-5pt.c
Have a look at the machine file and change it to match your targeted machine (above we downloaded a file for a sandy bridge EP machine)
Run kerncraft
kerncraft -p ECM -m phinally.yaml 2d-5pt.c -D N 10000 -D M 10000 add -vv for more information on the kernel and ECM model analysis.
Credits
Implementation: Julian Hammer ECM Model (theory): Georg Hager, Holger Stengel, Jan Treibig
License
AGPLv3
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file kerncraft-0.1.dev10.tar.gz
.
File metadata
- Download URL: kerncraft-0.1.dev10.tar.gz
- Upload date:
- Size: 45.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1691b5cd7c9b8474a4ed25636f1fd9a0ffab7339f4970e1df9c26d10e3bdc13c |
|
MD5 | 4703e67d37a7aa42b4bd8e6ba0374e41 |
|
BLAKE2b-256 | 786b0164fdc3222279567a570a8e6b0639e5cb6e8643ec619e5f2e1a7a3b8895 |