A profiler for Numba

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

Profila: a profiler for Numba

This profiler is sponsored by my book on writing fast low-level code in Python, which uses Numba for most of its examples.

Here's what Profila output looks like:

$ python -m profila annotate -- scripts_for_tests/simple.py
# Total samples: 328 (54.9% non-Numba samples, 1.8% bad samples)

## File `/home/itamarst/devel/profila/scripts_for_tests/simple.py`
Lines 10 to 15:

  0.3% |     for i in range(len(timeseries)):
       |         # This should be the most expensive line:
 38.7% |         result[i] = (7 + timeseries[i] / 9 + (timeseries[i] ** 2) / 7) / 5
       |     for i in range(len(result)):
       |         # This should be cheaper:
  4.3% |         result[i] -= 1

You can also use it with Jupyter!

Beyond this README, you can also read this introductory article with a more detailed example and explanations.

TL;DR limitations: Linux only, and only single-threaded Numba can be profiled currently, parallel functions are not yet supported.

Installation

Currently Profila works on Linux only.

On macOS you can use Docker, Podman, or a Linux VM.
On Windows you can use Docker, Podman, or probably WSL2.

Install this library using pip:

pip install profila

Then, have Profila install some dependencies it needs, specifically an older version of gdb; this will not interfere with your system gdb. (This is necessary due to some bad interactions with newer gdb and Numba that will hopefully go away someday.)

python -m profila setup

Usage

Jupyter profiling

First, before you import numba you should:

%load_ext profila

Then define your functions as usual:

from numba import njit

@njit
def myfunc(arr):
    # ... your code here ...

You probably want to call your Numba function at least once, so profiling doesn't measure compilation time:

myfunc(DATA)

Then, you can profile a specific cell using the %%profila magic, e.g.

%%profila
# Make sure we run this enough to get good measurements:
for i in range(100):
    myfunc(DATA)

Command-line profiling

If you usually run your script like this:

$ python yourscript.py --arg1=200

Instead run it like this:

$ python -m profila annotate -- yourscript.py --arg1=200

If you usually run your script like this, with -m:

$ python -m yourpackage --arg1=200

Instead run it like this:

$ python -m profila annotate -- -m yourpackage --arg1=200

Sampling is done every 10 milliseconds, so you need to make sure your Numba code runs for a sufficiently long time. For example, you can run your function in a loop until a number of seconds has passed:

from time import time

@njit
def myfunc():
    # ...

start = time()
# Run for 3 seconds:
while (time() - start) < 3:
    myfunc()

The limitations of profiling output

Parallel Numba code will not be profiled correctly; at the moment only single-threaded profiling is supported.
GPU (CUDA) code is not profiled.

Beyond that:

1. The compiled code isn't the same as the input code

Compiled languages like Numba do optimization passes and transform the code to make it faster. That means the running code doesn't necessarily map one to one to the original code; different lines might be combined, for example.

As far as I can tell Numba does give you a reasonable mapping, but you can't assume the source code maps one to one to executed code.

2. Adding the necessary info can change the performance of your code

In order to profile, additional info needs to be added during compilation; specifically, the NUMBA_DEBUGINFO env variable is set. This might change runtime characteristics slightly, because it increases the memory size of the compiled code.

3. Compiled code is impacted by CPU effects that aren't obvious from profiling results

Instruction-level parallelism, branch mispredictions, SIMD, and the CPU memory caches all have a significant impact on runtime performance, but the profiling result doesn't always make it clear that's the problem. I'm writing a book about this if you want to learn more.

Changelog

v0.3.2

python -m profila setup should now work correctly in Conda environments.
Support for latest IPython.
Support for Python 3.13.

v0.3.1

Documentation improvements.

v0.3.0

Bug fixes:

Work on newer versions of Linux, like Ubuntu 24.04 and RHEL 9, in the face of issues with newer gdb.
Fixed occasional error TypeError: argument of type 'NoneType' is not iterable.

v0.2.1

Bug fixes:

Run Python using sys.executable, so it works in more environments. Thanks to Jeremiah England for the bug report.

v0.2.0

Added support for Jupyter profiling.

v0.1.0

Initial release.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

itamarst

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.3.4

Jun 26, 2026

0.3.2

Feb 28, 2025

0.3.1

Dec 17, 2024

0.3.0

Dec 16, 2024

0.2.1

Feb 20, 2024

0.2.0

Jan 31, 2024

0.1.1

Jan 30, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

profila-0.3.4.tar.gz (20.6 kB view details)

Uploaded Jun 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

profila-0.3.4-py3-none-any.whl (16.4 kB view details)

Uploaded Jun 26, 2026 Python 3

File details

Details for the file profila-0.3.4.tar.gz.

File metadata

Download URL: profila-0.3.4.tar.gz
Upload date: Jun 26, 2026
Size: 20.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for profila-0.3.4.tar.gz
Algorithm	Hash digest
SHA256	`d4d0e6657c4142cdac600a1866b9d35db44b03d6f9321dda636e0ad9617619d3`
MD5	`79a33ca99c4797b20cb515a19191e2a7`
BLAKE2b-256	`10d0ae101258db0b1f4d607df1021ca9b7c6a7ff79c44554456e3ac331e2ce29`

See more details on using hashes here.

Provenance

The following attestation bundles were made for profila-0.3.4.tar.gz:

Publisher: publish.yml on pythonspeed/profila

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: profila-0.3.4.tar.gz
- Subject digest: d4d0e6657c4142cdac600a1866b9d35db44b03d6f9321dda636e0ad9617619d3
- Sigstore transparency entry: 1972692737
- Sigstore integration time: Jun 26, 2026
Source repository:
- Permalink: pythonspeed/profila@fae88c884f4862c207f1bc039230122c72178970
- Branch / Tag: refs/tags/v0.3.4
- Owner: https://github.com/pythonspeed
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@fae88c884f4862c207f1bc039230122c72178970
- Trigger Event: release

File details

Details for the file profila-0.3.4-py3-none-any.whl.

File metadata

Download URL: profila-0.3.4-py3-none-any.whl
Upload date: Jun 26, 2026
Size: 16.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for profila-0.3.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`96ee41c3dc718eb2d70a596fae9b4806fad3cf7a9b123185be318e955dcbe834`
MD5	`4bf5371d25672ce8528ba929e2d2cd1a`
BLAKE2b-256	`ebad1a1ed78ed1fe78568632c2671f60d1ae5f9bdbe7797421d4a21a199b62f4`

See more details on using hashes here.

Provenance

The following attestation bundles were made for profila-0.3.4-py3-none-any.whl:

Publisher: publish.yml on pythonspeed/profila

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: profila-0.3.4-py3-none-any.whl
- Subject digest: 96ee41c3dc718eb2d70a596fae9b4806fad3cf7a9b123185be318e955dcbe834
- Sigstore transparency entry: 1972692740
- Sigstore integration time: Jun 26, 2026
Source repository:
- Permalink: pythonspeed/profila@fae88c884f4862c207f1bc039230122c72178970
- Branch / Tag: refs/tags/v0.3.4
- Owner: https://github.com/pythonspeed
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@fae88c884f4862c207f1bc039230122c72178970
- Trigger Event: release

profila 0.3.4

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Profila: a profiler for Numba

Installation

Usage

Jupyter profiling

Command-line profiling

The limitations of profiling output

1. The compiled code isn't the same as the input code

2. Adding the necessary info can change the performance of your code

3. Compiled code is impacted by CPU effects that aren't obvious from profiling results

Changelog

v0.3.2

v0.3.1

v0.3.0

v0.2.1

v0.2.0

v0.1.0

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance