Skip to main content

Effortless just-in-time compilation of Python functions, powered by Cython

Project description

Cycept

Effortless just-in-time compilation of Python functions, powered by Cython.

Installation

Cycept is available on PyPi:

python -m pip install cycept

Cycept requires Python 3.9 or later.

To run Cycept a C compiler needs to be installed on the system.

  • On Linux you may install GCC (Debian-like distros: sudo apt install build-essential).
  • On macOS you may install Clang (available through Xcode).
  • On Windows you may install MSVC (available through Microsoft C++ Build Tools).

If you are using Anaconda on Linux or macOS, you may also obtain a C compiler through conda install -c conda-forge c-compiler.

Once installed you can check whether Cycept functions correctly using

python -c "import cycept; cycept.check()"

If it does not work due to missing Python.h and you are running Linux, make sure to install the Python development headers (Debian-like distros: sudo apt install python3-dev if you are using the system Python).

Demo

"""Comparison of Python function JITs

Below we implement the sample function sum((a - b)**2) where a and b
are both 2D NumPy arrays. The following strategies are implemented and
compared against each other:
* Pure Python (baseline)
* NumPy
* Cycept JIT
* Cython JIT
* Numba JIT
"""

from time import perf_counter
import numpy as np

m, n = 2_000, 3_000
a = np.random.random((m, n))
b = np.random.random((m, n))

# Pure Python
def func(a, b):
    x = 0
    for i in range(a.shape[0]):
        for j in range(a.shape[1]):
            x += (a[i, j] - b[i, j])**2
    return x
tic = perf_counter()
result = func(a, b)
toc = perf_counter()
t_ref = toc - tic
print(f'Python: {result:<18} in {t_ref:.3e} s')

# NumPy
def func_numpy(a, b):
    return ((a - b)**2).sum()
tic = perf_counter()
result = func_numpy(a, b)
toc = perf_counter()
t = toc - tic
print(f'NumPy:  {result:<18} in {t:.3e} s ({int(t_ref/t)}x)')

# Cycept
import cycept
@cycept.jit
def func_cycept(a, b):
    x = 0
    for i in range(a.shape[0]):
        for j in range(a.shape[1]):
            x += (a[i, j] - b[i, j])**2
    return x
func_cycept(a[:1, :1], b[:1, :1])  # to compile
tic = perf_counter()
result = func_cycept(a, b)
toc = perf_counter()
t = toc - tic
print(f'Cycept: {result:<18} in {t:.3e} s ({int(t_ref/t)}x)')

# Cython
import cython
@cython.compile
def func_cython(a, b):
    x = 0
    for i in range(a.shape[0]):
        for j in range(a.shape[1]):
            x += (a[i, j] - b[i, j])**2
    return x
func_cython(a[:1, :1], b[:1, :1])  # to compile
tic = perf_counter()
result = func_cython(a, b)
toc = perf_counter()
t = toc - tic
print(f'Cython: {result:<18} in {t:.3e} s ({int(t_ref/t)}x)')

# Numba
import numba
@numba.njit
def func_numba(a, b):
    x = 0
    for i in range(a.shape[0]):
        for j in range(a.shape[1]):
            x += (a[i, j] - b[i, j])**2
    return x
func_numba(a[:1, :1], b[:1, :1])  # to compile
tic = perf_counter()
result = func_numba(a, b)
toc = perf_counter()
t = toc - tic
print(f'Numba:  {result:<18} in {t:.3e} s ({int(t_ref/t)}x)')

Running the above results in something similar to

Python: 1000265.9355757801 in 2.316e+00 s
NumPy:  1000265.9355757139 in 2.967e-02 s (78x)
Cycept: 1000265.9355757138 in 6.429e-03 s (360x)
Cython: 1000265.9355757801 in 7.103e-02 s (32x)
Numba:  1000265.9355757801 in 7.376e-03 s (314x)

For scientific codebases in the wild, code of the NumPy style is the most widespread. However, writing out the loops while adding a JIT can often lead to dramatic performance improvements, even when compared to NumPy. A further benefit of this is a reduced memory footprint, as no temporary arrays are created behind the scenes by the computation.

While Cycept especially shines on "array code", speedups are generally achievable on all kinds of code.

Performance benchmarks

Cycept comes with a suite of automated performance tests, all similar in nature to the demo above. To run this, do

python -c "import cycept; cycept.bench(show_func=True)"

Currently, the competitors included are

  • Pure Python
  • NumPy
  • Cycept
  • Cython
  • Numba

Note that you do not need to have all of the above installed in order to run the benchmarks.

How does it work?

When a function is to be jitted by Cycept, the first step is to infer the types of all variables. This is done by simply running the function once in pure Python and applying introspection. For this reason, it it best not to ask for too heavy a computation for the initial call. You might want to manually set the type of some variables, which is possible via Python type hints.

Special treatment is given to NumPy arrays. These are converted into Cython memoryviews to enable fast indexing, but converted back whenever NumPy array operations need to be performed.

With the types and array modifications in place, the modified source code is handed to Cython, which transpiles it into C/C++. The C compiler then compiles this into an extension module, from which we finally get the jitted function. By default, aggressive optimizations are applied at all stages.

The jitted version of the function is cached, so that the whole process does not have to be repeated for every call to the function. If however the function is called with new types, another jitted version of the function will be made, specialized to those types.

Optional arguments to @cycept.jit()

The transpilation carried out by Cycept can be tuned by various optional arguments to @cycept.jit(). These are all documented in the docstring, viewable through

python -c "import cycept; help(cycept.jit)"

Of particular interest is html, which allows for viewing of the transpiled Cython and C code by calling the __cycept__() method on the jitted function, e.g.

import cycept

@cycept.jit(html=True)
def twice(x):
    return 2*x

twice(42)           # int call
twice('hello')      # str call
twice.__cycept__()  # view all transpilations

Unit tests

The code contains a unit test suite which may be run as

python -c "import cycept; cycept.test()"

This requires pytest to be installed (python -m pip install pytest).

What's up with the name?

'Cycept' is an amalgamation of 'Cython' and 'CONCEPT', the latter of which is a cosmological simulation code that makes heavy use of code transformation, both custom and through Cython. As the author of both projects, Cycept is my attempt to extract some of the code transformation ideas buried within CONCEPT, making them available within an easy-to-use library. Though no code is shared between the projects, in many respects Cycept can be considered a spiritual successor to CONCEPT. Furthermore, 'Cycept' has a nice inception ring to it, which seems fitting for a piece of code that generates code.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cycept-0.0.6.tar.gz (33.6 kB view details)

Uploaded Source

Built Distribution

cycept-0.0.6-py3-none-any.whl (32.9 kB view details)

Uploaded Python 3

File details

Details for the file cycept-0.0.6.tar.gz.

File metadata

  • Download URL: cycept-0.0.6.tar.gz
  • Upload date:
  • Size: 33.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.0

File hashes

Hashes for cycept-0.0.6.tar.gz
Algorithm Hash digest
SHA256 0ae5c75f3fa74965e4c55c2808401e83a09655b44b5c3b0259f440d95a38a107
MD5 4615f76bd1be395c5e6680a63887affe
BLAKE2b-256 e4eb07093dbdf66c1a8ebf4727f72d3644f3eac7d08bbd8e759a6ae6ea429098

See more details on using hashes here.

File details

Details for the file cycept-0.0.6-py3-none-any.whl.

File metadata

  • Download URL: cycept-0.0.6-py3-none-any.whl
  • Upload date:
  • Size: 32.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.0

File hashes

Hashes for cycept-0.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 a54ad882fbebf0b28d2513cfdea5f662175a4232abb46027f28c38870c09f9b7
MD5 8fbc14225168e65c28b0e4d9fc8192c9
BLAKE2b-256 e7d5f3bbe58bfed151ee8bedf85f30cf5d983ee9b153cc54a3d426a90df64469

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page