Skip to main content

Python bindings for LLVM auto-generated from the LLVM-C API

Project description

llvmcpy

llvmcpy automatically generates Python wrappers for the LLVM-C API.

Goal

The main goal of llvmcpy is to provide Python bindings for the LLVM project that are fast and require the lowest possible maintainance effort. To achive this, we use CFFI to parse the (slightly adapted) LLVM-C API header files and automatically generate a set of classes and functions to interact with them in a Pythonic way.

This project is in an early stage, but allows you to run the following code:

import sys
from llvmcpy import LLVMCPy

llvm = LLVMCPy()
buffer = llvm.create_memory_buffer_with_contents_of_file(sys.argv[1])
context = llvm.get_global_context()
module = context.parse_ir(buffer)
for function in module.iter_functions():
    for bb in function.iter_basic_blocks():
        for instruction in bb.iter_instructions():
            instruction.dump()

It's tested on all LLVM versions from 5 to 19 and on Python from 3.7 to 3.13. Supporting newer versions of the LLVM-C API should be basically effortless.

To try it out, install LLVM and install llvmcpy:

sudo apt-get install llvm
python -m venv venv
source venv/bin/activate
pip install llvmcpy

Naming of the generated classes/functions

The basic idea behind this project is to take the LLVM-C API, create a class for each data type and create a method for that class for each function in the API taking an argument of that data type as first argument.

This means that the following functions:

LLVMModuleRef LLVMCloneModule (LLVMModuleRef M)

Will become:

class Module(object):
    def clone(self):
        # ...

Note the change in the case. Use help(Module.clone) to see which LLVM-C API function a certain method is using.

Each class in llvmcpy is basically a wrapper around a pointer to an LLVM object.

If an API function doesn't take an LLVM object as a first argument, it will be part of the llvm module.

Additionally, we have some generated properties and generators for certain well known patterns in the API.

Properties

For each function starting with LLVMGet or LLVMSet in the LLVM-C API, we generate a property. For example, consider the following two functions:

void LLVMSetValueName (LLVMValueRef Val, const char *Name);
const char* LLVMGetValueName(LLVMValueRef Val);

In llvmcpy the Get/Set prefixes disappear, along with Value (the name of the class) and you can use them as properties of the Value class, e.g.:

my_value.name = "sum"
print(my_value.name)

Generators

The LLVM-C API has a recurrent pattern which allows you to navigate through the hierarchy of its class hierarchy, i.e. the pair of LLVMGetSomething and LLVMGetNextSomething functions. Something can be Function, BasicBlock and so on. llvmcpy identifies these patterns and produces a generator method which allows you to iterate over these objects in a Pythonic way:

for function in module.iter_functions():
    for bb in function.iter_basic_blocks():
        for instruction in bb.iter_instructions():
            # ...

Where are my bindings?

Bindings are automatically generated in a lazy way. Multiple installations of LLVM are supported, just set the LLVM_CONFIG environment variable to the llvm-config program in the bin/ directory of your LLVM installation and everything should work fine.

The bindings are generated in a Python script which is stored in $XDG_CACHE_DIR/llvmcpy/ (typically ~/.cache/llvmcpy) in a directory whose name is obtained by hashing the full path of the output of llvm-config --prefix concatenated with the LLVM version number. For example, for LLVM 19 installed in /usr you'll find the API bindings in ~/.cache/llvmcpy/7fea08f2e9d5108688f692e686c8528b914eda563e7069b25ef18c49ba96d7f2-19.

To generate the bindings, a working C preprocessor must be available in the system. By default, cpp (the C preprocessor part of GCC) is used. If it's not available we check if clang is available in the LLVM installation and use it.

License and credits

This project is developed and maintained by rev.ng Labs as part of the rev.ng project, and it's released under the MIT license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llvmcpy-0.2.0.tar.gz (16.3 kB view details)

Uploaded Source

Built Distribution

llvmcpy-0.2.0-py3-none-any.whl (15.6 kB view details)

Uploaded Python 3

File details

Details for the file llvmcpy-0.2.0.tar.gz.

File metadata

  • Download URL: llvmcpy-0.2.0.tar.gz
  • Upload date:
  • Size: 16.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.5

File hashes

Hashes for llvmcpy-0.2.0.tar.gz
Algorithm Hash digest
SHA256 46d066518ea7fc879ae936b94a7dec2bdd4893efc5bc0d4d10688c99f696a3cb
MD5 2732af909f2a6101017a13dc0a7d3dc0
BLAKE2b-256 870eb9916a03db73008659282bd74927915c7d65ff4afea62468135a41bf6c94

See more details on using hashes here.

File details

Details for the file llvmcpy-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: llvmcpy-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 15.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.5

File hashes

Hashes for llvmcpy-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 263c8879db81cceed5d15ec909184273664b80eff247b7c6c5b007096264e6b3
MD5 272878853f3134ba8ae3f7da4ad60805
BLAKE2b-256 14bb519b3f62b7e7112391d5a4aea2d18f7876d87cd9489f858c5d6cdc97b05e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page