Skip to main content

High level python bytecode manipulation

Project description

PyBytecodeManipulator

A high level cross-version python bytecode manipulation library build ontop of 'dis' and 'inspect'

Supports code inlining, branch removing and arbitrary code injection into existing functions.

WARNING: using bytecode manipulation on a so low level as we do can break the python runtime at any point without a warning. We circumvent a lot of safety checks normally done.

WARNING: We cannot make sure that everything works as it should, expect broken code at runtime!

Supported python versions:

  • 3.8 and below: unsupported; will produce syntax errors due to using new features
  • 3.9 (WIP)
  • 3.10 (main development)
  • 3.11[a4] (forward porting; WIP; Currently not working fully)

Other versions will not work as a lot of config is stored in .json files per-version, so you may need to provide your own .json config files for the version you need.

(Some versions might be plug-and-play, but most will require code changes additionally)

Why are there so many print()-s?

Due to the breaking nature of anything this code touches, and the absents of any traces in the function itself, we decided to add a lot of "debug" statements indicating mostly the who-has-done-what-to-which-method for the runtime. This makes debugging broken code easier, as it is more clear what happened to each transformed function.

If you want them removed, create your own Fork of this and remove them, on your own risk.

We may use in the future the logging library, so you can disable our logger instance, but we are currently in an inter-stage of the code, so other stuff has priority.

Compatibility with other libraries

  • Nuitka (https://github.com/Nuitka/Nuitka): Incompatible; will break as nuitka removes the __code__ attribute we modify
  • Other bytecode modification / analysers: Should work as long as you as the user does NOT expose intermediate results which might contain internal instructions

Debugging your injections

There is the possibility to "debug" functions using the execution emulator. It will be able to give you more information about a crash than the python core interpreter, but will be a lot slower than it.

It comes also with the possibility to run your bytecode in another interpreter version, so you can experiment with some stuff. In theory, it is also possible to run in python versions not supported by the bytecode manipulation system, but it is not recommended.

TransformationHandler() takes as an arg debug_code and debug_further_calls for activating it for all accessed methods.

BytecodePatchHelper() contains a method enable_verbose_exceptions() for activating it on that exact method.

Examples

Replacing a code constant (in the whole function body):

from bytecodemanipulation.Transformers import TransformationHandler

handler = TransformationHandler()


def test():
  return 0


handler.makeFunctionArrival("test", test)

# Replaces the constant '0' from the return with a '1'
handler.replace_method_constant("test", 0, 1)

handler.applyTransforms()

assert test() == 1

We can also select single constants, like the following:

from bytecodemanipulation.Transformers import TransformationHandler
from bytecodemanipulation.InstructionMatchers import CounterMatcher

handler = TransformationHandler()


def test():
  print(0)
  return 0


handler.makeFunctionArrival("test", test)

# Replaces the constant '0' from the return with a '1'
handler.replace_method_constant("test", 0, 1, matcher=CounterMatcher(1, 1))

handler.applyTransforms()

# This will return 1, but print 0
assert test() == 1

Code Formatting

We use the python formatting library "black" on our code

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bytecodemanipulation-0.1.9.tar.gz (43.6 kB view details)

Uploaded Source

Built Distribution

bytecodemanipulation-0.1.9-py3-none-any.whl (46.3 kB view details)

Uploaded Python 3

File details

Details for the file bytecodemanipulation-0.1.9.tar.gz.

File metadata

  • Download URL: bytecodemanipulation-0.1.9.tar.gz
  • Upload date:
  • Size: 43.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.0

File hashes

Hashes for bytecodemanipulation-0.1.9.tar.gz
Algorithm Hash digest
SHA256 3a0f4e82be1af7fce1d879cadc4aadf7c2de77d8c71b0989e4baf73f19eeb853
MD5 733344103f7d0a252c9d345806c808e8
BLAKE2b-256 de8ff64284eed87f9d181ba2d38e02b8afc4ba1bd7f968f3f9c8d18f2c6aa759

See more details on using hashes here.

File details

Details for the file bytecodemanipulation-0.1.9-py3-none-any.whl.

File metadata

File hashes

Hashes for bytecodemanipulation-0.1.9-py3-none-any.whl
Algorithm Hash digest
SHA256 5a35a679f640b927b34b028ca8e434f28c9203176ef8b78d0f0e20cc888e30f0
MD5 0efa75c502b2d5aeb1d593f1bd71f584
BLAKE2b-256 606edd943672b7d948929cac2fe7a96d0b93b768d3f99b7a59e5c6cfcb136959

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page