Skip to main content

Python cross-version byte-code deparser

Project description

buildstatus Supported Python Versions

uncompyle6

A native Python cross-version Decompiler and Fragment Decompiler. Follows in the tradition of decompyle, uncompyle, and uncompyle2.

Introduction

uncompyle6 translates Python bytecode back into equivalent Python source code. It accepts bytecodes from Python version 1.5, and 2.1 to 3.6 or so, including PyPy bytecode and Dropbox’s Python 2.5 bytecode.

Why this?

There were a number of decompyle, uncompile, uncompyle2, uncompyle3 forks around. All of them came basically from the same code base, and almost all of them no were no longer actively maintained. Only one handled Python 3, and even there, only 3.2 or 3.3 depending on which code is used. This code pulls these together and moves forward. This project has the most complete support for Python 3.3 and above. It also addresses a number of open issues in the previous forks.

What makes this different from other CPython bytecode decompilers?: its ability to deparse just fragments and give source-code information around a given bytecode offset.

I use this to deparse fragments of code inside my trepan debuggers. For that, I need to record text fragments for all bytecode offsets (of interest). This purpose although largely compatible with the original intention is yet a little bit different. See this for more information.

The idea of Python fragment deparsing given an instruction offset can be used in showing stack traces or any program that wants to show a location in more detail than just a line number. It can be also used when source-code information does not exist and there is just bytecode information.

Requirements

This project requires Python 2.6 or later, PyPy 3-2.4, or PyPy-5.0.1. Python versions 2.4-2.7 are supported in the python-2.4 branch. The bytecode files it can read has been tested on Python bytecodes from versions 1.5, 2.1-2.7, and 3.0-3.6 and the above-mentioned PyPy versions.

Installation

This uses setup.py, so it follows the standard Python routine:

pip install -r requirements.txt
pip install -r requirements-dev.txt
python setup.py install # may need sudo
# or if you have pyenv:
python setup.py develop

A GNU makefile is also provided so make install (possibly as root or sudo) will do the steps above.

Testing

make check

A GNU makefile has been added to smooth over setting running the right command, and running tests from fastest to slowest.

If you have remake installed, you can see the list of all tasks including tests via remake --tasks

Usage

Run

$ uncompyle6 *compiled-python-file-pyc-or-pyo*

For usage help:

$ uncompyle6 -h

If you want strong verification of the correctness of the decompilation process, add the –verify option. But there are situations where this will indicate a failure, although the generated program is semantically equivalent. Using option –weak-verify will tell you if there is something definitely wrong. Generally, large swaths of code are decompiled correctly, if not the entire program.

You can also cross compare the results with pycdc . Since they work differently, bugs here often aren’t in that, and vice versa.

Known Bugs/Restrictions

The biggest known and possibly fixable (but hard) problem has to do with handling control flow. All of the Python decompilers I have looked at have the same problem. In some cases we can detect an erroneous decompilation and report that.

Over 98% of the decompilation of Python standard library packages in Python 2.7.12 verifies correctly. Over 99% of Python 2.7 and 3.3-3.5 “weakly” verify. Python 2.6 drops down to 96% weakly verifying. Other versions drop off in quality too.

Verification is the process of decompiling bytecode, compiling with a Python for that bytecode version, and then comparing the bytecode produced by the decompiled/compiled program. Some allowance is made for inessential differences. But other semantically equivalent differences are not caught. For example 1 and 0 is decompiled to the equivalent 0; remnants of the first true evaluation (1) is lost when Python compiles this. When Python next compiles 0 the resulting code is simpler.

Weak Verification on the other hand doesn’t check bytecode for equivalence but does check to see if the resulting decompiled source is a valid Python program by running the Python interpreter. Because the Python language has changed so much, for best results you should use the same Python Version in checking as used in the bytecode.

Later distributions average about 200 files. There is some work to do on the lower end Python versions which is more difficult for us to handle since we don’t have a Python interpreter for versions 1.5, 1.6, and 2.0.

In the Python 3 series, Python support is is strongest around 3.4 or 3.3 and drops off as you move further away from those versions. Python 3.6 changes things drastically by using word codes rather than byte codes. That has been addressed, but then it also changes function call opcodes and its semantics and has more problems with control flow than 3.5 has.

Currently not all Python magic numbers are supported. Specifically in some versions of Python, notably Python 3.6, the magic number has changes several times within a version. We support only the released magic. There are also customized Python interpreters, notably Dropbox, which use their own magic and encrypt bytcode. With the exception of the Dropbox’s old Python 2.5 interpreter this kind of thing is not handled.

We also don’t handle PJOrion obfuscated code. For that try: PJOrion Deobfuscator to unscramble the bytecode to get valid bytecode before trying this tool.

Handling pathologically long lists of expressions or statements is slow.

There is lots to do, so please dig in and help.

See Also

Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

uncompyle6-2.10.0.tar.gz (831.8 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

uncompyle6-2.10.0-py3.4.egg (345.6 kB view details)

Uploaded Egg

uncompyle6-2.10.0-py3.3.egg (349.9 kB view details)

Uploaded Egg

uncompyle6-2.10.0-py2.py3-none-any.whl (159.2 kB view details)

Uploaded Python 2Python 3

uncompyle6-2.10.0-py2.5.egg (330.3 kB view details)

Uploaded Egg

uncompyle6-2.10.0-py2.4.egg (333.9 kB view details)

Uploaded Egg

File details

Details for the file uncompyle6-2.10.0.tar.gz.

File metadata

  • Download URL: uncompyle6-2.10.0.tar.gz
  • Upload date:
  • Size: 831.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for uncompyle6-2.10.0.tar.gz
Algorithm Hash digest
SHA256 10a83a51b26905a15584b79de930d11ca6a69bf003d2c440b2d3b4d29fffecdf
MD5 fb950bffe361d8e6c61d47aab137ab6c
BLAKE2b-256 fdd6b7e940ccd99671022fe12559648c17350da045c2fed8c7f00f3d0e3dfc1a

See more details on using hashes here.

File details

Details for the file uncompyle6-2.10.0-py3.4.egg.

File metadata

File hashes

Hashes for uncompyle6-2.10.0-py3.4.egg
Algorithm Hash digest
SHA256 2556d4734ba2902a266b596d983ce975b33b332fc0fa2c5e44e88d2e90551897
MD5 ddc653640c0074fc44901363fc8a7248
BLAKE2b-256 2a4a8eaeff0c6ac1cf039d1bceb411dbc4031e924222e1c158ab72b958c1f5da

See more details on using hashes here.

File details

Details for the file uncompyle6-2.10.0-py3.3.egg.

File metadata

File hashes

Hashes for uncompyle6-2.10.0-py3.3.egg
Algorithm Hash digest
SHA256 35a999fe1d97a52aeaae6785d4019e9d10cb97ea099bdd491f2366b7ad6c53a6
MD5 ef26d607690cb1c47cce93ec8a29c381
BLAKE2b-256 28c63c4c62448d14d893c85db1f4ac1d8223d363c40fd58ea8b8c8c8f2308be1

See more details on using hashes here.

File details

Details for the file uncompyle6-2.10.0-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for uncompyle6-2.10.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 c3aac2c2425ef91282e8fe677950771c984c5e17388b0038f75dd3e53bdc1346
MD5 b90708f51c7be314a74c4aa744d51303
BLAKE2b-256 8cc25ae6e2bfeb7191fe0bb89f27bb35a6d98f6dcadc39eccd37ea02df701d4d

See more details on using hashes here.

File details

Details for the file uncompyle6-2.10.0-py2.5.egg.

File metadata

File hashes

Hashes for uncompyle6-2.10.0-py2.5.egg
Algorithm Hash digest
SHA256 0a43dce4fb6a942e8361b046813438974317448ded4b026768026ffeea7986b1
MD5 10f402731c879cb84ea98108af90ab79
BLAKE2b-256 044004aea0083923d631b6ad88a42bfbaec041a344185b6b27567bc4039023f7

See more details on using hashes here.

File details

Details for the file uncompyle6-2.10.0-py2.4.egg.

File metadata

File hashes

Hashes for uncompyle6-2.10.0-py2.4.egg
Algorithm Hash digest
SHA256 7da9f069d8e647ab906acf6913e083cc17c2f151312dbd7718390098569cf7a8
MD5 3ce0bf3eb22ae260d63061553a0abd52
BLAKE2b-256 9c4d1d8cbc269249ee66711628d565b46a1d6f5e5e750313461b46291faa8c01

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page