Skip to main content

Python cross-version byte-code deparser

Project description

buildstatus

uncompyle6

A native Python cross-version Decompiler and Fragment Decompiler. The successor to decompyle, uncompyle, and uncompyle2.

Introduction

uncompyle6 translates Python bytecode back into equivalent Python source code. It accepts bytecodes from Python version 1.5, and 2.1 to 3.7 or so, including PyPy bytecode and Dropbox’s Python 2.5 bytecode.

Why this?

There were a number of decompyle, uncompile, uncompyle2, uncompyle3 forks around. All of them came basically from the same code base, and almost all of them no were no longer actively maintained. Only one handled Python 3, and even there, only 3.2 or 3.3 depending on which code is used. This code pulls these together and moves forward. This project has the most complete support for Python 3.3 and above. It also addresses a number of open issues in the previous forks.

What makes this different from other CPython bytecode decompilers?: its ability to deparse just fragments and give source-code information around a given bytecode offset.

I use this to deparse fragments of code inside my trepan debuggers. For that, I need to record text fragments for all bytecode offsets (of interest). This purpose although largely compatible with the original intention is yet a little bit different. See this for more information.

The idea of Python fragment deparsing given an instruction offset can be used in showing stack traces or any program that wants to show a location in more detail than just a line number. It can be also used when source-code information does not exist and there is just bytecode information.

Requirements

This project requires Python 2.6 or later, PyPy 3-2.4, or PyPy-5.0.1. Python versions 2.4-2.7 are supported in the python-2.4 branch. The bytecode files it can read has been tested on Python bytecodes from versions 1.5, 2.1-2.7, and 3.0-3.6 and the above-mentioned PyPy versions.

Installation

This uses setup.py, so it follows the standard Python routine:

pip install -e .
pip install -r requirements-dev.txt
python setup.py install # may need sudo
# or if you have pyenv:
python setup.py develop

A GNU makefile is also provided so make install (possibly as root or sudo) will do the steps above.

Testing

make check

A GNU makefile has been added to smooth over setting running the right command, and running tests from fastest to slowest.

If you have remake installed, you can see the list of all tasks including tests via remake --tasks

Usage

Run

$ uncompyle6 *compiled-python-file-pyc-or-pyo*

For usage help:

$ uncompyle6 -h

If you want strong verification of the correctness of the decompilation process, add the –verify option. But there are situations where this will indicate a failure, although the generated program is semantically equivalent. Using option –weak-verify will tell you if there is something definitely wrong. Generally, large swaths of code are decompiled correctly, if not the entire program.

You can also cross compare the results with pycdc . Since they work differently, bugs here often aren’t in that, and vice versa.

Known Bugs/Restrictions

The biggest known and possibly fixable (but hard) problem has to do with handling control flow. All of the Python decompilers I have looked at have the same problem. In some cases we can detect an erroneous decompilation and report that.

Over 98% of the decompilation of Python standard library packages in Python 2.7.12 verifies correctly. Over 99% of Python 2.7 and 3.3-3.5 “weakly” verify. Python 2.6 drops down to 96% weakly verifying. Other versions drop off in quality too.

Verification is the process of decompiling bytecode, compiling with a Python for that bytecode version, and then comparing the bytecode produced by the decompiled/compiled program. Some allowance is made for inessential differences. But other semantically equivalent differences are not caught. For example 1 and 0 is decompiled to the equivalent 0; remnants of the first true evaluation (1) is lost when Python compiles this. When Python next compiles 0 the resulting code is simpler.

Weak Verification on the other hand doesn’t check bytecode for equivalence but does check to see if the resulting decompiled source is a valid Python program by running the Python interpreter. Because the Python language has changed so much, for best results you should use the same Python Version in checking as used in the bytecode.

Later distributions average about 200 files. There is some work to do on the lower end Python versions which is more difficult for us to handle since we don’t have a Python interpreter for versions 1.5, 1.6, and 2.0.

In the Python 3 series, Python support is is strongest around 3.4 or 3.3 and drops off as you move further away from those versions. Python 3.6 changes things drastically by using word codes rather than byte codes. That has been addressed, but then it also changes function call opcodes and its semantics and has more problems with control flow than 3.5 has.

Currently not all Python magic numbers are supported. Specifically in some versions of Python, notably Python 3.6, the magic number has changes several times within a version. We support only the released magic. There are also customized Python interpreters, notably Dropbox, which use their own magic and encrypt bytcode. With the exception of the Dropbox’s old Python 2.5 interpreter this kind of thing is not handled.

We also don’t handle PJOrion obfuscated code. For that try: PJOrion Deobfuscator to unscramble the bytecode to get valid bytecode before trying this tool.

Handling pathologically long lists of expressions or statements is slow.

There is lots to do, so please dig in and help.

See Also

Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

uncompyle6-2.13.0-py3.6.egg (331.5 kB view details)

Uploaded Egg

uncompyle6-2.13.0-py3.5.egg (338.1 kB view details)

Uploaded Egg

uncompyle6-2.13.0-py3.4.egg (339.6 kB view details)

Uploaded Egg

uncompyle6-2.13.0-py3.3.egg (343.9 kB view details)

Uploaded Egg

uncompyle6-2.13.0-py2.7.egg (333.3 kB view details)

Uploaded Egg

uncompyle6-2.13.0-py2.6.egg (334.0 kB view details)

Uploaded Egg

uncompyle6-2.13.0-py2.5.egg (330.4 kB view details)

Uploaded Egg

uncompyle6-2.13.0-py2.4.egg (325.9 kB view details)

Uploaded Egg

File details

Details for the file uncompyle6-2.13.0-py3.6.egg.

File metadata

File hashes

Hashes for uncompyle6-2.13.0-py3.6.egg
Algorithm Hash digest
SHA256 2e43e01477b63458c634825af184be342c0090aa1fe360c3a681a1067fd1f8bc
MD5 2e85d85e38c80bd775b8c6673ec48b74
BLAKE2b-256 753f6597227b437786d0a963d4d2b8798d8099c728e5176e525b92808d3808a4

See more details on using hashes here.

File details

Details for the file uncompyle6-2.13.0-py3.5.egg.

File metadata

File hashes

Hashes for uncompyle6-2.13.0-py3.5.egg
Algorithm Hash digest
SHA256 6ee080bc78d5cfbc5a7fbf6cf95608757d81294edaa29275736dc2755dc658cb
MD5 ec63c4723381a335c088a1efb7eb37a0
BLAKE2b-256 d3a97b54a19703948bfd27fcb846aa7ca97a4877ffb6e23d01c936922066a4c8

See more details on using hashes here.

File details

Details for the file uncompyle6-2.13.0-py3.4.egg.

File metadata

File hashes

Hashes for uncompyle6-2.13.0-py3.4.egg
Algorithm Hash digest
SHA256 9f7f504aa9a51ec3204cc6fb3862b95a4ce758a824e353e02fea90868c9cdcf3
MD5 6d1ea922903ca3f4e27aa3fac7d1c5d0
BLAKE2b-256 72aa1542258f47300c7d67beb52a4898365093a531c1135a670ea1c914b7be13

See more details on using hashes here.

File details

Details for the file uncompyle6-2.13.0-py3.3.egg.

File metadata

File hashes

Hashes for uncompyle6-2.13.0-py3.3.egg
Algorithm Hash digest
SHA256 7eb2f12e5af0c57bf8a06a9d750d387003d456db015c1896b9b2be9b587a18d2
MD5 c9f871e8956c10090745ecb2e995de99
BLAKE2b-256 2a2435ae591a4488a0c5fa1be03bc99f1eed06a15c6c2c9ffc220768679e2c57

See more details on using hashes here.

File details

Details for the file uncompyle6-2.13.0-py2.7.egg.

File metadata

File hashes

Hashes for uncompyle6-2.13.0-py2.7.egg
Algorithm Hash digest
SHA256 f817b38930669cd139bffcdad5552d829e7a419d0c67aeb8c7006c95a7ced72b
MD5 54cc8b1266da018551c1866d62aa901a
BLAKE2b-256 a8a36f3d5e2a2d5509cadd24ba07c783d715f28ba84abfcb7b89018328017c18

See more details on using hashes here.

File details

Details for the file uncompyle6-2.13.0-py2.6.egg.

File metadata

File hashes

Hashes for uncompyle6-2.13.0-py2.6.egg
Algorithm Hash digest
SHA256 73de933510fb1883dd3dab6cc01114be85465fce21462f5d83b5410ced419c25
MD5 8e1ce2d2bd01da39546e29e4bcc0c801
BLAKE2b-256 a7285675bb5e282222fcf74d77a5d00ad72b007f4f300c388d9534c29191774f

See more details on using hashes here.

File details

Details for the file uncompyle6-2.13.0-py2.5.egg.

File metadata

File hashes

Hashes for uncompyle6-2.13.0-py2.5.egg
Algorithm Hash digest
SHA256 31abb88ab78ed01fa2e3c652b1d32d949263a51c1aea99f428bcbf24257f8e71
MD5 601ff7b2989fe02789c97639ef2c59a7
BLAKE2b-256 1451c6ba7340778b4ce25398b31e294904ce866dfb2ae090b1e311dd0bf771f1

See more details on using hashes here.

File details

Details for the file uncompyle6-2.13.0-py2.4.egg.

File metadata

File hashes

Hashes for uncompyle6-2.13.0-py2.4.egg
Algorithm Hash digest
SHA256 8871ef391832e323f3013e5b104a8ced371b9fcbd7826e03a97d65ed7ae10ff3
MD5 734dafacc390943ce3e376eff95b3731
BLAKE2b-256 da049e4ee97420f7aa73194c0e050e0682954c0b9395d36eac38cd688f40f874

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page