Skip to main content

GDB-like Python Debugger in the Trepan family

Project description

CircleCI Pypi Installs License Documentation Status Supported Python Versions

packagestatus

Abstract

This is a gdb-like debugger for Python. It is a rewrite of pdb from the ground up. I was disappointed with the flakiness, imprecision, and poor quality of coding, modularity, and level of documentation when I first looked at pdb. (pdb has gotten better since then. But a full and complete debugger, is way more complex than what you’d expect from a Standard Python module; it requires a larger set of supporting packages too than is found in the Standard Python library).

trepan3k is both a high-level debugger as well as a lower-level bytecode debugger inspector. The code understands a lot about byte code and the Python code object. The debugger makes use of this knowledge to get more precise and accurate results and provide more reliable operations.

A command-line interface (CLI) is provided as well as remote access interface over TCP/IP.

See the entry-exit for the various ways you can enter the debugger.

This code supports versions of Python back to version 3.0 using different git branches. See trepan2 for the same code modified to work with Python 2.

Features

Since this debugger is similar to other trepanning debuggers and gdb in general, knowledge gained by learning this is transferable to those debuggers and vice versa.

There’s a lot of cool stuff here that’s not in the stock Python debugger pdb, or any other Python debugger that I know about.

More Exact location information

Python reports line information on the granularity of a line. For Python versions up to 3.8, To get more precise information, we can (de)parse into Python the byte code around a bytecode offset such as the place you are stopped at.

So far as I know, there is no other debugger that decompiles code at runtime to narrow position down to the specific bytecode instruction.

See the deparse command for details on getting this kind of information.

The problem with deparsing after 3.8 is that there is no decompiler that can deparse code and give associations to bytecode instructions. I am slowly working on that though.

We use information in Python’s code object line number table in byte to understand which lines are breakpointable, and in which module or function the line appears in. Use info-line to see this information. Most if not all other debuggers do go to such lengths, and as a result, it is possible to request stopping on a line number that can never occur without complaint.

In the future, we may allow specifying an offset to indicate which offset to stop at when there are several choices for a given line number.

Debugging Python bytecode (no source available)

You can pass the debugger the name of Python bytecode and many times, the debugger will merrily proceed. This debugger tries very hard to find the source code. Either by using the current executable search path (e.g. PATH) or for some by looking inside the bytecode for a filename in the main code object (co_filename) and applying that with a search path that takes into account the directory where the bytecode lives.

Failing to find source code this way, and in other situations where source code can’t be found, the debugger will decompile the bytecode and use that for showing the source text. This allows us to debug ``eval``’d or ``exec``’d code.

But if you happen to know where the source code is located, you can associate a file source code with the current name listed in the bytecode. See the set-substitute command for details here.

Source-code Syntax Colorization

Terminal source code is colorized via pygments. And with that, you can set the pygments color style, e.g. “colorful”, “paraiso-dark”. See set-style . Furthermore, we make use of terminal bold and emphasized text in debugger output and help text. Of course, you can also turn this off. You can use your own pygments_style, provided you have a terminal that supports 256 colors. If your terminal supports the basic ANSI color sequences only, we support that too in both dark and light themes.

Command Completion

Command completion is available for GNU readline and prompt_toolkit. While prompt_toolkit is new, command completion for GNU Readline is not just a simple static list but varies depending on the context. For example, for frame-changing commands that take optional numbers, the list of valid numbers is considered.

In time (and perhaps with some volunteers), prompt_toolkit completion will be as good as GNU Readline completion.

Terminal Handling

We can adjust debugger output depending on the line width of your terminal. If it changes, or you want to adjust it, see set-width.

Signal Handling

Following gdb, we provide its rich set of signal handling. From the gdb documentation:

GDB has the ability to detect any occurrence of a signal in your program. You can tell GDB in advance what to do for each kind of signal.

Better Support for Thread Debugging

When you are stopped inside a thread, the thread name is shown to make this fact more clear and you can see and switch between frames in different threads. See frame for more information.

And following gdb, you can list the threads too. See info-threads for more information.

Smart Eval

If you want to evaluate the current source line before it is run in the code, use eval. To evaluate the text of a common fragment of a line, such as the expression part of an if statement, you can do that with eval?. See eval for more information.

Function Breakpoints

Many Python debuggers only allow setting a breakpoint at a line event and functions are treated like line numbers. But functions and lines are fundamentally different. If I write:

def five(): return 5

this line contains three different kinds of things. First, there is the code in Python that defines the function five() for the first time. Then there is the function itself, and then there is some code inside that function.

In this debugger, you can give the name of a function by surrounding adding () at the end:

break five()

Also five could be a method of an object that is currently defined when the breakpoint command is given:

self.five()

More Stepping Control

Sometimes you want small steps, and sometimes large steps.

This fundamental issue is handled in a couple of ways:

Step Granularity

There are now step event and next event commands with aliases to s+, s>, and so on. The plus-suffixed commands force a different line on a subsequent stop, the dash-suffixed commands don’t. Suffixes >, <, and ! specify call, return and exception events respectively. And without a suffix, you get the default; this is set by the set different command.

Event Filtering and Tracing

By default, the debugger stops at every event: call, return, line, exception, c-call, c-exception. If you just want to stop at line events (which is largely what happens in pdb) you can. If however you just want to stop at calls and returns, that’s possible too. Or pick some combination.

In conjunction with handling all events by default, the event status is shown when stopped. The reason for stopping is also available via info program.

Event Tracing of Calls and Returns

I’m not sure why this was not done before. Probably because of the lack of the ability to set and move by different granularities, tracing calls and returns leads to too many uninteresting stops (such as at the same place you just were at). Also, stopping on function definitions probably also added to this tedium.

Because we’re really handling return events, we can stop on the return. This is a little more precise than pdb’s retval command.

Debugger Macros via Python Lambda expressions

There are debugger macros. In gdb, there is a macro debugger command to extend debugger commands.

However, Python has its own rich programming language so it seems silly to recreate the macro language that is in gdb. Simpler and more powerful is just to use Python here. A debugger macro here is just a lambda expression that returns a string or a list of strings. Each string returned should be a debugger command.

We also have aliases for the extremely simple situation where you want to give an alias to an existing debugger command. But beware: Some commands, like step inspect command suffixes and change their behavior accordingly.

We also provide extending the debugger either through additional Python packages.

Byte-code Instruction Introspection

We do more in the way of looking at the byte codes to give better information. Through this, we can provide:

  • a skip command. It is like the jump command, but you don’t have to deal with line numbers.

  • disassembly of code fragments. You can now disassemble relative to the stack frames you are currently stopped at.

  • Better interpretation of where you are when inside execfile or exec. (But really though this is probably a Python compiler misfeature.)

  • Check that breakpoints are set only where they make sense.

  • A more accurate determination of if you are at a function-defining def or class statements (because the caller’s instruction contains MAKE_FUNCTION or BUILD_CLASS.)

Even without “deparsing” mentioned above, the ability to disassemble where the PC is currently located (see info-pc), by line number range or byte-offset range lets you tell exactly where you are and code is getting run.

Some Debugger Command Arguments can be Variables and Expressions

Commands that take integer arguments like up, list, or disassemble allow you to use a Python expression which may include local or global variables that evaluate to an integer. This eliminates the need in gdb for special “dollar” debugger variables. (Note however because of shlex parsing, expressions can’t have embedded blanks.)

Out-of-Process Debugging

You can now debug your program in a different process or even a different computer on a different network!

Related, is flexible support for remapping path names from the file system, e.g. the filesystem seen inside a docker container or on a remote filesystem with locally-installed files. See subst for more information.

Egg, Wheel, and Tarballs

Can be installed via the usual pip or easy_install. There is a source tarball. How To Install has full instructions and installation using git or by other means.

Modularity

Because this debugger is modular, I have been able to use it as the basis for debuggers in other projects. In particular, it is used as a module in trepanxpy, a debugger for Python interpreter, x-python, written in Python.

It is also used as a module inside an experimental open-source Wolfram Mathematica interpreter, Mathics3.

Using pytracer, the Debugger plays nice with other trace hooks. You can have several debugger objects.

Many of the things listed below do not directly impact end-users, but it does eventually by way of more robust and featureful code. And keeping developers happy is a good thing.(TM)

  • Commands and subcommands are individual classes now, not methods in a class. This means they now have properties like the context in which they can be run, minimum abbreviation names, or alias names. To add a new command you basically add a file in a directory.

  • I/O is its own layer. This simplifies interactive readline behavior from reading commands over a TCP socket.

  • An interface is its own layer. Local debugging, remote debugging, and running debugger commands from a file (source) are different interfaces. This means, for example, that we are able to give better error reporting if a debugger command file has an error.

  • There is an experimental Python-friendly interface for front-ends

  • more testable. Much more unit and functional tests.

Documentation

Documentation: http://python3-trepan.readthedocs.org

See Also

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

trepan3k-1.3.0.tar.gz (421.1 kB view details)

Uploaded Source

Built Distributions

trepan3k-1.3.0-py3-none-any.whl (579.9 kB view details)

Uploaded Python 3

trepan3k-1.3.0-310-none-any.whl (379.4 kB view details)

Uploaded

trepan3k-1.3.0-39-none-any.whl (379.4 kB view details)

Uploaded

trepan3k-1.3.0-38-none-any.whl (379.4 kB view details)

Uploaded

trepan3k-1.3.0-37-none-any.whl (379.4 kB view details)

Uploaded

trepan3k-1.3.0-36-none-any.whl (379.4 kB view details)

Uploaded

trepan3k-1.3.0-35-none-any.whl (409.3 kB view details)

Uploaded

File details

Details for the file trepan3k-1.3.0.tar.gz.

File metadata

  • Download URL: trepan3k-1.3.0.tar.gz
  • Upload date:
  • Size: 421.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for trepan3k-1.3.0.tar.gz
Algorithm Hash digest
SHA256 e4e7ea48ac7a56b163e7f8e8378d1f1c995ed50d63b81235a57ec77fd31fea7b
MD5 60d6fd596d6051c0248b27219d217ffe
BLAKE2b-256 a49ee7018176cec5b3994c180b30665774c5fec2d560d30b8633de3f54a5a69b

See more details on using hashes here.

File details

Details for the file trepan3k-1.3.0-py3-none-any.whl.

File metadata

  • Download URL: trepan3k-1.3.0-py3-none-any.whl
  • Upload date:
  • Size: 579.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for trepan3k-1.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c03678ea699af5954a71ffb8d978244684f8ada1219e2bf2e42365fcc9c56fd9
MD5 c2e0785402231f8fc939de1f078020cb
BLAKE2b-256 73601651c4226a51da289490ae6177605972a9b62575ae8a4c59635e796049cd

See more details on using hashes here.

File details

Details for the file trepan3k-1.3.0-310-none-any.whl.

File metadata

  • Download URL: trepan3k-1.3.0-310-none-any.whl
  • Upload date:
  • Size: 379.4 kB
  • Tags:
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for trepan3k-1.3.0-310-none-any.whl
Algorithm Hash digest
SHA256 01849d678284b6c1ae7e1fae34991ebb0e96999490a5577314058d29823eb9f7
MD5 653ef15c9647f6ea58af86372207621b
BLAKE2b-256 c9bdd3418488aeb8e36a29f51d237abd2d1e689a1e47b6492181d76625f905f7

See more details on using hashes here.

File details

Details for the file trepan3k-1.3.0-39-none-any.whl.

File metadata

  • Download URL: trepan3k-1.3.0-39-none-any.whl
  • Upload date:
  • Size: 379.4 kB
  • Tags:
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for trepan3k-1.3.0-39-none-any.whl
Algorithm Hash digest
SHA256 39a4705faeffb4fb692a2e191507ce2a52f855a1a63cdc37cad3166f50a9b5f7
MD5 97f29955bef16ac20232eadf633e032e
BLAKE2b-256 80062732f5c7212040aee146a7c32c6ca50f500f2c33da278afe9937d9854f9c

See more details on using hashes here.

File details

Details for the file trepan3k-1.3.0-38-none-any.whl.

File metadata

  • Download URL: trepan3k-1.3.0-38-none-any.whl
  • Upload date:
  • Size: 379.4 kB
  • Tags:
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for trepan3k-1.3.0-38-none-any.whl
Algorithm Hash digest
SHA256 11327d37d0008f50066210a2d2ead0b81e70969285f27144749356f4bf5cca0f
MD5 bdef4c55b41af373658c1a731c3995a8
BLAKE2b-256 4511a9bd38e5b3c076747ff9ed831fee0402c4ba023565d4440d5a31c39f63ea

See more details on using hashes here.

File details

Details for the file trepan3k-1.3.0-37-none-any.whl.

File metadata

  • Download URL: trepan3k-1.3.0-37-none-any.whl
  • Upload date:
  • Size: 379.4 kB
  • Tags:
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for trepan3k-1.3.0-37-none-any.whl
Algorithm Hash digest
SHA256 be6893ab890b479861b2307210857d28fb12f22b5826838e8f7bae3e48a11e75
MD5 0a331f5700b09e8ee4dcf76afbda4071
BLAKE2b-256 7af7b8d14cb6c24817eb6788b362e64e950a980d78fe2e5cbb3cdc299dcfeef0

See more details on using hashes here.

File details

Details for the file trepan3k-1.3.0-36-none-any.whl.

File metadata

  • Download URL: trepan3k-1.3.0-36-none-any.whl
  • Upload date:
  • Size: 379.4 kB
  • Tags:
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for trepan3k-1.3.0-36-none-any.whl
Algorithm Hash digest
SHA256 68bd68cec6552b6e097f174b0b27cd9b66360f2643a64856741f26519708783e
MD5 2a5762aafa6f241759744d40d9c50cee
BLAKE2b-256 ef38f13f22647d5021b06ad38e0fa612cd3a5d2e387dfcad4e80ba0018c0794d

See more details on using hashes here.

File details

Details for the file trepan3k-1.3.0-35-none-any.whl.

File metadata

  • Download URL: trepan3k-1.3.0-35-none-any.whl
  • Upload date:
  • Size: 409.3 kB
  • Tags:
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for trepan3k-1.3.0-35-none-any.whl
Algorithm Hash digest
SHA256 170b86ec7f8ea7da2f91a6fe0dc99e27b671ad56b8e4efd52cbc564de889ccde
MD5 3504fde8385f5f707660d5427bf2b8c8
BLAKE2b-256 7aa07f30fd112c6ff10cd02d4ec995586ccb674e910723813469f18c01363bf0

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page