Skip to main content

Disassemble ARM and Thumb files like RISC OS

Project description

RISC OS disassembly in Python

Introduction

This repository contains the module used to disassemble ARM and Thumb code for RISC OS Pyromaniac. It uses the Capstone library to do so, but modifies the output so that the disassembly is in a form which is generally seen by RISC OS users.

Installation

The tool for disassembly can be installed manually using this repository (see 'Usage' below) or through PyPI. To install, use:

pip3 install riscos_disassemble

Usage

Once installed, the tool can be invoked as riscos-dumpi. For example:

riscos-dumpi examples/hello_world,ffc

The following switches are supported:

  • --thumb: Disassembles Thumb code, rather than ARM code
  • --colour: Uses the original primary and secondary ANSI colours.
  • --colour-8bit: Use the more featureful 8bit colour palette.
  • --debuggerplus: Specify a list of modifications to the debugger output, which originated in DebuggerPlus.
  • --help-debuggerplus: More information on the flags.
  • --function-map: List the addresses for functions with signatures
  • --match: Only disassemble functions with the given siganture, which may use wildcards.

The filename can be followed by a hexadecimal address to use as the base address for the file's disassembly. By default the file will be based at address 0 unless the file is suffixed by ,ff8 (an absolute file).

DebuggerPlus

DebuggerPlus is a module by Darren Salt, which provides more options for disassembly than the standard Debugger module provided by RISC OS. These options were indicated by flags which could be set at the command line. The riscos-dumpi tool supports a subset of those flags.

Supported flags:

  • ANDEQasDCD: Use DCD instead of ANDEQ, MOV Rn,Rn (same register) etc.
  • APCS: Use APCS-R register set.
  • Lower: Force all register names to lower case.
  • QuoteSWIs: Put quotes around SWI names.
  • UseDCD: Use DCD instead of 'Undefined instruction', and BRK where DCD &x6000010 would be used.
  • UseNOP: Use NOP instead of MOV R0,R0.
  • UseVDU: Use VDU x instead of SWI OS_WriteI+x.

The flags may be specified on the command line with --debuggerplus=<flags> repeated, or as a comma-separated list. The flags may also be configured using the environment variable RISCOS_DUMPI_DEBUGGERPLUS.

Flags prefixed by a - character will be disabled. This is useful to disable the configuration in the environment variable, or those default options. Lower is currently selected automatically by default.

Use as a git text converter

If you have RISC OS binaries (or other ARM binaries) checked in to your Git repository, it can be useful to be able to display and diff them. This can be achieved by giving the binary files an attribute to indicate that they should be converted to text first.

To mark the binary files, create a .gitattributes (or update your existing file) with the attribute diff=armbinary. armbinary a label I have chosen but you could use any name you preferred. For example:

examples/*,* diff=armbinary

This indicates that any file containing a comma (,) within the examples directory is to be converted through the armbinary conversion.

To tell Git how to perform the conversion it is necessary to update the .gitconfig file to know how to perform the conversion.

git config --global diff.armbinary.textconv "riscos-dumpi"

This indicates that when performing a diff (including log -p and others), any files with the diff attribute armbinary will be passed to the riscos-dumpi tool. You may prefer to use the colourised version of the disassembly, although this will make it more difficult to see where changes are in diffs:

git config --global diff.armbinary.textconv "riscos-dumpi --colour-8bit"

For more information on the Git attributes, see: https://git-scm.com/docs/gitattributes#_performing_text_diffs_of_binary_files

Requirements

The disassembly code requires the Capstone library to be installed. Capstone 4 is currently recommended, although later versions may function as well.

pip install 'capstone<5'

Reuse

To use this in another project you would usually need to import the module and subclass the Disassemble class to add functions necessary to access memory and registers, and to decode SWI calls. However, if you do not care about these things, the simplest usage can be found in the example script simple_disassemble.py.

The riscos-dumpi tool is provided in the package and can be invoked with:

riscos-dumpi <options>

Or the tool can be manually invoked with:

python -m riscos_disassemble <options>

This tool has more advanced support for the options than the simple example.

Examples

Example files are supplied in the examples directory to demonstrate the disassembly:

  • hello_world utility file (suffixed by ,ffc) is a test from the RISC OS Pyromaniac project, which verifies the behaviour of the SWI OS_Write0.
  • osbyte81_version utility file is another test program, which checks the behaviour of OS_Byte &81 when reading the operating system version.
  • kerneldebug utility file tests that the input and output in the kernel works, and has function signatures that are recognised.

Disassembling the example hello_world example is simple:

charles@laputa ~/riscos-disassemble-python $ riscos-dumpi examples/hello_world,ffc
00000000 : e28f001c : .... : ADR     r0, &00000024             ; -> "Hello world"
00000004 : ef000002 : .... : SWI     OS_Write0
00000008 : e28f1020 :  ... : ADR     r1, &00000030             ; -> [&00000001, &6f203052, &6572206e, &6e727574]
0000000c : e1500001 : ..P. : CMP     r0, r1
00000010 : 1a000001 : .... : BNE     &0000001c
00000014 : ef000003 : .... : SWI     OS_NewLine
00000018 : e1a0f00e : .... : MOV     pc, lr
0000001c : e28f000c : .... : ADR     r0, &00000030             ; -> [&00000001, &6f203052, &6572206e, &6e727574]
00000020 : ef00002b : +... : SWI     OS_GenerateError
00000024 : 6c6c6548 : Hell : STCLVS  p5, c6, [r12], #-&120
00000028 : 6f77206f : o wo : SWIVS   &77206f
0000002c : 00646c72 : rld. : RSBEQ   r6, r4, r2, ROR r12
00000030 : 00000001 : .... : ANDEQ   r0, r0, r1
00000034 : 6f203052 : R0 o : SWIVS   &203052
00000038 : 6572206e : n re : LDRBVS  r2, [r2, #-&6e]!
0000003c : 6e727574 : turn : MRCVS   p5, #3, r7, c2, c4, #3
00000040 : 6f726620 :  fro : SWIVS   &726620
00000044 : 534f206d : m OS : MOVTPL  r2, #&f06d                ; #61549
00000048 : 6972575f : _Wri : LDMDBVS r2!, {r0, r1, r2, r3, r4, r6, r8, r9, r10, r12, lr} ^
0000004c : 20306574 : te0  : EORSHS  r6, r0, r4, ROR r5
00000050 : 20736177 : was  : RSBSHS  r6, r3, r7, ROR r1
00000054 : 20746f6e : not  : RSBSHS  r6, r4, lr, ROR #30
00000058 : 72726f63 : corr : RSBSVC  r6, r2, #&18c             ; #396
0000005c : 6c746365 : ectl : LDCLVS  p3, c6, [r4], #-&194
00000060 : 65732079 : y se : LDRBVS  r2, [r3, #-&79]!
00000064 : 6f742074 : t to : SWIVS   &742074
00000068 : 65687420 :  the : STRBVS  r7, [r8, #-&420]!
0000006c : 72657420 :  ter : RSBVC   r7, r5, #32, #8           ; #536870912 = bit 29
00000070 : 616e696d : mina : Undefined instruction
00000074 : 00726f74 : tor. : RSBSEQ  r6, r2, r4, ROR pc

The bulk of the disassembly is textual data, but this is still decoded as if it is executable.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

riscos-disassemble-0.5.0.tar.gz (37.0 kB view hashes)

Uploaded Source

Built Distributions

riscos_disassemble-0.5.0-py3-none-any.whl (39.6 kB view hashes)

Uploaded Python 3

riscos_disassemble-0.5.0-py2-none-any.whl (39.6 kB view hashes)

Uploaded Python 2

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page