Skip to main content

binary diff and patch using the BSDIFFHS-format

Project description

> :warning: bsdiffhs is a derivative work from bsdiff4: bsdiffhs is similar to bsdiff4 (https://github.com/ilanschnell/bsdiff4) but distinct in its approach to use heatshrink compression algorithm instead of gzip2 format.

The code is mostly derived from cx_bsdiff (written by Anthony Tuininga, http://cx-bsdiff.sourceforge.net/). The cx_bsdiff code in turn was derived from bsdiff, the standalone utility produced for BSD which can be found at http://www.daemonology.net/bsdiff. In addition to the two functions (diff and patch) cx_bsdiff provides, this package includes:

  • an interface to the BSDIFFHS-format

  • command line interfaces: bsdiffhs and bspatchhs

  • tests

The bsdiffhs package defines the following high level functions:

diff(src_bytes, dst_bytes, window_sz2=10, lookahead_sz2=4) -> bytes

Return a BSDIFFHS-format patch (from src_bytes to dst_bytes) as bytes.

patch(src_bytes, patch_bytes, window_sz2=10, lookahead_sz2=4) -> bytes

Apply the BSDIFFHS-format patch_bytes to src_bytes and return the bytes.

file_diff(src_path, dst_path, patch_path, window_sz2=10, lookahead_sz2=4)

Write a BSDIFFHS-format patch (from the file src_path to the file dst_path) to the file patch_path.

file_patch(src_path, dst_path, patch_path, window_sz2=10, lookahead_sz2=4)

Apply the BSDIFFHS-format file patch_path to the file src_path and write the result to the file dst_path.

file_patch_inplace(path, patch_path, window_sz2=10, lookahead_sz2=4)

Apply the BSDIFFHS-format file patch_path to the file path in place.

Compression configuration:

According to official heatshrink documentation: heatshrink has a couple configuration options, which impact its resource usage and how effectively it can compress data. These are set when dynamically allocating an encoder or decoder, or in heatshrink_config.h if they are statically allocated.

  • window_sz2, Set the window size to 2^W bytes.

The window size determines how far back in the input can be searched for repeated patterns. A window_sz2 of 8 will only use 256 bytes (2^8), while a window_sz2 of 10 will use 1024 bytes (2^10). The latter uses more memory, but may also compress more effectively by detecting more repetition.

The window_sz2 setting currently must be between 4 and 15.

  • lookahead_sz2, Set the lookahead size to 2^L bytes.

The lookahead size determines the max length for repeated patterns that are found. If the lookahead_sz2 is 4, a 50-byte run of ‘a’ characters will be represented as several repeated 16-byte patterns (2^4 is 16), whereas a larger lookahead_sz2 may be able to represent it all at once. The number of bits used for the lookahead size is fixed, so an overly large lookahead size can reduce compression by adding unused size bits to small patterns.

The lookahead_sz2 setting currently must be between 3 and the window_sz2 - 1.

Compression recommandation:

Still based on official heatshrink documentation : for embedded/low memory contexts, a `window_sz2 in the 8 to 10 range is probably a good default, depending on how tight memory is. Smaller or larger window sizes may make better trade-offs in specific circumstances, but should be checked with representative data.

The lookahead_sz2 should probably start near the window_sz2/2, e.g. -w 8 -l 4 or -w 10 -l 5. The command-line program can be used to measure how well test data works with different settings.`

Example:

>>> import bsdiffhs
>>> a = 100000 * b'a'
>>> b = bytearray(a)
>>> b[100:106] = b' diff '
>>> p = bsdiffhs.diff(a, bytes(b), 11, 5)
>>> len(p)
154
>>> bsdiffhs.patch(a, p, 11, 5) == b
True

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bsdiffhs-0.3.0.tar.gz (15.7 kB view hashes)

Uploaded Source

Built Distributions

bsdiffhs-0.3.0-cp311-cp311-win_amd64.whl (20.6 kB view hashes)

Uploaded CPython 3.11 Windows x86-64

bsdiffhs-0.3.0-cp311-cp311-win32.whl (19.5 kB view hashes)

Uploaded CPython 3.11 Windows x86

bsdiffhs-0.3.0-cp311-cp311-musllinux_1_1_x86_64.whl (38.5 kB view hashes)

Uploaded CPython 3.11 musllinux: musl 1.1+ x86-64

bsdiffhs-0.3.0-cp311-cp311-musllinux_1_1_s390x.whl (38.8 kB view hashes)

Uploaded CPython 3.11 musllinux: musl 1.1+ s390x

bsdiffhs-0.3.0-cp311-cp311-musllinux_1_1_ppc64le.whl (40.9 kB view hashes)

Uploaded CPython 3.11 musllinux: musl 1.1+ ppc64le

bsdiffhs-0.3.0-cp311-cp311-musllinux_1_1_i686.whl (40.9 kB view hashes)

Uploaded CPython 3.11 musllinux: musl 1.1+ i686

bsdiffhs-0.3.0-cp311-cp311-musllinux_1_1_aarch64.whl (38.7 kB view hashes)

Uploaded CPython 3.11 musllinux: musl 1.1+ ARM64

bsdiffhs-0.3.0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (35.0 kB view hashes)

Uploaded CPython 3.11 manylinux: glibc 2.17+ x86-64

bsdiffhs-0.3.0-cp311-cp311-manylinux_2_17_s390x.manylinux2014_s390x.whl (35.5 kB view hashes)

Uploaded CPython 3.11 manylinux: glibc 2.17+ s390x

bsdiffhs-0.3.0-cp311-cp311-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl (37.4 kB view hashes)

Uploaded CPython 3.11 manylinux: glibc 2.17+ ppc64le

bsdiffhs-0.3.0-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (35.5 kB view hashes)

Uploaded CPython 3.11 manylinux: glibc 2.17+ ARM64

bsdiffhs-0.3.0-cp311-cp311-manylinux_2_5_i686.manylinux1_i686.manylinux_2_17_i686.manylinux2014_i686.whl (37.6 kB view hashes)

Uploaded CPython 3.11 manylinux: glibc 2.17+ i686 manylinux: glibc 2.5+ i686

bsdiffhs-0.3.0-cp311-cp311-macosx_11_0_arm64.whl (18.0 kB view hashes)

Uploaded CPython 3.11 macOS 11.0+ ARM64

bsdiffhs-0.3.0-cp311-cp311-macosx_10_9_x86_64.whl (18.1 kB view hashes)

Uploaded CPython 3.11 macOS 10.9+ x86-64

bsdiffhs-0.3.0-cp311-cp311-macosx_10_9_universal2.whl (24.9 kB view hashes)

Uploaded CPython 3.11 macOS 10.9+ universal2 (ARM64, x86-64)

bsdiffhs-0.3.0-cp310-cp310-win_amd64.whl (20.6 kB view hashes)

Uploaded CPython 3.10 Windows x86-64

bsdiffhs-0.3.0-cp310-cp310-win32.whl (19.5 kB view hashes)

Uploaded CPython 3.10 Windows x86

bsdiffhs-0.3.0-cp310-cp310-musllinux_1_1_x86_64.whl (36.9 kB view hashes)

Uploaded CPython 3.10 musllinux: musl 1.1+ x86-64

bsdiffhs-0.3.0-cp310-cp310-musllinux_1_1_s390x.whl (37.2 kB view hashes)

Uploaded CPython 3.10 musllinux: musl 1.1+ s390x

bsdiffhs-0.3.0-cp310-cp310-musllinux_1_1_ppc64le.whl (39.1 kB view hashes)

Uploaded CPython 3.10 musllinux: musl 1.1+ ppc64le

bsdiffhs-0.3.0-cp310-cp310-musllinux_1_1_i686.whl (39.2 kB view hashes)

Uploaded CPython 3.10 musllinux: musl 1.1+ i686

bsdiffhs-0.3.0-cp310-cp310-musllinux_1_1_aarch64.whl (37.0 kB view hashes)

Uploaded CPython 3.10 musllinux: musl 1.1+ ARM64

bsdiffhs-0.3.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (33.2 kB view hashes)

Uploaded CPython 3.10 manylinux: glibc 2.17+ x86-64

bsdiffhs-0.3.0-cp310-cp310-manylinux_2_17_s390x.manylinux2014_s390x.whl (33.7 kB view hashes)

Uploaded CPython 3.10 manylinux: glibc 2.17+ s390x

bsdiffhs-0.3.0-cp310-cp310-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl (35.8 kB view hashes)

Uploaded CPython 3.10 manylinux: glibc 2.17+ ppc64le

bsdiffhs-0.3.0-cp310-cp310-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (33.7 kB view hashes)

Uploaded CPython 3.10 manylinux: glibc 2.17+ ARM64

bsdiffhs-0.3.0-cp310-cp310-manylinux_2_5_i686.manylinux1_i686.manylinux_2_17_i686.manylinux2014_i686.whl (36.0 kB view hashes)

Uploaded CPython 3.10 manylinux: glibc 2.17+ i686 manylinux: glibc 2.5+ i686

bsdiffhs-0.3.0-cp310-cp310-macosx_11_0_arm64.whl (18.0 kB view hashes)

Uploaded CPython 3.10 macOS 11.0+ ARM64

bsdiffhs-0.3.0-cp310-cp310-macosx_10_9_x86_64.whl (18.0 kB view hashes)

Uploaded CPython 3.10 macOS 10.9+ x86-64

bsdiffhs-0.3.0-cp310-cp310-macosx_10_9_universal2.whl (24.8 kB view hashes)

Uploaded CPython 3.10 macOS 10.9+ universal2 (ARM64, x86-64)

bsdiffhs-0.3.0-cp39-cp39-win_amd64.whl (20.6 kB view hashes)

Uploaded CPython 3.9 Windows x86-64

bsdiffhs-0.3.0-cp39-cp39-win32.whl (19.5 kB view hashes)

Uploaded CPython 3.9 Windows x86

bsdiffhs-0.3.0-cp39-cp39-musllinux_1_1_x86_64.whl (36.8 kB view hashes)

Uploaded CPython 3.9 musllinux: musl 1.1+ x86-64

bsdiffhs-0.3.0-cp39-cp39-musllinux_1_1_s390x.whl (37.0 kB view hashes)

Uploaded CPython 3.9 musllinux: musl 1.1+ s390x

bsdiffhs-0.3.0-cp39-cp39-musllinux_1_1_ppc64le.whl (38.9 kB view hashes)

Uploaded CPython 3.9 musllinux: musl 1.1+ ppc64le

bsdiffhs-0.3.0-cp39-cp39-musllinux_1_1_i686.whl (39.0 kB view hashes)

Uploaded CPython 3.9 musllinux: musl 1.1+ i686

bsdiffhs-0.3.0-cp39-cp39-musllinux_1_1_aarch64.whl (36.9 kB view hashes)

Uploaded CPython 3.9 musllinux: musl 1.1+ ARM64

bsdiffhs-0.3.0-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (33.0 kB view hashes)

Uploaded CPython 3.9 manylinux: glibc 2.17+ x86-64

bsdiffhs-0.3.0-cp39-cp39-manylinux_2_17_s390x.manylinux2014_s390x.whl (33.5 kB view hashes)

Uploaded CPython 3.9 manylinux: glibc 2.17+ s390x

bsdiffhs-0.3.0-cp39-cp39-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl (35.6 kB view hashes)

Uploaded CPython 3.9 manylinux: glibc 2.17+ ppc64le

bsdiffhs-0.3.0-cp39-cp39-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (33.5 kB view hashes)

Uploaded CPython 3.9 manylinux: glibc 2.17+ ARM64

bsdiffhs-0.3.0-cp39-cp39-manylinux_2_5_i686.manylinux1_i686.manylinux_2_17_i686.manylinux2014_i686.whl (35.8 kB view hashes)

Uploaded CPython 3.9 manylinux: glibc 2.17+ i686 manylinux: glibc 2.5+ i686

bsdiffhs-0.3.0-cp39-cp39-macosx_11_0_arm64.whl (18.0 kB view hashes)

Uploaded CPython 3.9 macOS 11.0+ ARM64

bsdiffhs-0.3.0-cp39-cp39-macosx_10_9_x86_64.whl (18.0 kB view hashes)

Uploaded CPython 3.9 macOS 10.9+ x86-64

bsdiffhs-0.3.0-cp39-cp39-macosx_10_9_universal2.whl (24.8 kB view hashes)

Uploaded CPython 3.9 macOS 10.9+ universal2 (ARM64, x86-64)

bsdiffhs-0.3.0-cp38-cp38-win_amd64.whl (20.6 kB view hashes)

Uploaded CPython 3.8 Windows x86-64

bsdiffhs-0.3.0-cp38-cp38-win32.whl (19.4 kB view hashes)

Uploaded CPython 3.8 Windows x86

bsdiffhs-0.3.0-cp38-cp38-musllinux_1_1_x86_64.whl (37.1 kB view hashes)

Uploaded CPython 3.8 musllinux: musl 1.1+ x86-64

bsdiffhs-0.3.0-cp38-cp38-musllinux_1_1_s390x.whl (37.4 kB view hashes)

Uploaded CPython 3.8 musllinux: musl 1.1+ s390x

bsdiffhs-0.3.0-cp38-cp38-musllinux_1_1_ppc64le.whl (39.2 kB view hashes)

Uploaded CPython 3.8 musllinux: musl 1.1+ ppc64le

bsdiffhs-0.3.0-cp38-cp38-musllinux_1_1_i686.whl (39.3 kB view hashes)

Uploaded CPython 3.8 musllinux: musl 1.1+ i686

bsdiffhs-0.3.0-cp38-cp38-musllinux_1_1_aarch64.whl (37.2 kB view hashes)

Uploaded CPython 3.8 musllinux: musl 1.1+ ARM64

bsdiffhs-0.3.0-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (33.7 kB view hashes)

Uploaded CPython 3.8 manylinux: glibc 2.17+ x86-64

bsdiffhs-0.3.0-cp38-cp38-manylinux_2_17_s390x.manylinux2014_s390x.whl (34.1 kB view hashes)

Uploaded CPython 3.8 manylinux: glibc 2.17+ s390x

bsdiffhs-0.3.0-cp38-cp38-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl (36.2 kB view hashes)

Uploaded CPython 3.8 manylinux: glibc 2.17+ ppc64le

bsdiffhs-0.3.0-cp38-cp38-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (34.2 kB view hashes)

Uploaded CPython 3.8 manylinux: glibc 2.17+ ARM64

bsdiffhs-0.3.0-cp38-cp38-manylinux_2_5_i686.manylinux1_i686.manylinux_2_17_i686.manylinux2014_i686.whl (36.4 kB view hashes)

Uploaded CPython 3.8 manylinux: glibc 2.17+ i686 manylinux: glibc 2.5+ i686

bsdiffhs-0.3.0-cp38-cp38-macosx_11_0_arm64.whl (18.0 kB view hashes)

Uploaded CPython 3.8 macOS 11.0+ ARM64

bsdiffhs-0.3.0-cp38-cp38-macosx_10_9_x86_64.whl (18.0 kB view hashes)

Uploaded CPython 3.8 macOS 10.9+ x86-64

bsdiffhs-0.3.0-cp38-cp38-macosx_10_9_universal2.whl (24.8 kB view hashes)

Uploaded CPython 3.8 macOS 10.9+ universal2 (ARM64, x86-64)

bsdiffhs-0.3.0-cp37-cp37m-win_amd64.whl (20.6 kB view hashes)

Uploaded CPython 3.7m Windows x86-64

bsdiffhs-0.3.0-cp37-cp37m-win32.whl (19.4 kB view hashes)

Uploaded CPython 3.7m Windows x86

bsdiffhs-0.3.0-cp37-cp37m-musllinux_1_1_x86_64.whl (37.3 kB view hashes)

Uploaded CPython 3.7m musllinux: musl 1.1+ x86-64

bsdiffhs-0.3.0-cp37-cp37m-musllinux_1_1_s390x.whl (37.5 kB view hashes)

Uploaded CPython 3.7m musllinux: musl 1.1+ s390x

bsdiffhs-0.3.0-cp37-cp37m-musllinux_1_1_ppc64le.whl (39.4 kB view hashes)

Uploaded CPython 3.7m musllinux: musl 1.1+ ppc64le

bsdiffhs-0.3.0-cp37-cp37m-musllinux_1_1_i686.whl (39.6 kB view hashes)

Uploaded CPython 3.7m musllinux: musl 1.1+ i686

bsdiffhs-0.3.0-cp37-cp37m-musllinux_1_1_aarch64.whl (37.3 kB view hashes)

Uploaded CPython 3.7m musllinux: musl 1.1+ ARM64

bsdiffhs-0.3.0-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (34.1 kB view hashes)

Uploaded CPython 3.7m manylinux: glibc 2.17+ x86-64

bsdiffhs-0.3.0-cp37-cp37m-manylinux_2_17_s390x.manylinux2014_s390x.whl (34.7 kB view hashes)

Uploaded CPython 3.7m manylinux: glibc 2.17+ s390x

bsdiffhs-0.3.0-cp37-cp37m-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl (36.6 kB view hashes)

Uploaded CPython 3.7m manylinux: glibc 2.17+ ppc64le

bsdiffhs-0.3.0-cp37-cp37m-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (34.7 kB view hashes)

Uploaded CPython 3.7m manylinux: glibc 2.17+ ARM64

bsdiffhs-0.3.0-cp37-cp37m-manylinux_2_5_i686.manylinux1_i686.manylinux_2_17_i686.manylinux2014_i686.whl (36.8 kB view hashes)

Uploaded CPython 3.7m manylinux: glibc 2.17+ i686 manylinux: glibc 2.5+ i686

bsdiffhs-0.3.0-cp37-cp37m-macosx_10_9_x86_64.whl (17.9 kB view hashes)

Uploaded CPython 3.7m macOS 10.9+ x86-64

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page