Skip to main content

Fast bytes and bytearray processing functions

Project description

Authors:

Michael Griffin

Version:
2.2.0 for 2021-06-06
License:

This document may be distributed under the Apache 2.0 License.

Language:

Python 3.6 or later


Introduction

The BytesFunc module provides high speed array processing functions for use with Python ‘bytes’ and ‘bytearray’ objects. These functions are patterned after the functions in the standard Python “operator” module together with some additional ones from other sources.

The purpose of these functions is to perform mathematical calculations on “bytes” and “bytearray” objects significantly faster than using native Python.


Function Summary

The compare operators used for ‘ball’, ‘bany’, and ‘findindex’ are examples only, and other compare operations are available. Many functions will accept other parameter combinations of sequences and numeric parameters. See the details for each function for what parameter combinations are valid.

Brief Description

Function

Equivalent to

and_

Perform a bitwise AND across the sequence.

ball

True if all elements of the sequence meet the match criteria.

bany

True if any elements of the sequence meet the match criteria.

bmax

Return the maximum value in the sequence.

bmin

Return the minimum value in the sequence.

bsum

Return the sum of the sequence.

eq

True if all elements of the sequence equal the compare value.

findindex

Returns the index of the first value in an array to meet the specified criteria.

ge

True if all elements of the sequence are greater than or equal to the compare value.

gt

True if all elements of the sequence are greater than the compare value.

invert

Perform a bitwise invert across the sequence.

le

True if all elements of the sequence are less than or equal to the compare value.

lshift

Perform a bitwise left shift across the sequence.

lt

True if all elements of the sequence are less than the compare value.

ne

True if all elements of the sequence are not equal the compare value.

or_

Perform a bitwise OR across the sequence.

rshift

Perform a bitwise right shift across the sequence.

xor

Perform a bitwise XOR across the sequence.

Python Equivalent

Function

Equivalent to

and_

[x & param for x in sequence1]

ball

all([(x > param) for x in array])

bany

any([(x > param) for x in array])

bmax

max(sequence)

bmin

min(sequence)

bsum

sum(sequence)

eq

all([x == param for x in sequence])

findindex

[x for x,y in enumerate(array) if y > param][0]

ge

all([x >= param for x in sequence])

gt

all([x > param for x in sequence])

invert

[~x for x in sequence1]

le

all([x <= param for x in sequence])

lshift

[x << param for x in sequence1]

lt

all([x < param for x in sequence])

ne

all([x != param for x in sequence])

or_

[x | param for x in sequence1]

rshift

[x >> param for x in sequence1]

xor

[x ^ param for x in sequence1]


Supported Sequence Types

BytesFunc supports Python native “bytes” and “bytearray” objects.


Performance

Average performance increase on x86_64 Ubuntu with GCC is 600 times faster than native Python. Performance will vary depending on the function, with the performance increase ranging from 7 times to 1500 times.

Other platforms show similar improvements.

Detailed performance figures are listed in the full documentation.


Platform support

BytesFunc is written in ‘C’ and uses the standard C libraries to implement the underlying math functions. BytesFunc has been tested on the following platforms.

OS

Bits

Compiler

Python Version Tested

Ubuntu 20.04 LTS

64 bit

GCC

3.8

Ubuntu 21.04

64 bit

GCC

3.9

Debian 10

32 bit

GCC

3.7

Debian 10

64 bit

GCC

3.7

OpenSuse 15.3

64 bit

GCC

3.6

Centos 8.4

64 bit

GCC

3.6

FreeBSD 13

64 bit

LLVM

3.7

OpenBSD 6.9

64 bit

LLVM

3.8

MS Windows 10

64 bit

MS VS C 2015

3.9

Raspbian (RPi 3)

32 bit

GCC

3.7

Ubuntu 20.04 (RPi 4)

64 bit

GCC

3.8

  • The Raspbian (RPi 3) tests were conducted on a Raspberry Pi 3 ARM CPU running in 32 bit mode.

  • The Ubuntu ARM tests were conducted on a Raspberry Pi 4 ARM CPU running in 64 bit mode.

  • All others were conducted using VMs running on x86 hardware.


Installation

Please note that this is a Python 3 package. To install using Pip, you will need (with Debian package in brackets):

  • The appropriate C compiler and header files (gcc and build-essential).

  • The Python3 development headers (python3-dev).

  • Pip3 together with the corresponding Setuptools (python3-pip).

example:

# Install from PyPI.
pip3 install arrayfunc
# Install from a local copy of the source package (Linux).
pip3 install --no-index --find-links=. arrayfunc
# Install a local package as a user package.
pip3 install --user --no-index --find-links=. arrayfunc
# Windows, FreeBSD, and OpenBSD seems to use "pip" instead
# of "pip3" for some reason.
pip install arrayfunc

Newer versions of OpenBSD and FreeBSD will not install this package correctly when running setup.py directly. Use pip to install, even for local package installs. Testing of this package has been changed to use only pip (or pip3) in order to provide a common testing method for all platforms. Testing using setup.py directly is no longer done.


Release History

  • 2.2.0 - Updated benchmarks to make each one a separate file. Centos and

    OpenSuse test platforms updated to latest versions.

  • 2.1.1 - Documentation updated and version number bumped to reflect testing

    with Ubuntu 21.04, FreeBSD 13.0, and OpenBSD 6.9. No code changes.

  • 2.1.0 - Changed setup.py to detect Raspberry Pi 4 and set the compiler args

    accordingly. Added support for Pi 4. Dropped testing of 64 bit mode on Pi 3.

  • 2.0.1 - Documentation updated to reflect testing with the release version

    of Ubuntu 20.04 ARM (Rasberry Pi), Ubuntu 2010 (x86-64), OpenBSD 6.8, and Python 3.9 on Windows. No code changes and no change in version number.

  • 2.0.0 - Documentation updated to reflect testing with the release version

    of Ubuntu 20.04. No code changes and no change in version number.

  • 2.0.0 - Added SIMD support for ARMv8 AARCH64. This is 64 bit ARM on a

    Raspberry Pi3 when running 64 bit Ubuntu. Raspbian is 32 bit only and has 64 bit SIMD vectors. 64 bit ARM has 128 bit SIMD vectors and so offers improved performance.

  • 1.0.0 - First release.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bytesfunc-2.2.0.tar.gz (53.7 kB view details)

Uploaded Source

Built Distribution

bytesfunc-2.2.0-cp39-cp39-win_amd64.whl (155.1 kB view details)

Uploaded CPython 3.9 Windows x86-64

File details

Details for the file bytesfunc-2.2.0.tar.gz.

File metadata

  • Download URL: bytesfunc-2.2.0.tar.gz
  • Upload date:
  • Size: 53.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.22.0 setuptools/45.2.0 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.8.5

File hashes

Hashes for bytesfunc-2.2.0.tar.gz
Algorithm Hash digest
SHA256 ea9b8f785495517be01809fe9a496e9b943446bd965e594d880b830878656e19
MD5 e5b31a71d1002ec5b5d07e984bbe352e
BLAKE2b-256 4abc4820ec412f96a1ef0e7448f5bc2c40e4dbf388f96691fa9064ffb3232956

See more details on using hashes here.

File details

Details for the file bytesfunc-2.2.0-cp39-cp39-win_amd64.whl.

File metadata

  • Download URL: bytesfunc-2.2.0-cp39-cp39-win_amd64.whl
  • Upload date:
  • Size: 155.1 kB
  • Tags: CPython 3.9, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.9.0

File hashes

Hashes for bytesfunc-2.2.0-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 0eebed8a2c0b48ba48ee61b8dac295decb9fa653cd9708c625d5fad097fa0a61
MD5 351f563b7aa69e3bab0c05d9773cf1a4
BLAKE2b-256 be816f9971966d2c2284989d54c4d91b2f250a9b1fb1d578ee693adb10f745d1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page