Hypothesis strategies for Awkward Array
Project description
hypothesis-awkward
Hypothesis strategies for Awkward Arrays.
Hypothesis is a property-based testing library. Its strategies are Python functions that strategically generate test data that can fail test cases in pytest and other testing frameworks. Once a test fails, Hypothesis searches for the simplest sample that causes the same error. Hypothesis automatically explores edge cases; developers do not need to craft test data manually.
Property-based testing is useful for finding edge cases in array libraries and in code that uses them. In fact, Hypothesis strategies for NumPy and pandas data types are included in Hypothesis itself. Xarray provides strategies for its data structure. The Apache Arrow codebase has strategies for PyArrow, which are not officially documented in its API reference.
This package, hypothesis-awkward, is a collection of Hypothesis strategies for Awkward Array, which can represent a wide variety of layouts of nested, variable-length, and mixed-type data. The current version of this package includes strategies that generate samples with certain types of layouts. The goal is to develop strategies that can generate fully general Awkward Arrays with multiple options to control the layout, data types, missing values, masks, and other array attributes. These strategies can help close in on edge cases in tools that use Awkward Array, and Awkward Array itself.
Installation
You can install the package from PyPI using pip:
pip install hypothesis-awkward
This also installs Hypothesis and Awkward Array as dependencies unless they are already installed.
The strategy arrays()
The function arrays() is the main strategy. It generates Awkward Arrays with
many options to control the output arrays.
Sample outputs of arrays()
You can see sample outputs of the current version of arrays() in the test
case:
from hypothesis import given
import awkward as ak
import hypothesis_awkward.strategies as st_ak
@given(array=st_ak.constructors.arrays())
def test_array(array: ak.Array) -> None:
print(f'{array=!r}')
For example, this might print:
array=<Array [] type='0 * bool'>
array=<Array [32766, 32766, 32766, 32766, 32766] type='5 * int16'>
array=<Array [[], [], [], []] type='4 * var * var * unknown'>
array=<Array ['', ''] type='2 * string'>
array=<Array [[b'\xd7']] type='1 * var * bytes'>
array=<Array [] type='0 * var * {"": bool}'>
array=<Array [[], []] type='2 * var * (unknown, union[2 * (string, string), bytes])'>
array=<Array [('\U0003dcd5hE2'), ('¦Ü'), ..., (...), (..., ...)] type='10 * (string)'>
array=<Array [[NaT], [NaT]] type='2 * 1 * union[(unknown), timedelta64[Y]]'>
array=<Array [[], [...], [], [], []] type='5 * union[var * unknown, {Nok: unknown...'>
array=<Array [??, ??, ??, ??, ??, ??] type='6 * bytes'>
array=<Array [[...], [...], ..., ['ÆÓË\U000913a9\x1fê', 'X']] type='5 * 2 * string'>
array=<Array [[[??]], [[??]], [[??]]] type='3 * 1 * 1 * var * union[var * bytes, ...'>
array=<Array [[[[], []], [[]], [], []]]] type='1 * 1 * 3 * var * 1 * var * uint16'>
array=<Array [??, ??, ??, ??, ??] type='5 * var * var * (uint64, bytes)'>
The current version generates arrays with NumpyArray, EmptyArray, string,
and bytestring as leaf contents that can be nested multiple levels deep in
RegularArray, ListOffsetArray, ListArray, RecordArray, and UnionArray.
Arrays might be virtual, shown as ?? in the output.
The options of arrays()
The strategy arrays() has many options to control the output arrays. You can
find all options in the API reference:
Other strategies
In addition to arrays(), this package includes other strategies that generate
Awkward Arrays and related data types, which can be found in the API reference:
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file hypothesis_awkward-0.10.0.tar.gz.
File metadata
- Download URL: hypothesis_awkward-0.10.0.tar.gz
- Upload date:
- Size: 129.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
980a9f2b5116071ca640e8ce7351066f5cbd7dcd34865af8d958765ed4a27907
|
|
| MD5 |
836e0506f2d529fe13a0ec7718222cd2
|
|
| BLAKE2b-256 |
a682bad19af1e6caa64b2203e54faabacdc9ca33a3c84963f22b933f99b0133c
|
Provenance
The following attestation bundles were made for hypothesis_awkward-0.10.0.tar.gz:
Publisher:
pypi.yml on scikit-hep/hypothesis-awkward
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
hypothesis_awkward-0.10.0.tar.gz -
Subject digest:
980a9f2b5116071ca640e8ce7351066f5cbd7dcd34865af8d958765ed4a27907 - Sigstore transparency entry: 1189080904
- Sigstore integration time:
-
Permalink:
scikit-hep/hypothesis-awkward@e0f048708d1a69a5294305f0d61c8511de293bb4 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/scikit-hep
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
pypi.yml@e0f048708d1a69a5294305f0d61c8511de293bb4 -
Trigger Event:
workflow_run
-
Statement type:
File details
Details for the file hypothesis_awkward-0.10.0-py3-none-any.whl.
File metadata
- Download URL: hypothesis_awkward-0.10.0-py3-none-any.whl
- Upload date:
- Size: 39.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
884702860d0b8867e7ddb655f9cc3ec421d9fd9b026621c16c247fe5542f325a
|
|
| MD5 |
c92658f1e10316bd89fc074964c6d9ad
|
|
| BLAKE2b-256 |
135f024a3e75165aa105596adbc72aba4fe1d154ae7e9c1e6737476e32d8f3d8
|
Provenance
The following attestation bundles were made for hypothesis_awkward-0.10.0-py3-none-any.whl:
Publisher:
pypi.yml on scikit-hep/hypothesis-awkward
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
hypothesis_awkward-0.10.0-py3-none-any.whl -
Subject digest:
884702860d0b8867e7ddb655f9cc3ec421d9fd9b026621c16c247fe5542f325a - Sigstore transparency entry: 1189080928
- Sigstore integration time:
-
Permalink:
scikit-hep/hypothesis-awkward@e0f048708d1a69a5294305f0d61c8511de293bb4 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/scikit-hep
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
pypi.yml@e0f048708d1a69a5294305f0d61c8511de293bb4 -
Trigger Event:
workflow_run
-
Statement type: