Manipulate JSON-like data with NumPy-like idioms.

These details have not been verified by PyPI

Project links

Project description

Awkward Array is a library for nested, variable-sized data, including arbitrary-length lists, records, mixed types, and missing data, using NumPy-like idioms.

Arrays are dynamically typed, but operations on them are compiled and fast. Their behavior coincides with NumPy when array dimensions are regular and generalizes when they're not.

Motivating example

Given an array of objects with x, y fields and variable-length nested lists like

array = ak.Array([
    [{"x": 1.1, "y": [1]}, {"x": 2.2, "y": [1, 2]}, {"x": 3.3, "y": [1, 2, 3]}],
    [],
    [{"x": 4.4, "y": {1, 2, 3, 4]}, {"x": 5.5, "y": [1, 2, 3, 4, 5]}]
])

the following slices out the y values, drops the first element from each inner list, and runs NumPy's np.square function on everything that is left:

output = np.square(array["y", ..., 1:])

The result is

[
    [[], [4], [4, 9]],
    [],
    [[4, 9, 16], [4, 9, 16, 25]]
]

The equivalent using only Python is

output = []
for sublist in array:
    tmp1 = []
    for record in sublist:
        tmp2 = []
        for number in record["y"][1:]:
            tmp2.append(np.square(number))
        tmp1.append(tmp2)
    output.append(tmp1)

Not only is the expression using Awkward Arrays more concise, using idioms familiar from NumPy, but it's much faster and uses less memory.

For a similar problem 10 million times larger than the one above (on a single-threaded 2.2 GHz processor),

the Awkward Array one-liner takes 4.6 seconds to run and uses 2.1 GB of memory,
the equivalent using Python lists and dicts takes 138 seconds to run and uses 22 GB of memory.

Speed and memory factors in the double digits are common because we're replacing Python's dynamically typed, pointer-chasing virtual machine with type-specialized, precompiled routines on contiguous data. (In other words, for the same reasons as NumPy.) Even higher speedups are possible when Awkward Array is paired with Numba.

Our presentation at SciPy 2020 provides a good introduction, showing how to use these arrays in a real analysis.

Installation

Awkward Array can be installed from PyPI using pip:

pip install awkward

You will likely get a precompiled binary (wheel), depending on your operating system and Python version. If not, pip attempts to compile from source (which requires a C++ compiler, make, and CMake).

Awkward Array is also available using conda, which always installs a binary:

conda install -c conda-forge awkward

If you have already added conda-forge as a channel, the -c conda-forge is unnecessary. Adding the channel is recommended because it ensures that all of your packages use compatible versions:

conda config --add channels conda-forge
conda update --all

Getting help

How-to tutorials

Python API reference

C++ API reference

Report bugs, request features, and ask for additional documentation on GitHub Issues.
If you have a "How do I...?" question, ask about it on StackOverflow with the [awkward-array] tag. Be sure to include tags for any other libraries that you use, such as Pandas or PyTorch.
To ask questions in real time, try the Gitter Scikit-HEP/awkward-array chat room.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.0.0

Dec 5, 2020

1.0.0rc1 pre-release

Dec 1, 2020

0.4.5

Nov 24, 2020

0.4.4

Nov 17, 2020

0.4.3

Nov 5, 2020

0.4.2

Nov 4, 2020

0.4.1

Nov 3, 2020

0.4.0

Oct 29, 2020

0.3.1

Sep 21, 2020

0.3.0

Sep 21, 2020

0.2.38

Sep 18, 2020

0.2.37

Sep 12, 2020

0.2.36

Sep 8, 2020

0.2.35

Aug 26, 2020

0.2.34

Aug 25, 2020

0.2.33

Aug 22, 2020

0.2.32

Aug 19, 2020

0.2.31

Aug 15, 2020

0.2.30

Aug 11, 2020

0.2.29

Aug 10, 2020

0.2.28

Aug 7, 2020

0.2.27

Jul 23, 2020

0.2.26

Jul 23, 2020

0.2.25

Jul 11, 2020

0.2.24

Jun 12, 2020

0.2.23

Jun 6, 2020

0.2.22

May 29, 2020

0.2.21

May 28, 2020

0.2.20

May 14, 2020

0.2.19

May 8, 2020

0.2.10

Apr 7, 2020

0.2.0

Mar 6, 2020

0.1.128

Feb 29, 2020

0.1.92

Jan 28, 2020

0.1.38

Dec 28, 2019

0.1.25

Nov 27, 2019

0.1.17

Oct 22, 2019

0.1.7

Sep 26, 2019

0.1.0

Aug 27, 2019

0.0.14

Aug 17, 2019

0.0.10

Aug 16, 2019

0.0.6

Aug 16, 2019

0.0.1

Aug 16, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

awkward1-1.0.0.tar.gz (817.2 kB view details)

Uploaded Dec 5, 2020 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

awkward1-1.0.0-py3-none-any.whl (4.7 kB view details)

Uploaded Dec 5, 2020 Python 3

File details

Details for the file awkward1-1.0.0.tar.gz.

File metadata

Download URL: awkward1-1.0.0.tar.gz
Upload date: Dec 5, 2020
Size: 817.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.24.0 setuptools/49.6.0.post20201009 requests-toolbelt/0.9.1 tqdm/4.52.0 CPython/3.8.5

File hashes

Hashes for awkward1-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`4e51fed29c3a3618cdc07719ff3d1cdd1d567643f9629532bd7d965f9fb76c50`
MD5	`f44c6acb4c165d5f468a4322882f2acc`
BLAKE2b-256	`05e1de4607482cd18eb43bfb4c7381571ad0928f7ebf0ed5815f93b21cc5e46a`

See more details on using hashes here.

File details

Details for the file awkward1-1.0.0-py3-none-any.whl.

File metadata

Download URL: awkward1-1.0.0-py3-none-any.whl
Upload date: Dec 5, 2020
Size: 4.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.24.0 setuptools/49.6.0.post20201009 requests-toolbelt/0.9.1 tqdm/4.52.0 CPython/3.8.5

File hashes

Hashes for awkward1-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`36e6cd64faf3246da030fa7a088ca23f60a5683ef57ab1068f22665599a578df`
MD5	`6bc1bb0bdd8879f19f5ca1f5192fb8f9`
BLAKE2b-256	`801654835de4e8b154eee13fecc32f4ef22ee2fca734a01d74a63e12e367686c`

See more details on using hashes here.

awkward1 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Motivating example

Installation

Getting help

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes