Memory-mapped numeric arrays, based on a format that is self-explanatory and tool-independent
Project description
Darr is a Python science library that enables you to work efficiently with disk-based numeric arrays without depending on tool-specific data formats. This makes it easy to share your data with those who do not use Darr or even Python. No exporting required and, as the data is saved in a self-explanatory way, not much explanation required either. Tool-independent and easy access to data is in line with good scientific practice as it promotes wide and long-term availability, to others but also to yourself. More rationale for this approach is provided here.
Darr supports efficient read/write/append access and is based on universally readable flat binary files and automatically generated text files, containing human-readable explanation of precisely how your binary data is stored. It also provides specific code that reads the data in a variety of current scientific data tools such as Python, R, Julia, IDL, Matlab, Maple, and Mathematica (see example array).
Darr currently supports numerical N-dimensional arrays, and experimentally supports numerical ragged arrays, i.e. a series of arrays in which one dimension varies in length.
See this tutorial for a brief introduction, or the documentation for more info.
Darr is currently pre-1.0, still undergoing significant development. It is open source and freely available under the New BSD License terms.
Features
Pro’s:
Purely based on flat binary and text files, tool independence.
Human-readable explanation of how the binary data is stored is saved in a README text file.
Includes examples of how to read the array in popular analysis environments such as Python (without Darr), R, Julia, Octave/Matlab, GDL/IDL, and Mathematica.
Supports very large data arrays, larger than RAM.
Data read/write access is simple through NumPy indexing (see here).
Data is easily appendable.
Many numeric types are supported: (u)int8-(u)int64, float16-float64, complex64, complex128.
Easy use of metadata, stored in a separate JSON text file.
Minimal dependencies, only NumPy.
Integrates easily with the Dask library for numeric computation on very large Darr arrays.
Supports ragged arrays (still experimental).
See the [documentation](http://darr.readthedocs.io/) for more information.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file darr-0.2.1.tar.gz
.
File metadata
- Download URL: darr-0.2.1.tar.gz
- Upload date:
- Size: 52.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.2.0.post20200210 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 183886142de227c8179409b8c69a5dfec0c74d467f9aa78be8224070961cad1b |
|
MD5 | a49239f6207721b98c0b689c847a7b7c |
|
BLAKE2b-256 | 3a85bc7688148bbcfe32b23f2012985c6c84adc9abb7e29be78c6c9c8c3a49fd |
File details
Details for the file darr-0.2.1-py3-none-any.whl
.
File metadata
- Download URL: darr-0.2.1-py3-none-any.whl
- Upload date:
- Size: 41.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3.post20200330 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 53d2cc25ac8b4f75f66458dee02e34698fa62d77d06e458f9330bdcde8b8bdef |
|
MD5 | 196cf35f675fb046d69b8bc10da5c4d7 |
|
BLAKE2b-256 | c988e6460bf156387047bbc4386c7dce264491ff85522408011c4adbbc08b473 |