Avro reader for Dask.
Project description
Dask-Avro
Avro reader for Dask.
Free software: MIT license
Documentation: https://dask-avro.readthedocs.org.
Python versions: 2.7, 3.5+
Features
This projects provides an Avro format reader for Dask. Provides a convenient function to read one or more Avro files and partition them arbitrarily.
Quickstart
Usage:
import dask.bag import dask_avro delayeds = dask_avro.read_avro("data-*.avro", blocksize=2**26) data = dask.bag.from_delayed(delayeds)
Credits
This package was created with Cookiecutter and the rmax/cookiecutter-pypackage project template.
History
0.1.2 (2018-02-12)
Fix compatibility with dask 0.17.0.
0.1.1 (2018-01-18)
Pin fastavro version to <0.16 as latest versions don’t allow to use internal C-based _iter_avro function.
0.1.0 (2017-02-02)
First release on PyPI.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
dask-avro-0.1.2.tar.gz
(11.0 MB
view hashes)
Built Distribution
Close
Hashes for dask_avro-0.1.2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | dde921b7244cb0417815fdbf5b50243219596d32b8ddec5e730ffd3cba87c583 |
|
MD5 | 16718a386632d9411cfaf46ef09d0b73 |
|
BLAKE2b-256 | f70e14645f858abaedc6fc37aabf0c7e545ffd3730ca0c2203cd7b670ebbda67 |