Avro reader for Dask.

Project description

Dask-Avro

Features

This project provides an Avro format reader for Dask: a convenient function that reads one or more Avro files and partitions them arbitrarily.

Quickstart

Usage:

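    import dask.bag
    import dask_avro

    # Read every file matching the glob pattern; blocksize=2**26 asks for
    # partitions of about 64 MiB each. The result is a list of delayed objects.
    delayeds = dask_avro.read_avro("data-*.avro", blocksize=2**26)

    # Combine the delayed partitions into a single Dask bag.
    data = dask.bag.from_delayed(delayeds)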
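
To get a feel for how the blocksize argument relates to the number of partitions, here is a rough sketch using only the standard library. It assumes each file is cut into consecutive byte ranges of at most blocksize bytes, so a file yields ceil(size / blocksize) partitions. That splitting rule, the helper name estimate_partitions, and the file names and sizes are assumptions made for illustration, not confirmed behavior of dask_avro; the actual reader may align blocks differently.

```python
def estimate_partitions(file_sizes, blocksize):
    """Estimate partitions per file (hypothetical helper, not part of dask_avro).

    Assumes a file of `size` bytes is split into ceil(size / blocksize)
    byte ranges and that an empty file produces no partition.
    """
    if blocksize <= 0:
        raise ValueError("blocksize must be positive")
    # -(-a // b) is integer ceiling division, which avoids float rounding.
    return {name: -(-size // blocksize) for name, size in file_sizes.items()}


MIB = 2**20
blocksize = 2**26  # 67,108,864 bytes, i.e. 64 MiB, as in the Quickstart

# Made-up file sizes, for illustration only.
sizes = {
    "data-0.avro": 200 * MIB,
    "data-1.avro": 64 * MIB,
    "data-2.avro": 10 * MIB,
}

counts = estimate_partitions(sizes, blocksize)
total = sum(counts.values())
print(counts)
print("total partitions:", total)
```

Under these assumptions, a smaller blocksize gives more, smaller partitions (more parallelism, more scheduling overhead), and a larger one gives fewer, bigger partitions.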

Credits

This package was created with Cookiecutter and the rmax/cookiecutter-pypackage project template.

History

0.3.0 (2018-06-16)

  • Fixed support for the latest fastavro release.

  • Require fastavro>=0.17.

0.2.1 (2018-06-15)

  • Pin fastavro version to <=0.19.6 due to breaking changes.

0.2.0 (2018-02-12)

  • Added support for fastavro 0.16+.

0.1.2 (2018-02-12)

  • Fix compatibility with dask 0.17.0.

0.1.1 (2018-01-18)

  • Pin fastavro version to <0.16 because the latest versions do not allow use of the internal C-based _iter_avro function.

0.1.0 (2017-02-02)

  • First release on PyPI.

Download files

Source Distribution

dask-avro-0.3.0.tar.gz (11.0 MB)

Uploaded Source

Built Distribution

dask_avro-0.3.0-py2.py3-none-any.whl (3.8 kB)

Uploaded Python 2 Python 3

File details

Details for the file dask-avro-0.3.0.tar.gz.

File metadata

  • Download URL: dask-avro-0.3.0.tar.gz
  • Upload date:
  • Size: 11.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for dask-avro-0.3.0.tar.gz
Algorithm    Hash digest
SHA256       60e8cf79e5527f2d24a19b6956fdf5f090d9c44faf473876eddc16a21bece833
MD5          6f679df679e6db6374e740d82a6bbeac
BLAKE2b-256  c4da1fb1cbecda8bedd880f8a2e8c065d63fdad92ac9561c03a3e72e6f335a07

File details

Details for the file dask_avro-0.3.0-py2.py3-none-any.whl.

File hashes

Hashes for dask_avro-0.3.0-py2.py3-none-any.whl
Algorithm    Hash digest
SHA256       1784a523826dec30cd20a6e360c0caca3f2268fbfe6556e28e110cd2b1e3b655
MD5          56e4037275ae45b7126368e90c83b7e0
BLAKE2b-256  36674e54ea449224829476e053a20e55291da150a549829afa20892f67ebb956