Skip to main content

A polars io-plugin wrapper around fastavro

Project description

polars-fastavro

build pypi docs

A polars io-plugin that wraps fastavro

This plugin allows reading, writing, and scanning avro files into polars DataFrames using the fastavro library.

Usage

from polars_fastavro import scan_avro, read_avro, write_avro

frame = scan_avro(...).collect()  # read_avro()
write_avro(frame, dest)

Limitations

  1. Because it uses python types an an intermediary, it's slow, (30x read to 80x write).
  2. Since this is ultimately converting between avro and arrow, it has no support for avro maps, unions (other than null), names for certain types
  3. Every type is treated as as nullable.
  4. Additionally, some types could in theory be supported by aren't for technical reasons. These include fixed, decimal, uuid, time, and duration.
  5. Timestamp support is limited. local-timestamp-*s are treated as Datetime without tz info, while timestamp-*s are reated as UTC Datetime. Writing Datetimes with nano-precision is also not supported.
  6. This can't read cloud files, as that functionality isn't exposed in python to my knowledge.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

polars_fastavro-0.3.0.tar.gz (58.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

polars_fastavro-0.3.0-py3-none-any.whl (8.1 kB view details)

Uploaded Python 3

File details

Details for the file polars_fastavro-0.3.0.tar.gz.

File metadata

  • Download URL: polars_fastavro-0.3.0.tar.gz
  • Upload date:
  • Size: 58.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.28

File hashes

Hashes for polars_fastavro-0.3.0.tar.gz
Algorithm Hash digest
SHA256 d38efd10973834083cdee2398be53d74b393bc57ddf8db74a5f70ee57bd87a8f
MD5 daa5e71f9a4523ececbd8cadf08f5738
BLAKE2b-256 86296944a728d1f6bb9d2ed6a7134b3d633cc3d5f872bdfdbf364e539cc004b8

See more details on using hashes here.

File details

Details for the file polars_fastavro-0.3.0-py3-none-any.whl.

File metadata

File hashes

Hashes for polars_fastavro-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 cf5e08eb686fad794110b69db9b51ebffca9c82b21262e697cc922afa750c2e1
MD5 2d82378a88f4dfb81c6af28488d60cb9
BLAKE2b-256 297212558f0f7bd1ecafdc171d4a679199b8f5921bdd6f3d358d65b65c3e214d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page