Skip to main content

Library for building blockchain pipelines

Project description

Introduction

PyPI Telegram Documentation GitHub

Cherry is a python library for building blockchain data pipelines.

It is designed to make building production-ready blockchain data pipelines easy.

Getting Started

See getting started section of the docs.

Features

  • Pure python library. Don't need yaml, SQL, toml etc.
  • High-level datasets API and flexible pipeline API.
  • High-performance, low-cost and uniform data access. Ability to use advanced providers without platform lock-in.
  • Included functionality to decode, validate, transform blockchain data. All implemented in rust for performance.
  • Write transformations using polars, pyarrow, datafusion, pandas, duckdb or any other pyarrow compatible library.
  • Schema inference automatically creates output tables.
  • Keep datasets fresh with continuous ingestion.
  • Parallelized, next batch of data is being fetched while your pre-processing function is running, while the database writes are being executed in parallel. Don't need to hand optimize anything.
  • Included library of transformations.
  • Included functionality to implement crash-resistance.

Data providers

Provider Ethereum (EVM) Solana (SVM)
HyperSync
SQD
Yellowstone-GRPC

Supported output formats

  • ClickHouse
  • Iceberg
  • Deltalake
  • DuckDB
  • Arrow Datasets
  • Parquet

Usage examples

Logging

Python code uses the standard logging module of python, so it can be configured according to python docs.

Set RUST_LOG environment variable according to env_logger docs in order to see logs from rust modules.

To run an example with trace level logging for rust modules:

RUST_LOG=trace uv run examples/path/to/my/example

Development

This repo uses uv for development.

  • Format the code with uv run ruff format
  • Lint the code with uv run ruff check
  • Run type checks with uv run pyright
  • Run the tests with uv run pytest

Core libraries we use for ingesting/decoding/validating/transforming blockchain data are implemented in cherry-core repo.

License

Licensed under either of

at your option.

Contribution

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.

Sponsors

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cherry_etl-0.5.1.tar.gz (24.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cherry_etl-0.5.1-py3-none-any.whl (25.3 kB view details)

Uploaded Python 3

File details

Details for the file cherry_etl-0.5.1.tar.gz.

File metadata

  • Download URL: cherry_etl-0.5.1.tar.gz
  • Upload date:
  • Size: 24.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.6.10

File hashes

Hashes for cherry_etl-0.5.1.tar.gz
Algorithm Hash digest
SHA256 1e28f599880d49592180eaa57145cee8914e04822a7262e30affd9cfe409b073
MD5 fd680a41f6c522582fd4fb7ca2336e8c
BLAKE2b-256 26b22ef020154c4da0a8e8718ba5c0113e6baf0166588e47cb1ca81e0c0530c7

See more details on using hashes here.

File details

Details for the file cherry_etl-0.5.1-py3-none-any.whl.

File metadata

  • Download URL: cherry_etl-0.5.1-py3-none-any.whl
  • Upload date:
  • Size: 25.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.6.10

File hashes

Hashes for cherry_etl-0.5.1-py3-none-any.whl
Algorithm Hash digest
SHA256 68a675c3598451f0e200af657bf1b332e9617158e4340c465d4cb375abc9cdc0
MD5 c9b61f4c6c656b948480a210d70d0c3d
BLAKE2b-256 35540282f09d7c27d76b5f8a17099152d7b4f419cb55cfdbb9f135538ada3019

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page