Skip to main content

Python Library to connect to the NEAR Lake S3 and stream the data

Project description

NEAR Lake Framework for Python

Available in programming languages: Rust | Javascript | Python3

NEAR Lake Framework is a small library companion to NEAR Lake. It allows you to build your own indexer that subscribes to the stream of blocks from the NEAR Lake data source and create your own logic to process the NEAR Protocol data.

PyPI version MIT or Apache 2.0 licensed


Official NEAR Lake Framework launch announcement has been published on the NEAR Gov Forum Greetings from the Data Platform Team! We are happy and proud to announce an MVP release of a brand new word in indexer building approach - NEAR Lake Framework.


Example

import asyncio
import logging
import os

from near_lake_framework import LakeConfig, streamer, Network
from near_lake_framework.utils import fetch_latest_block

# Suppress warning logs from specific dependencies
logging.getLogger("near_lake_framework").setLevel(logging.INFO)


async def main():
    network = Network.TESTNET
    latest_final_block = fetch_latest_block(network=network)
    config = LakeConfig(
        network,
        start_block_height=latest_final_block,
        # These fields must be set!
        aws_access_key_id=os.environ["AWS_ACCESS_KEY_ID"],
        aws_secret_key=os.environ["AWS_SECRET_ACCESS_KEY"],
    )

    stream_handle, streamer_messages_queue = streamer(config)
    while True:
        streamer_message = await streamer_messages_queue.get()
        print(
            f"Received Block #{streamer_message.block.header.height} from Lake Framework"
        )


if __name__ == "__main__":
    loop = asyncio.new_event_loop()
    loop.run_until_complete(main())

Try it yourself as follows

pip3 install -r requirements.txt
python3 example/run.py

How to use

Dependencies

Install near-lake-framework

$ pip3 install near-lake-framework

Credentials

To be able to access the data from NEAR Lake you need to provide credentials. Please, see the Credentials article

Configuration

Everything should be configured before the start of your indexer application via LakeConfig struct.

Available parameters:

  • network: Network - provide network
  • s3_bucket_name: str - provide the AWS S3 bucket name (near-lake-testnet, near-lake-mainnet or yours if you run your own NEAR Lake)
  • s3_region_name: str - provide the region for AWS S3 bucket
  • start_block_height: BlockHeight - block height to start the stream from
  • blocks_preload_pool_size: int - provide the number of blocks to preload (default: 200)

Cost estimates

TL;DR approximately $18.15 per month (for AWS S3 access, paid directly to AWS) for the reading of fresh blocks

Explanation:

Assuming NEAR Protocol produces accurately 1 block per second (which is really not, the average block production time is 1.3s). A full day consists of 86400 seconds, that's the max number of blocks that can be produced.

According the Amazon S3 prices list requests are charged for $0.005 per 1000 requests and get is charged for $0.0004 per 1000 requests.

Calculations (assuming we are following the tip of the network all the time):

86400 blocks per day * 5 requests for each block / 1000 requests * $0.0004 per 1k requests = $0.173 * 30 days = $5.19

Note: 5 requests for each block means we have 4 shards (1 file for common block data and 4 separate files for each shard)

And a number of list requests we need to perform for 30 days:

86400 blocks per day / 1000 requests * $0.005 per 1k list requests = $0.432 * 30 days = $12.96

$5.19 + $12.96 = $18.15

The price depends on the number of shards

Real-life Example

Pagoda, a leading blockchain infrastructure company, utilizes the NEAR Lake Framework for Python in their Gas Station Event Indexer. This real-world implementation showcases the framework's capabilities in efficiently processing and indexing blockchain data.

Publishing to PyPi

Follow this guide to safely publish PyPi package

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

near_lake_framework-0.1.3.tar.gz (12.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

near_lake_framework-0.1.3-py3-none-any.whl (12.2 kB view details)

Uploaded Python 3

File details

Details for the file near_lake_framework-0.1.3.tar.gz.

File metadata

  • Download URL: near_lake_framework-0.1.3.tar.gz
  • Upload date:
  • Size: 12.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.9.6

File hashes

Hashes for near_lake_framework-0.1.3.tar.gz
Algorithm Hash digest
SHA256 3789a1788a2dd8096bdaa84b7c3c2c70ed10cae61f3ad37ea66573d11712ef2d
MD5 f0e00a1c5d85e02c6d0a4c8a3e73b735
BLAKE2b-256 65f116aa722b58af283668472298d9c635e628d6f1173578fb470894c67d82d1

See more details on using hashes here.

File details

Details for the file near_lake_framework-0.1.3-py3-none-any.whl.

File metadata

File hashes

Hashes for near_lake_framework-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 2d0c53a6b04db43c5e6e0340f7036a8c375379e33670cbf0ea94e12c3ababa36
MD5 36b4f1874f37f1c34250a7a0eac17f48
BLAKE2b-256 cd250f3ee85e382d612895ab5c396e2c0698286909fcf52b50a8aa650d4745af

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page