Skip to main content

PySpark package for Catalyst logical plan capture with Base64 logging marker SPARK_LOGICAL_PLAN_CAPTURE_V2.

Project description

spark-logical-plan-capture-v2

PyPI-пакет для PySpark 3.5.x, который подключает Spark SQL extension:

  • перехватывает логические планы Catalyst
  • сериализует payload в Java binary form
  • кодирует payload в Base64
  • пишет маркер в лог: SPARK_LOGICAL_PLAN_CAPTURE_V2:<base64>

Installation

pip install spark-logical-plan-capture-v2

Usage

from pyspark.sql import SparkSession
from spark_logical_plan_capture import configure_spark_builder

builder = SparkSession.builder.appName("capture-demo").master("local[*]")
spark = configure_spark_builder(builder).getOrCreate()

spark.sql("select 1 as value").show()

Decode SQL from log line

from spark_logical_plan_capture import decode_captured_sql_from_logline

log_line = "SPARK_LOGICAL_PLAN_CAPTURE_V2:<base64>"
sql = decode_captured_sql_from_logline(spark, log_line)
print(sql)

Build and publish

  1. Соберите JVM jar:
sbt clean test package
  1. Скопируйте jar в Python package data:
python scripts/prepare_python_package.py
  1. Соберите wheel/sdist:
python -m build
  1. Опубликуйте:
python -m twine upload dist/*

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spark_logical_plan_capture_v2-0.1.1.tar.gz (4.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file spark_logical_plan_capture_v2-0.1.1.tar.gz.

File metadata

File hashes

Hashes for spark_logical_plan_capture_v2-0.1.1.tar.gz
Algorithm Hash digest
SHA256 977cfc05748a48d8bebe26d85d89f1db78f07aa114616d5e2b60c828a01a73f0
MD5 444ad2d1f431f6c7711b514cbbb096a1
BLAKE2b-256 2322dc905e42bad31b0763c99f1691a683ca78b76eb0abd717d3211b52a70263

See more details on using hashes here.

File details

Details for the file spark_logical_plan_capture_v2-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for spark_logical_plan_capture_v2-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 d315843ab1eedbf9d6ab3cb3b2957f69b1405a96da6d400b19ced49b0a907d6f
MD5 495155dd551fd79a88f9a9843d4b54bc
BLAKE2b-256 6d184f1424906d25b19ccf4b8dc703e5baa34662af56a803b9006a6531b43bc3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page