PySpark package for Catalyst logical plan capture with Base64 logging marker SPARK_LOGICAL_PLAN_CAPTURE_V2.
Project description
spark-logical-plan-capture-v2
PyPI-пакет для PySpark 3.5.x, который подключает Spark SQL extension:
- перехватывает логические планы Catalyst
- сериализует payload в Java binary form
- кодирует payload в Base64
- пишет маркер в лог:
SPARK_LOGICAL_PLAN_CAPTURE_V2:<base64>
Installation
pip install spark-logical-plan-capture-v2
Usage
from pyspark.sql import SparkSession
from spark_logical_plan_capture import configure_spark_builder
builder = SparkSession.builder.appName("capture-demo").master("local[*]")
spark = configure_spark_builder(builder).getOrCreate()
spark.sql("select 1 as value").show()
Decode SQL from log line
from spark_logical_plan_capture import decode_captured_sql_from_logline
log_line = "SPARK_LOGICAL_PLAN_CAPTURE_V2:<base64>"
sql = decode_captured_sql_from_logline(spark, log_line)
print(sql)
Build and publish
- Соберите JVM jar:
sbt clean test package
- Скопируйте jar в Python package data:
python scripts/prepare_python_package.py
- Соберите wheel/sdist:
python -m build
- Опубликуйте:
python -m twine upload dist/*
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file spark_logical_plan_capture_v2-0.1.1.tar.gz.
File metadata
- Download URL: spark_logical_plan_capture_v2-0.1.1.tar.gz
- Upload date:
- Size: 4.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
977cfc05748a48d8bebe26d85d89f1db78f07aa114616d5e2b60c828a01a73f0
|
|
| MD5 |
444ad2d1f431f6c7711b514cbbb096a1
|
|
| BLAKE2b-256 |
2322dc905e42bad31b0763c99f1691a683ca78b76eb0abd717d3211b52a70263
|
File details
Details for the file spark_logical_plan_capture_v2-0.1.1-py3-none-any.whl.
File metadata
- Download URL: spark_logical_plan_capture_v2-0.1.1-py3-none-any.whl
- Upload date:
- Size: 1.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d315843ab1eedbf9d6ab3cb3b2957f69b1405a96da6d400b19ced49b0a907d6f
|
|
| MD5 |
495155dd551fd79a88f9a9843d4b54bc
|
|
| BLAKE2b-256 |
6d184f1424906d25b19ccf4b8dc703e5baa34662af56a803b9006a6531b43bc3
|