CLI for querying the Apache Spark History Server REST API
Project description
spark-history-cli
A CLI for querying the Apache Spark History Server REST API.
Prerequisites
- Python 3.10+
- A running Spark History Server (default:
http://localhost:18080)
Start the History Server:
$SPARK_HOME/sbin/start-history-server.sh
Installation
cd spark-history-cli
pip install -e .
Or install from PyPI:
pip install spark-history-cli
Install as agent skills
Install skills for any supported agent (Claude Code, Copilot, Cursor, Codex, and 39 more):
npx skills add yaooqinn/spark-history-cli
This installs two skills:
- spark-history-cli — Query the Spark History Server
- spark-advisor — Diagnose, compare, and optimize Spark applications
Or install via the bundled CLI command (Copilot CLI / Claude Code only):
spark-history-cli install-skill
Usage
REPL Mode (default)
spark-history-cli
# or specify a server:
spark-history-cli --server http://my-shs:18080
One-Shot Commands
# List applications
spark-history-cli apps
spark-history-cli apps --status completed --limit 10
# Application details
spark-history-cli app <app-id>
# Jobs, stages, executors (requires --app-id or 'use' in REPL)
spark-history-cli --app-id <id> jobs
spark-history-cli --app-id <id> stages
spark-history-cli --app-id <id> executors --all
spark-history-cli --app-id <id> sql
spark-history-cli --app-id <id> env
spark-history-cli --app-id <id> summary
# SQL execution plans
spark-history-cli --app-id <id> sql-plan <exec-id> # full plan
spark-history-cli --app-id <id> sql-plan <exec-id> --view initial # pre-AQE plan
spark-history-cli --app-id <id> sql-plan <exec-id> --view final # post-AQE plan
spark-history-cli --app-id <id> sql-plan <exec-id> --dot # Graphviz DOT
spark-history-cli --app-id <id> sql-plan <exec-id> --dot -o plan.dot # save to file
# Jobs for a SQL execution
spark-history-cli --app-id <id> sql-jobs <exec-id>
# Download event logs
spark-history-cli --app-id <id> logs output.zip
# JSON output for scripting/agents
spark-history-cli --json apps
spark-history-cli --json --app-id <id> jobs
spark-history-cli --json --app-id <id> sql-plan <exec-id>
spark-history-cli --json --app-id <id> sql-jobs <exec-id>
REPL Commands
apps List applications
app <id> Show app details and set as current
attempts List attempts for current app
attempt <id> Show attempt details
use <id> Set current app context
jobs List jobs for current app
job <id> Show job details
job-stages <id> Show stages for a job
stages List stages
stage <id> [attempt] Show stage details
stage-summary <id> Task metric quantiles (p5-p95)
stage-tasks <id> List tasks (--length N, --sort-by)
executors [--all] List executors
sql [id] List or show SQL executions
sql-plan <id> [opts] Show SQL plan (--view, --dot, -o)
sql-jobs <id> Show jobs for a SQL execution
summary Application overview (config + workload)
processes List miscellaneous processes
rdds List cached RDDs
env Show environment/config
logs [path] Download event logs
version Show Spark version
server <url> Change server URL
status Show session state
help Show help
quit Exit
Environment Variables
SPARK_HISTORY_SERVER— Default server URL (overrideshttp://localhost:18080)
API Coverage
Wraps all 20 endpoints of the Spark History Server REST API (/api/v1/):
- Applications (list, get, attempts)
- Jobs (list, get)
- Stages (list, get, attempts, task summary, task list)
- Executors (active, all)
- SQL Executions (list, get with plan graph)
- Storage (RDD list, detail)
- Environment
- Event Logs (download as ZIP)
- Miscellaneous Processes
- Version
License
Apache License 2.0
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file spark_history_cli-1.4.0.tar.gz.
File metadata
- Download URL: spark_history_cli-1.4.0.tar.gz
- Upload date:
- Size: 35.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
419cb73e18a038bf0867921e0addc0fee585163be862afed95acf4cc8c83bb5c
|
|
| MD5 |
10fb862654f054f0a93e9f8a9e4279ff
|
|
| BLAKE2b-256 |
a1cc34c287542f7b23420b846088e5e453434b2ee81a9806649bf211cae3ebe8
|
Provenance
The following attestation bundles were made for spark_history_cli-1.4.0.tar.gz:
Publisher:
publish.yml on yaooqinn/spark-history-cli
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
spark_history_cli-1.4.0.tar.gz -
Subject digest:
419cb73e18a038bf0867921e0addc0fee585163be862afed95acf4cc8c83bb5c - Sigstore transparency entry: 1147557823
- Sigstore integration time:
-
Permalink:
yaooqinn/spark-history-cli@0c6082b2f44779e044f4b4e5ef7486d827a74226 -
Branch / Tag:
refs/tags/v1.4.0 - Owner: https://github.com/yaooqinn
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@0c6082b2f44779e044f4b4e5ef7486d827a74226 -
Trigger Event:
release
-
Statement type:
File details
Details for the file spark_history_cli-1.4.0-py3-none-any.whl.
File metadata
- Download URL: spark_history_cli-1.4.0-py3-none-any.whl
- Upload date:
- Size: 38.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d31d04de862c1e82e3c6888e68afe3937369c3cd798d4c0a32efc459a63c1aa6
|
|
| MD5 |
cb92c61b8b508ec1c9ee4ac0a3015222
|
|
| BLAKE2b-256 |
2d2ff162ce15cf4a1bbe015e26c0307cd95abb552fa9235ffa8cc2fc4ce339d2
|
Provenance
The following attestation bundles were made for spark_history_cli-1.4.0-py3-none-any.whl:
Publisher:
publish.yml on yaooqinn/spark-history-cli
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
spark_history_cli-1.4.0-py3-none-any.whl -
Subject digest:
d31d04de862c1e82e3c6888e68afe3937369c3cd798d4c0a32efc459a63c1aa6 - Sigstore transparency entry: 1147558083
- Sigstore integration time:
-
Permalink:
yaooqinn/spark-history-cli@0c6082b2f44779e044f4b4e5ef7486d827a74226 -
Branch / Tag:
refs/tags/v1.4.0 - Owner: https://github.com/yaooqinn
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@0c6082b2f44779e044f4b4e5ef7486d827a74226 -
Trigger Event:
release
-
Statement type: