lake-inspect

Health inspector for Iceberg, Delta, and Hudi lakehouse tables. No Spark, no catalog, no cluster needed.

Install

pip install lake-inspect

Usage

# single table
lake-inspect --path /path/to/table --format iceberg
lake-inspect --path /path/to/table --format delta
lake-inspect --path /path/to/table --format hudi

# scan an entire directory, auto-detecting each table's format
lake-inspect --scan-dir /path/to/tables

# export as JSON
lake-inspect --path /path/to/table --format iceberg --output json
lake-inspect --path /path/to/table --format iceberg --output json --out-file report.json
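
The README does not say how --scan-dir decides which format a table uses. One plausible approach, sketched here as an assumption rather than as lake-inspect's actual logic, is to look for the marker directories each format writes at the table root: _delta_log for Delta, .hoodie for Hudi, and a metadata directory containing *.metadata.json files for Iceberg. The function names detect_format and scan_dir are hypothetical.

```python
from pathlib import Path
from typing import Dict, Optional


def detect_format(table_path: Path) -> Optional[str]:
    """Guess a table's format from marker directories (illustrative only)."""
    if (table_path / "_delta_log").is_dir():
        return "delta"
    if (table_path / ".hoodie").is_dir():
        return "hudi"
    metadata_dir = table_path / "metadata"
    # Iceberg metadata files are named like v1.metadata.json or
    # 00001-<uuid>.metadata.json; compressed variants are not handled here.
    if metadata_dir.is_dir() and any(metadata_dir.glob("*.metadata.json")):
        return "iceberg"
    return None


def scan_dir(root: Path) -> Dict[str, Optional[str]]:
    """Map each immediate subdirectory of root to its detected format."""
    return {p.name: detect_format(p) for p in sorted(root.iterdir()) if p.is_dir()}
```

Checking the more distinctive markers first (_delta_log, .hoodie) avoids misclassifying a table that merely happens to contain a directory called metadata.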

What it checks

Iceberg

  • Snapshot count: too many snapshots slow query planning
  • Small files: many small data files hurt read performance
  • Schema drift: too many schema versions
  • Partition spec: a missing partition spec means every query is a full scan
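
Three of the Iceberg checks above can be derived from the table metadata JSON alone, which is why no catalog is needed. The sketch below is not lake-inspect's implementation (the README does not show it); it assumes an already-parsed *.metadata.json document and reads the snapshots, schemas, partition-specs, and default-spec-id fields defined by the Iceberg table spec. The function name and the returned keys are hypothetical. The small-files check is left out because it would require reading manifest files.

```python
from typing import Any, Dict


def summarize_iceberg_metadata(metadata: Dict[str, Any]) -> Dict[str, Any]:
    """Summarize a parsed Iceberg table metadata document (illustrative only)."""
    snapshots = metadata.get("snapshots") or []
    schemas = metadata.get("schemas") or []
    specs = metadata.get("partition-specs") or []
    default_spec_id = metadata.get("default-spec-id")

    # Find the spec currently used for writes; an unpartitioned table
    # has a default spec with an empty field list.
    current_spec = next(
        (s for s in specs if s.get("spec-id") == default_spec_id), None
    )
    partitioned = bool(current_spec and current_spec.get("fields"))

    return {
        "snapshot_count": len(snapshots),
        "schema_versions": len(schemas),
        "partitioned": partitioned,
    }
```

In practice the input would come from json.load on the newest metadata file in the table's metadata directory; what counts as "too many" snapshots or schema versions is a threshold the caller would have to choose.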

Delta

  • Commit count: too many commits without a checkpoint slow reads
  • Small files
  • Schema changes
  • Remove/add file ratio
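
The Delta commit and file-ratio checks can be approximated by reading the _delta_log directory directly. The following is a minimal sketch under stated assumptions, not the tool's own code: it relies on the Delta protocol's naming of commits as 20-digit zero-padded <version>.json files and classic checkpoints as <version>.checkpoint.parquet (optionally multi-part), and on each commit file holding one JSON action per line. The function name and result keys are hypothetical, and UUID-named V2 checkpoints are not handled.

```python
import json
import re
from pathlib import Path
from typing import Dict

COMMIT_RE = re.compile(r"^(\d{20})\.json$")
CHECKPOINT_RE = re.compile(r"^(\d{20})\.checkpoint(\.\d+\.\d+)?\.parquet$")


def summarize_delta_log(table_path: Path) -> Dict[str, int]:
    """Count commits, checkpoint lag, and add/remove actions (illustrative only)."""
    versions = []
    last_checkpoint = -1
    adds = removes = 0
    for entry in sorted((table_path / "_delta_log").iterdir()):
        commit = COMMIT_RE.match(entry.name)
        if commit:
            versions.append(int(commit.group(1)))
            # Each non-empty line is one action object keyed by its type.
            for line in entry.read_text().splitlines():
                if not line.strip():
                    continue
                action = json.loads(line)
                if "add" in action:
                    adds += 1
                if "remove" in action:
                    removes += 1
            continue
        checkpoint = CHECKPOINT_RE.match(entry.name)
        if checkpoint:
            last_checkpoint = max(last_checkpoint, int(checkpoint.group(1)))
    return {
        "commits": len(versions),
        "commits_since_checkpoint": sum(v > last_checkpoint for v in versions),
        "adds": adds,
        "removes": removes,
    }
```

The remove/add ratio is then removes divided by adds, guarded against adds being zero.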

Hudi

  • Inflight commits: stuck or failed writes
  • Commit count
  • Operation types (insert vs upsert)
  • Write sizes: too small means unbatched writes
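
Inflight commits and the commit count can be estimated from file names in the .hoodie timeline directory. This sketch is an assumption about how such a check might work, not a description of lake-inspect itself. It assumes the pre-1.0 Hudi timeline layout, where a completed write is <instant>.<action>, pending states are <instant>.<action>.requested and <instant>.<action>.inflight, and a pending commit on a copy-on-write table appears as <instant>.inflight. The function name and result keys are hypothetical.

```python
from pathlib import Path
from typing import Dict, List

WRITE_ACTIONS = {"commit", "deltacommit", "replacecommit"}


def summarize_hudi_timeline(table_path: Path) -> Dict[str, List[str]]:
    """List completed and still-pending write instants (illustrative only)."""
    completed, pending = set(), set()
    for entry in (table_path / ".hoodie").iterdir():
        if not entry.is_file():
            continue
        parts = entry.name.split(".")
        instant = parts[0]
        if not instant.isdigit():
            continue  # skips hoodie.properties and similar files
        if len(parts) == 2 and parts[1] == "inflight":
            action, state = "commit", "inflight"  # legacy commit naming
        elif len(parts) == 2:
            action, state = parts[1], "completed"
        elif len(parts) == 3:
            action, state = parts[1], parts[2]
        else:
            continue
        if action not in WRITE_ACTIONS:
            continue  # ignore clean, rollback, and other table services
        (completed if state == "completed" else pending).add(instant)
    # An instant is stuck only if it never reached the completed state.
    return {
        "completed": sorted(completed),
        "inflight": sorted(pending - completed),
    }
```

A completed write normally leaves its requested and inflight files behind, so pending instants are reported only when no completed file exists for the same instant time.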

Requirements

Python 3.8+
