Skip to main content

Explore the anatomy of your columnar data files (Parquet, Arrow, and more)

Project description

Datanomy

Explore the anatomy of your columnar data files

Datanomy is a terminal-based tool for inspecting and understanding data files. It provides an interactive view of your data's structure, metadata, and internal organization.

Supported formats

  • Parquet (.parquet, .parq)
  • Arrow IPC (.arrow, .feather, .ipc)

Features for Parquet view

General Structure

General Structure

Schema

Schema

Data

Data

Metadata

Metadata

Stats

Stats

Features for Arrow IPC view

Structure

File-level layout showing header, record batches, and footer.

Schema

Arrow schema with per-column type and nullability details.

Data

Preview of the first 50 rows.

Metadata

File and schema-level metadata.

Buffers

Physical buffer layout for each column — validity bitmap bits (color-coded valid/null), hex preview of values, offsets, and data buffers. For nested types (list, struct, map, dictionary) child array buffers are shown recursively.

Installation

# From PyPI
uv tool install datanomy
## with pip
pip install datanomy

# From source
uv tool install "datanomy @ git+https://github.com/raulcd/datanomy.git"
## cloning the repo 
git clone https://github.com/raulcd/datanomy.git
cd datanomy
uv sync

Usage

# Run without installing using uvx
uvx datanomy data.parquet

# Inspect a Parquet file
datanomy data.parquet

# Inspect an Arrow IPC file
datanomy data.arrow

You can also use from source using uvx. This uses the development version:

uvx "git+https://github.com/raulcd/datanomy.git" data.parquet
uvx "git+https://github.com/raulcd/datanomy.git" data.arrow

Keyboard Shortcuts

  • q - Quit the application

Development

# Install dependencies
uv sync

# Run from source
uv run datanomy path/to/file.parquet
uv run datanomy path/to/file.arrow
# Install dev dependencies
uv sync --extra dev

# Run tests
uv run pytest

# Format code
uv run ruff format .

# Lint
uv run ruff check .

# Lint
uv run mypy .

License

Apache License 2.0

Contributing

Contributions welcome! Please open an issue or PR.


Built with Textual and PyArrow

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datanomy-0.3.1.tar.gz (20.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datanomy-0.3.1-py3-none-any.whl (25.7 kB view details)

Uploaded Python 3

File details

Details for the file datanomy-0.3.1.tar.gz.

File metadata

  • Download URL: datanomy-0.3.1.tar.gz
  • Upload date:
  • Size: 20.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.15 {"installer":{"name":"uv","version":"0.11.15","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for datanomy-0.3.1.tar.gz
Algorithm Hash digest
SHA256 ec9c7638b30ee7019ed7eca87e2a33e5f8065437d98e444402e4c690da8cfa09
MD5 57b6ad044eb32c9732f8acd566336246
BLAKE2b-256 cfcba061056fce0e3f23c483e3bd72bde0ce88569ac401c89140b54ac1a2f7d7

See more details on using hashes here.

File details

Details for the file datanomy-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: datanomy-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 25.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.15 {"installer":{"name":"uv","version":"0.11.15","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for datanomy-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 f3fab82c2f699bd19b1b2ba1d26a39bb3827b810f3e966c4fe62e039a64a1d97
MD5 99902e990fa78fdf46a787ebc9f42afe
BLAKE2b-256 85e79e2d75b5654bfea1387ff2753463ef16adb20c504130c932b93a62945942

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page