Skip to main content

Explore the anatomy of your columnar data files (Parquet, Arrow, and more)

Project description

Datanomy

Explore the anatomy of your columnar data files

Datanomy is a terminal-based tool for inspecting and understanding data files. It provides an interactive view of your data's structure, metadata, and internal organization.

Features (MVP)

  • Schema inspection: View column names and types
  • Row group details: See how your data is organized
  • File metadata: Size, row counts, and more
  • Rich terminal UI: Navigate with ease using Textual

Installation

# From PyPI (coming soon)
pip install datanomy

# From source
git clone https://github.com/raulcd/datanomy.git
cd datanomy
uv sync

Usage

# Inspect a Parquet file
datanomy data.parquet

Keyboard Shortcuts

  • q - Quit the application

Development

# Install dependencies
uv sync

# Run from source
uv run datanomy path/to/file.parquet
# Install dev dependencies
uv sync --extra dev

# Run tests
uv run pytest

# Format code
uv run ruff format .

# Lint
uv run ruff check .

# Lint
uv run mypy .

Roadmap

  • Data preview
  • Column statistics
  • Arrow IPC file support
  • Compression details
  • Dictionary encoding info
  • Export reports

License

Apache License 2.0

Contributing

Contributions welcome! Please open an issue or PR.


Built with Textual and PyArrow

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datanomy-0.1.0.tar.gz (3.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datanomy-0.1.0-py3-none-any.whl (4.7 kB view details)

Uploaded Python 3

File details

Details for the file datanomy-0.1.0.tar.gz.

File metadata

  • Download URL: datanomy-0.1.0.tar.gz
  • Upload date:
  • Size: 3.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.7

File hashes

Hashes for datanomy-0.1.0.tar.gz
Algorithm Hash digest
SHA256 3fef58a79951f70cb4ee893620d837183f34bba8acc8af7132a52e7baee42659
MD5 9ef5f8237ed067202040caf50e4fcb1f
BLAKE2b-256 f426085ce0a45bb692715aac42a0fc55ce8afe9a035678418f9b26e4b67175db

See more details on using hashes here.

File details

Details for the file datanomy-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: datanomy-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 4.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.7

File hashes

Hashes for datanomy-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b08cea4b32ab96e6d675a7117bc5891b443990473a42bab2693cacd160614faf
MD5 78a6f1dccb16fd2190916569f8861b1b
BLAKE2b-256 2de52d9883bd95b51abdfb870ac93259cb8023482b34287aec6d97c561fecded

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page