Skip to main content

Python library for datannur catalog metadata management

Project description

datannur logo

MIT License PyPI version Python CI codecov

datannurpy

datannurpy is the Python builder for datannur catalogs: it scans files and databases, extracts metadata and statistics, and exports a ready-to-use catalog.

Key features:

  • Broad format support - CSV, Excel, Parquet, Delta Lake, Iceberg, SAS, SPSS, Stata
  • Database introspection - PostgreSQL, MySQL, Oracle, SQL Server, SQLite, DuckDB
  • Remote & cloud storage - SFTP, S3, Azure Blob, GCS via fsspec
  • Rich metadata - schema, statistics, frequencies, modalities, auto-tagging
  • Incremental scans - only rescan what changed between runs
  • YAML or Python API - declarative configuration or programmatic control

Quick start

pip install datannurpy
# catalog.yml
app_path: ./my-catalog
open_browser: true

add:
  - folder: ./data
    include: ["*.csv", "*.xlsx", "*.parquet"]

  - database: sqlite:///mydb.sqlite
python -m datannurpy catalog.yml

Documentation

📖 Full documentation: docs.datannur.com/builder

🌐 Website: datannur.com

🚀 Demo: dev.datannur.com

Contributing

For development documentation and contributing guidelines, see CONTRIBUTING.md.

License

MIT - see LICENSE. All dependencies are MIT/Apache 2.0/BSD compatible.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datannurpy-0.20.0a1.tar.gz (7.6 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datannurpy-0.20.0a1-py3-none-any.whl (7.6 MB view details)

Uploaded Python 3

File details

Details for the file datannurpy-0.20.0a1.tar.gz.

File metadata

  • Download URL: datannurpy-0.20.0a1.tar.gz
  • Upload date:
  • Size: 7.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for datannurpy-0.20.0a1.tar.gz
Algorithm Hash digest
SHA256 9bf9f2ef034a010d4dbac7cc84bf12100a20d320129dcd6d87fdaf042faa58e6
MD5 4645c18300bab616f5b5eeaed829c031
BLAKE2b-256 00e73b03d578eafcda9753ea381fc606f7f443493a844fa3426855d8492e4b8e

See more details on using hashes here.

Provenance

The following attestation bundles were made for datannurpy-0.20.0a1.tar.gz:

Publisher: release.yml on datannur/datannurpy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file datannurpy-0.20.0a1-py3-none-any.whl.

File metadata

  • Download URL: datannurpy-0.20.0a1-py3-none-any.whl
  • Upload date:
  • Size: 7.6 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for datannurpy-0.20.0a1-py3-none-any.whl
Algorithm Hash digest
SHA256 641e1967ccbfdac0e0543ad5d0e6f0e4a387fced287c1ef3716383665d1cce09
MD5 7b260df999cd437c9b6bb069237e7e4d
BLAKE2b-256 729d3251a585a575affe30917bcf9862695ebcc33c1a05cee70fc119ca08159d

See more details on using hashes here.

Provenance

The following attestation bundles were made for datannurpy-0.20.0a1-py3-none-any.whl:

Publisher: release.yml on datannur/datannurpy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page