Skip to main content

Document dataset metadata. For use in Statistics Norway's metadata system.

Project description

Datadoc

Datadoc Unit tests Code coverage PyPI version Code style: black

Document datasets in Statistics Norway

Usage

DataDoc in use

From Jupyter

  1. Open https://jupyter.dapla-staging.ssb.no or another Jupyter Lab environment
  2. Datadoc comes preinstalled in Statistics Norway environments. Elsewhere, run Run pip install ssb-datadoc to install
  3. Upload a dataset to your Jupyter server (e.g. https://github.com/statisticsnorway/datadoc/blob/master/klargjorte_data/befolkning/person_testdata_p2021-12-31_p2021-12-31_v1.parquet)
  4. Run the demo.ipynb Notebook
  5. Datadoc will open in the notebook

Contributing

Prerequisites

  • Python >=3.10
  • Poetry, install via curl -sSL https://install.python-poetry.org | python3 -

Dependency Management

Poetry is used for dependency management. Poe the Poet is used for running poe tasks within poetry's virtualenv. Upon cloning this project first install necessary dependencies, then run the tests to verify everything is working.

Install all dependencies

poetry install --all-extras

Add dependencies

Main

poetry add <python package name>

Dev

poetry add --group dev <python package name>

Run tests

poetry run poe test

Run project locally

To run the project locally:

poetry run poe datadoc

Run project locally in Jupyter

To run the project locally in Jupyter run:

poetry run poe jupyter

A Jupyter instance should open in your browser. Open and run the cells in the .ipynb file to demo datadoc.

Running the Dockerized Application Locally

docker run -p 8050:8050 \
-v $HOME/.config/gcloud/application_default_credentials.json/:/application_default_credentials.json \
-e GOOGLE_APPLICATION_CREDENTIALS="/application_default_credentials.json" \
datadoc

Bump version

poetry run poe bump-patch-version

:warning: Run this on the default branch

This command will:

  1. Increment version strings in files
  2. Commit the changes
  3. Tag the commit with the new version

Then just run git push origin --tags to push the changes and trigger the release process.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ssb_datadoc-0.3.0.tar.gz (323.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ssb_datadoc-0.3.0-py3-none-any.whl (339.3 kB view details)

Uploaded Python 3

File details

Details for the file ssb_datadoc-0.3.0.tar.gz.

File metadata

  • Download URL: ssb_datadoc-0.3.0.tar.gz
  • Upload date:
  • Size: 323.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.10.2 Linux/5.15.0-1041-azure

File hashes

Hashes for ssb_datadoc-0.3.0.tar.gz
Algorithm Hash digest
SHA256 bd3ebb040c320f6b86c59988eb4b6d8a59e1b72b7815bb2ddc0c088a54cbf44f
MD5 04bd7ccf1cec824a3033a00d1828072f
BLAKE2b-256 28e299896796619eafa7af8f4be00a9a29ad96d6bfcb4f9fb61072a110794849

See more details on using hashes here.

File details

Details for the file ssb_datadoc-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: ssb_datadoc-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 339.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.10.2 Linux/5.15.0-1041-azure

File hashes

Hashes for ssb_datadoc-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 269cddd319f46451be68c679ea53b3b0a52735097a0db0980f7f33db316b8da9
MD5 4a36cfdf334beb250536e3085eaa1aa2
BLAKE2b-256 7329746ee11a22008e2a8addd48510dbd3b35870cbdc8bdae8321ff85ee8bc37

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page