Document dataset metadata. For use in Statistics Norway's metadata system.

Project description

Datadoc

Document datasets in Statistics Norway

Usage

From Jupyter

  1. Open https://jupyter.dapla-staging.ssb.no or another Jupyter Lab environment
  2. Datadoc comes preinstalled in Statistics Norway environments. Elsewhere, run pip install ssb-datadoc to install it
  3. Upload a dataset to your Jupyter server (e.g. https://github.com/statisticsnorway/datadoc/blob/master/klargjorte_data/befolkning/person_testdata_p2021-12-31_p2021-12-31_v1.parquet)
  4. Run the demo.ipynb Notebook
  5. Datadoc will open in the notebook
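
Before running the notebook, it can help to confirm that the package from step 2 is actually present in the current environment. The following is a minimal sketch using only the Python standard library; the helper name installed_version is hypothetical and not part of Datadoc.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # "ssb-datadoc" is the distribution name used in the pip command above.
    found = installed_version("ssb-datadoc")
    if found is None:
        print("ssb-datadoc is not installed; run: pip install ssb-datadoc")
    else:
        print(f"ssb-datadoc {found} is installed")
```

Run it in a notebook cell or a terminal in the same environment that Jupyter uses, since a different environment may report a different result.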

Contributing

Prerequisites

  • Python >=3.10
  • Poetry, install via curl -sSL https://install.python-poetry.org | python3 -
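
The Python prerequisite can be checked programmatically before installing anything. This is a small sketch, assuming only the >=3.10 requirement listed above; the names MINIMUM_PYTHON and meets_minimum are illustrative.

```python
import sys

# Minimum version taken from the prerequisites list above.
MINIMUM_PYTHON = (3, 10)


def meets_minimum(version_info=sys.version_info, minimum=MINIMUM_PYTHON) -> bool:
    """Compare the (major, minor) part of a version against the minimum."""
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    current = ".".join(str(part) for part in sys.version_info[:3])
    if meets_minimum():
        print(f"Python {current} satisfies the requirement")
    else:
        print(f"Python {current} is too old; 3.10 or newer is required")
```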

Dependency Management

Poetry is used for dependency management. Poe the Poet is used to run tasks within Poetry's virtualenv. After cloning the project, first install the necessary dependencies, then run the tests to verify that everything works.

Install all dependencies

poetry install --all-extras

Add dependencies

Main

poetry add <python package name>

Dev

poetry add --group dev <python package name>

Run tests

poetry run poe test

Run project locally

To run the project locally:

poetry run poe datadoc

Run project locally in Jupyter

To run the project locally in Jupyter, run:

poetry run poe jupyter

A Jupyter instance should open in your browser. Open and run the cells in the .ipynb file to demo datadoc.

Running the Dockerized Application Locally

docker run -p 8050:8050 \
-v "$HOME/.config/gcloud/application_default_credentials.json:/application_default_credentials.json" \
-e GOOGLE_APPLICATION_CREDENTIALS="/application_default_credentials.json" \
datadoc
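
To make the parts of that command easier to see, here is a sketch that assembles the same argument list in Python. It only builds the list and does not start a container. The function name docker_run_args and its parameters are hypothetical; the port, mount target, environment variable, and image name datadoc are taken from the command above.

```python
from pathlib import Path

# Path inside the container, as used in the command above.
CONTAINER_CREDENTIALS = "/application_default_credentials.json"


def docker_run_args(credentials, image="datadoc", port=8050) -> list:
    """Build the `docker run` argument list for a given credentials file."""
    return [
        "docker", "run",
        "-p", f"{port}:{port}",                       # publish the app port
        "-v", f"{credentials}:{CONTAINER_CREDENTIALS}",  # mount the credentials file
        "-e", f"GOOGLE_APPLICATION_CREDENTIALS={CONTAINER_CREDENTIALS}",
        image,
    ]


if __name__ == "__main__":
    default = Path.home() / ".config" / "gcloud" / "application_default_credentials.json"
    print(" ".join(docker_run_args(default)))
```

The resulting list could be passed to subprocess.run if you prefer to launch the container from a script.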

Release process

Run the relevant version command on a branch, e.g. one of:

poetry version patch
poetry version minor

Commit with a message like Bump version x.x.x -> y.y.y.
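
As an illustration of what the two version commands do to a three-part version number, and of the commit message format, here is a minimal sketch. It mimics the patch and minor bumps for plain x.y.z versions only and is not Poetry's implementation; the function names are hypothetical.

```python
def bump(version: str, part: str) -> str:
    """Bump a plain x.y.z version the way `patch` and `minor` bumps behave."""
    major, minor, patch = (int(piece) for piece in version.split("."))
    if part == "patch":
        patch += 1
    elif part == "minor":
        # A minor bump resets the patch number to zero.
        minor, patch = minor + 1, 0
    else:
        raise ValueError(f"unsupported bump rule: {part!r}")
    return f"{major}.{minor}.{patch}"


def commit_message(old: str, new: str) -> str:
    """Format the commit message suggested in the release process."""
    return f"Bump version {old} -> {new}"


if __name__ == "__main__":
    old = "0.3.1"
    print(commit_message(old, bump(old, "patch")))
    print(commit_message(old, bump(old, "minor")))
```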

Open and merge a PR.

Use GitHub to tag and release.

Download files

Download the file for your platform.

Source Distribution

ssb_datadoc-0.3.1.tar.gz (324.0 kB)

Uploaded Source

Built Distribution

ssb_datadoc-0.3.1-py3-none-any.whl (339.4 kB)

Uploaded Python 3

File details

Details for the file ssb_datadoc-0.3.1.tar.gz.

File metadata

  • Download URL: ssb_datadoc-0.3.1.tar.gz
  • Upload date:
  • Size: 324.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.10.2 Linux/5.15.0-1041-azure

File hashes

Hashes for ssb_datadoc-0.3.1.tar.gz

Algorithm    Hash digest
SHA256       c65ab9914ee58afc9ebc7b2c45aa66bc82b74bff229af87a2e0277bed4f9b758
MD5          e828ab5b8a622c23739da4a0c77142af
BLAKE2b-256  30cf06916cba2185e1ac85563ddcd33a947d192150cb7d3394a13830b52ef05e

File details

Details for the file ssb_datadoc-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: ssb_datadoc-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 339.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.10.2 Linux/5.15.0-1041-azure

File hashes

Hashes for ssb_datadoc-0.3.1-py3-none-any.whl

Algorithm    Hash digest
SHA256       c93f48265a57b4962527746b5795a1100e69bf3ab78140bf1e85d17b1a4f0a8d
MD5          42288c3a317076daf8d740c9b30398b0
BLAKE2b-256  9dbff45573fd9e2cf358202fea1bb4903e995619b6d7403b5866ad8fb795f17b
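
A downloaded file can be checked against the published SHA256 digests with the standard library. The sketch below assumes the file keeps its original name on disk; the helper names sha256_of and matches_published_hash are hypothetical.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the "File hashes" listings above.
EXPECTED_SHA256 = {
    "ssb_datadoc-0.3.1.tar.gz":
        "c65ab9914ee58afc9ebc7b2c45aa66bc82b74bff229af87a2e0277bed4f9b758",
    "ssb_datadoc-0.3.1-py3-none-any.whl":
        "c93f48265a57b4962527746b5795a1100e69bf3ab78140bf1e85d17b1a4f0a8d",
}


def sha256_of(path, chunk_size: int = 65536) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path) -> bool:
    """True only if the file name is listed and its digest matches."""
    expected = EXPECTED_SHA256.get(Path(path).name)
    return expected is not None and sha256_of(path) == expected
```

For example, matches_published_hash("ssb_datadoc-0.3.1.tar.gz") should return True for an intact download and False for a corrupted or renamed file.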
