Skip to main content

NUDB Use - is a usage-package for the Norwegian National Education Database cloud-data. Both for data-consumers and data-deliverers

Project description

SSB-NUDB-USE

PyPI Status Python Version License

Documentation Tests Coverage Quality Gate Status

pre-commit Black Ruff Poetry

Description

NUDB is the National Education Database of Norway. It is operated by Statsitics Norway - section 360. This package is the main "usage-package" for those seeking to use NUDB-data, or deliver data to NUDB.

NUDBs data is kept as parquet files in GCP, and you will need seperate access to this data to utilize this package. Some features in this package might require access to other data, like BRREG (Brønnøysundregisteret), BOF (befolkningsregisteret), BOF (virksomhetsregisteret) etc.

Installation

You can install SSB Nudb Use via poetry from PyPI:

poetry add ssb-nudb-use

Dependencies

This package depends on the package "ssb-nudb-config", which contains metadata, but also points to content in other metadatasystems like Vardef, Klass and Datadoc.

Usage

Please see the Reference Guide for details.

Usage for extraction (data from NUDB)

Find the latest of each file shared.

from nudb_use import latest_shared_paths
latest_shared_paths()

Get the periods out of any paths following the SSB-naming standard.

from nudb_use import get_periods_from_path
get_periods_from_path(path)

Deriving variables not stored in data, is done by the derive module:

from nudb_use import derive
df = derive.utd_skoleaar_slutt(df)

Usage for delivery (data to NUDB)

We have renamed a lot of our variables transitioning from the old on-prem systems. If you are looking for the new or old names of variables, you can use the find_var or find_vars functions:

from nudb_use import find_vars
find_vars(["snr", "sosbak"])

Find the dtype and length (char-width) of strings using a dataeset name:

from nudb_use import look_up_dtype_length_for_dataset
print(look_up_dtype_length_for_dataset("igang_videregaaende"))

If you want to update the column names you have in a pandas dataframe, to the new column names - there's a function for that:

from nudb_use import update_colnames
df = update_colnames(df)

After renaming, you can get the pandas dtypes the columns should have with get_dtypes:

from nudb_use import get_dtypes
dtypes = get_dtypes(df)
df = df.astype(dtypes)

If you are delivering to NUDB, we want you to run our quality suite before sharing the data with us:

from nudb_use import run_quality_suite
run_quality_suite(df, "avslutta")

Data about your delivery, like "avslutta", should first have its data entered into, and released in the ssb-nudb-config package before available in this function. Contact the NUDB-team to define a new delivery.

Contributing

Contributions are very welcome. To learn more, see the Contributor Guide.

License

Distributed under the terms of the MIT license, SSB Nudb Use is free and open source software.

Issues

If you encounter any problems, please file an issue along with a detailed description.

Credits

This project was generated from Statistics Norway's SSB PyPI Template.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ssb_nudb_use-2026.5.6.tar.gz (97.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ssb_nudb_use-2026.5.6-py3-none-any.whl (139.7 kB view details)

Uploaded Python 3

File details

Details for the file ssb_nudb_use-2026.5.6.tar.gz.

File metadata

  • Download URL: ssb_nudb_use-2026.5.6.tar.gz
  • Upload date:
  • Size: 97.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for ssb_nudb_use-2026.5.6.tar.gz
Algorithm Hash digest
SHA256 448d56c08c59aebb88ef24d226305a02f027d3e7ff29ce7ffd7d0a5cc0b6d45e
MD5 777801baa8276c16d0d50360d71b9257
BLAKE2b-256 4c4ba5498c510c78dc254b0c53f36ad7769f004b8f9d90be8ddaa65a5e903e1e

See more details on using hashes here.

Provenance

The following attestation bundles were made for ssb_nudb_use-2026.5.6.tar.gz:

Publisher: release.yml on statisticsnorway/ssb-nudb-use

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file ssb_nudb_use-2026.5.6-py3-none-any.whl.

File metadata

  • Download URL: ssb_nudb_use-2026.5.6-py3-none-any.whl
  • Upload date:
  • Size: 139.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for ssb_nudb_use-2026.5.6-py3-none-any.whl
Algorithm Hash digest
SHA256 04c8b875748888503524fc4337f825ef8e2988ee809954d7e6a80786db730887
MD5 ab2c6d39f091c87d4135b80d896397a6
BLAKE2b-256 0948e6f3e13a657c6e6420ace82a3d3a4abb89c208312dcadb705c5704dce349

See more details on using hashes here.

Provenance

The following attestation bundles were made for ssb_nudb_use-2026.5.6-py3-none-any.whl:

Publisher: release.yml on statisticsnorway/ssb-nudb-use

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page