Skip to main content

A library for loading datasets and models whose metadata is provided in the DCAT-AP format.

Project description

DCAT-AP Hub

This library enables easy downloading and loading of datasets and models whose metadata is provided in the DCAT-AP format. Currently, only JSON-LD is supported.

How To Install

# Base install (datasets, processing)
pip install dcat-ap-hub

# Install with ONNX model loading support
pip install "dcat-ap-hub[onnx]"

# Install with Hugging Face model loading support
pip install "dcat-ap-hub[huggingface]"

How To Load Datasets

from dcat_ap_hub import Dataset

url = "https://data.europa.eu/api/hub/repo/datasets/7b715249-0c76-4592-9df6-f36b9a47f6e5.jsonld"

ds = Dataset.from_url(url)
files = ds.download(data_dir="./data")

How To Load SKLearn Models

from dcat_ap_hub import Dataset

url = "https://ki-daten.hlrs.de/hub/repo/datasets/6f75de8b60a9f8a2fdf7b69cbd86d9e64bcb3837.jsonld"

ds = Dataset.from_url(url)
ds.download(data_dir="./data")
model = ds.load_model(model_dir="./models")

How To Load Huggingface Models

from dcat_ap_hub import Dataset

url = "https://ki-daten.hlrs.de/hub/repo/datasets/6f75de8b60a9f8a2fdf7b69cbd86d9e64bcb3837.jsonld"

ds = Dataset.from_url(url)
ds.download(data_dir="./data")
model, processor, metadata = ds.load_model(model_dir="./models")

How To Process Datasets If Supported

from dcat_ap_hub import Dataset

url = "https://data.europa.eu/api/hub/repo/datasets/7b715249-0c76-4592-9df6-f36b9a47f6e5.jsonld"

ds = Dataset.from_url(url)
ds.download(data_dir="./data")
files = ds.process(processed_dir="./processed")

Funding

This project was developed using resources from the HammerHAI project, an EU co-funded AI Factory initiative operated by the High-Performance Computing Center Stuttgart and supported by the European Commission as well as German federal and state ministries. It is funded by the European High Performance Computing Joint Undertaking under Grant Agreement No. 101234027.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dcat_ap_hub-0.1.3.tar.gz (137.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dcat_ap_hub-0.1.3-py3-none-any.whl (21.5 kB view details)

Uploaded Python 3

File details

Details for the file dcat_ap_hub-0.1.3.tar.gz.

File metadata

  • Download URL: dcat_ap_hub-0.1.3.tar.gz
  • Upload date:
  • Size: 137.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.9

File hashes

Hashes for dcat_ap_hub-0.1.3.tar.gz
Algorithm Hash digest
SHA256 f42c3880c55e2b780a619dcf5760591560c3f233447242b75a22dda5d5adf7ad
MD5 320a2dd8850d45d646975e6c3f6bbf98
BLAKE2b-256 d85ab8aa52c3b624b4119f5a62553f7aac3e991dd4b2d8144a1749231d06a156

See more details on using hashes here.

File details

Details for the file dcat_ap_hub-0.1.3-py3-none-any.whl.

File metadata

File hashes

Hashes for dcat_ap_hub-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 da8a2f4cf0201d6b716cf9886ee98767456a99a56a69f53dabfeb6ec79cdf475
MD5 072831bbce502a5023923fe32fefb8af
BLAKE2b-256 362a3e9f335bb3efecaf50f671778f23086c3fb64d0f0106d3e1ae260ea839a3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page