
A library for loading datasets and models whose metadata is provided in the DCAT-AP format.


DCAT-AP Hub

This library enables easy downloading and loading of datasets and models whose metadata is provided in the DCAT-AP format. Currently, only JSON-LD is supported.
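To make the metadata format concrete, here is a minimal sketch of reading download links from a DCAT-AP JSON-LD document with only the standard library. The sample record, its compact `dcat:` keys, and the helper name `list_download_urls` are illustrative assumptions. Real portals may serialize the same metadata differently (for example with expanded IRIs), and this is not a description of how dcat-ap-hub works internally.

```python
import json

# Hypothetical, heavily simplified DCAT-AP record in compact JSON-LD form.
SAMPLE_JSONLD = """
{
  "@graph": [
    {"@id": "https://example.org/dataset/1", "@type": "dcat:Dataset",
     "dct:title": "Example dataset"},
    {"@id": "https://example.org/dist/1", "@type": "dcat:Distribution",
     "dcat:downloadURL": {"@id": "https://example.org/files/data.csv"}},
    {"@id": "https://example.org/dist/2", "@type": "dcat:Distribution",
     "dcat:accessURL": {"@id": "https://example.org/landing-page"}}
  ]
}
"""


def list_download_urls(jsonld_text: str) -> list[str]:
    """Return the download URL of every distribution node that has one."""
    doc = json.loads(jsonld_text)
    urls = []
    for node in doc.get("@graph", []):
        # "@type" may be a single string or a list of strings.
        types = node.get("@type", [])
        if isinstance(types, str):
            types = [types]
        if "dcat:Distribution" not in types:
            continue
        # The URL may be given as {"@id": "..."} or as a plain string.
        target = node.get("dcat:downloadURL")
        if isinstance(target, dict):
            target = target.get("@id")
        if target:
            urls.append(target)
    return urls


print(list_download_urls(SAMPLE_JSONLD))
```

The second distribution is skipped because it only carries an access URL, not a download URL.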

How To Install

# Base install (datasets, processing)
pip install dcat-ap-hub

# Install with ONNX model loading support
pip install "dcat-ap-hub[onnx]"

# Install with Hugging Face model loading support
pip install "dcat-ap-hub[huggingface]"
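After installing, it can be useful to check which optional backends are importable. The sketch below assumes that the `onnx` extra provides the `onnxruntime` module and the `huggingface` extra provides `transformers`. That mapping is a guess and is not confirmed by this page, so adjust it to match the package's actual dependencies.

```python
from importlib.util import find_spec

# Assumed mapping from extra name to the top-level module it installs.
ASSUMED_EXTRA_MODULES = {
    "onnx": "onnxruntime",
    "huggingface": "transformers",
}


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be found without importing it."""
    return find_spec(module_name) is not None


def available_extras() -> dict[str, bool]:
    """Report, per assumed extra, whether its module is importable."""
    return {
        extra: is_installed(module)
        for extra, module in ASSUMED_EXTRA_MODULES.items()
    }


print(available_extras())
```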

How To Load Datasets

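from dcat_ap_hub import Dataset

# JSON-LD metadata record of the dataset
url = "https://data.europa.eu/api/hub/repo/datasets/7b715249-0c76-4592-9df6-f36b9a47f6e5.jsonld"

# Read the metadata, then download the dataset's files into ./data
ds = Dataset.from_url(url)
files = ds.download(data_dir="./data")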
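The type of the value returned by `download` is not documented on this page, so one way to see what arrived is to inspect the target directory directly. This is a small sketch using only `pathlib`. The helper name `list_files` is hypothetical and not part of dcat-ap-hub.

```python
from pathlib import Path


def list_files(data_dir: str) -> list[Path]:
    """Return all regular files below data_dir, sorted by path."""
    root = Path(data_dir)
    if not root.is_dir():
        return []
    return sorted(p for p in root.rglob("*") if p.is_file())


# Print each downloaded file with its size in bytes.
for path in list_files("./data"):
    print(path, path.stat().st_size, "bytes")
```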

How To Load scikit-learn Models

from dcat_ap_hub import Dataset

# JSON-LD metadata record that describes the model
url = "https://ki-daten.hlrs.de/hub/repo/datasets/6f75de8b60a9f8a2fdf7b69cbd86d9e64bcb3837.jsonld"

ds = Dataset.from_url(url)
ds.download(data_dir="./data")
# Load the model described by the metadata
model = ds.load_model(model_dir="./models")

How To Load Hugging Face Models

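from dcat_ap_hub import Dataset

# JSON-LD metadata record that describes the model
url = "https://ki-daten.hlrs.de/hub/repo/datasets/6f75de8b60a9f8a2fdf7b69cbd86d9e64bcb3837.jsonld"

ds = Dataset.from_url(url)
ds.download(data_dir="./data")
# For Hugging Face models, load_model returns the model, its processor, and metadata
model, processor, metadata = ds.load_model(model_dir="./models")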
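In the two examples above, `load_model` yields a single object in one case and a three-part tuple in the other. Code that has to handle both could normalize the result first. The sketch below is one possible approach. `LoadedModel` and `normalize_load_result` are hypothetical helpers, and the stand-in values take the place of real return values from `ds.load_model(...)`.

```python
from typing import Any, NamedTuple, Optional


class LoadedModel(NamedTuple):
    """Uniform container for either style of load_model result."""
    model: Any
    processor: Optional[Any] = None
    metadata: Optional[Any] = None


def normalize_load_result(result: Any) -> LoadedModel:
    """Wrap a bare model or a (model, processor, metadata) tuple."""
    if isinstance(result, tuple) and len(result) == 3:
        return LoadedModel(*result)
    return LoadedModel(model=result)


# Stand-in values; in practice `result` would come from ds.load_model(...).
print(normalize_load_result("sklearn-model"))
print(normalize_load_result(("hf-model", "hf-processor", {"task": "demo"})))
```

Downstream code can then always read `loaded.model` and check whether `loaded.processor` is `None`.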

How To Process Datasets If Supported

from dcat_ap_hub import Dataset

# JSON-LD metadata record of the dataset
url = "https://data.europa.eu/api/hub/repo/datasets/7b715249-0c76-4592-9df6-f36b9a47f6e5.jsonld"

ds = Dataset.from_url(url)
ds.download(data_dir="./data")
# Process the downloaded files; results are written to ./processed
files = ds.process(processed_dir="./processed")

Funding

This project was developed using resources from the HammerHAI project, an EU co-funded AI Factory initiative operated by the High-Performance Computing Center Stuttgart and supported by the European Commission as well as German federal and state ministries. It is funded by the European High Performance Computing Joint Undertaking under Grant Agreement No. 101234027.


