CKAN extension automatically generating CSVW metadata for uploaded textual tabular data and importing the first table into the CKAN datastore.
Project description
ckanext-csvtocsvw
Extension automatically generating csvw metadata for uploaded textual tabular data. It uploads the data of the first table documented into a datastore for the source csv file. should be used as replacement for datapusher
Requirements
Needs a running instance of the CSVToCSVW Application. Point at it through env variables. Also needed is a Api Token for an account with the right privaledges to make the background job work on private datasets and ressources.
CKANINI__CKANEXT__CSVTOCSVW_URL=http://${CSVTOCSVW_HOST}:${CSVTOCSVW_APP_PORT}
CKANINI__CKANEXT__CSVTOCSVW__CKAN_TOKEN=${CKAN_API_TOKEN}
You can set the default formats to annotate by setting the env variable CSVTOCSVW_FORMATS for example
CKANINI__CKANEXT__CSVTOCSVW__FORMATS="csv txt asc"
else it will react to the following formats: "csv", "txt", "asc", "tsv"
If you need to process files that are not hosted through https (CKAN is not ssl configured), you can disable ssl verification
CKANINI__CKANEXT__CSVTOCSVW__SSL_VERIFY=False
Purpose
Reacts to CSV files uploaded. DEFAULT_FORMATS are "csv; txt" It creates two to sites for each resource.
- /annotate creates CSVW annotation file for a CSV in json-ld format named <csv_filename>-metadata.json, uploades table-1 to ckan datastore o u can explorer it with recline views
- /transform utilizes CSVW metadata to transform the whole content of the csv file to rdf, output is <csv_filename>.ttl The plugins default behavior includes a trigger to csv file uploads, so it runs annotation automatically on upload. The transformation is a bonus feature and outputs standard tabular data as mentioned in the CSVW documentation of the W3C. It must be triggered manually.
Compatibility with core CKAN versions:
| CKAN version | Compatible? |
|---|---|
| 2.9 and earlier | not tested |
| 2.10 | ✓ CI tested |
| 2.11 | ✓ CI tested |
Installation
To install the extension:
- Activate your CKAN virtual environment, for example:
. /usr/lib/ckan/default/bin/activate
- Use pip to install package
pip install ckanext-csvtocsvw
-
Add
csvtocsvwto theckan.pluginssetting in your CKAN config file (by default the config file is located at/etc/ckan/default/ckan.ini). -
Restart CKAN. For example, if you've deployed CKAN with Apache on Ubuntu:
sudo service apache2 reload
Developer installation
To install ckanext-csvtocsvw for development, activate your CKAN virtualenv and do:
git clone https://github.com/Mat-O-Lab/ckanext-csvtocsvw.git
cd ckanext-csvtocsvw
pip install -e ".[dev]"
Tests
To run the tests, do:
pytest --ckan-ini=test.ini
Citation
If you use this software, please cite it. GitHub shows a "Cite this repository" button (top right of the repo page) that exports the CITATION.cff in APA or BibTeX format.
After the first Zenodo release, a DOI-specific BibTeX entry will be available on the Zenodo record. Until then:
@software{hanke_ckanext_csvtocsvw,
author = {Hanke, Thomas},
title = {ckanext-csvtocsvw},
url = {https://github.com/Mat-O-Lab/ckanext-csvtocsvw},
license = {AGPL-3.0-or-later},
}
License
Acknowledgments
The authors would like to thank the Federal Government and the Heads of Government of the Länder for their funding and support within the framework of the Platform Material Digital consortium. Funded by the German Federal Ministry of Education and Research (BMBF) through the MaterialDigital Call in Project KupferDigital - project id 13XP5119.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ckanext_csvtocsvw-1.0.4.tar.gz.
File metadata
- Download URL: ckanext_csvtocsvw-1.0.4.tar.gz
- Upload date:
- Size: 41.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7a9ba7657db98e4f7e3fffc7a3d86e8e307675b2f08deeb1d251fb0fce19765e
|
|
| MD5 |
b21655d10e0192be6e450f6314159744
|
|
| BLAKE2b-256 |
6e46a82d68b49526d0d2f459d6f45747a4285b113d04dec77b22e6ad318233b1
|
Provenance
The following attestation bundles were made for ckanext_csvtocsvw-1.0.4.tar.gz:
Publisher:
pypi.yml on Mat-O-Lab/ckanext-csvtocsvw
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
ckanext_csvtocsvw-1.0.4.tar.gz -
Subject digest:
7a9ba7657db98e4f7e3fffc7a3d86e8e307675b2f08deeb1d251fb0fce19765e - Sigstore transparency entry: 1132428292
- Sigstore integration time:
-
Permalink:
Mat-O-Lab/ckanext-csvtocsvw@e02583afd1d0d9f3e9b3dd3bb2b8d76dc617aa16 -
Branch / Tag:
refs/tags/v1.0.4 - Owner: https://github.com/Mat-O-Lab
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
pypi.yml@e02583afd1d0d9f3e9b3dd3bb2b8d76dc617aa16 -
Trigger Event:
release
-
Statement type:
File details
Details for the file ckanext_csvtocsvw-1.0.4-py3-none-any.whl.
File metadata
- Download URL: ckanext_csvtocsvw-1.0.4-py3-none-any.whl
- Upload date:
- Size: 42.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c4ba21dcb02db99d40b362fa197708f98a440eaf66deabecfc079e83939279fc
|
|
| MD5 |
b91ddb9f0134be0459ccf3a645f8b040
|
|
| BLAKE2b-256 |
ab0bd3b00a10f37655bde9caf44fe601c49143ceb1df9e39fb49589e5f2b3406
|
Provenance
The following attestation bundles were made for ckanext_csvtocsvw-1.0.4-py3-none-any.whl:
Publisher:
pypi.yml on Mat-O-Lab/ckanext-csvtocsvw
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
ckanext_csvtocsvw-1.0.4-py3-none-any.whl -
Subject digest:
c4ba21dcb02db99d40b362fa197708f98a440eaf66deabecfc079e83939279fc - Sigstore transparency entry: 1132428337
- Sigstore integration time:
-
Permalink:
Mat-O-Lab/ckanext-csvtocsvw@e02583afd1d0d9f3e9b3dd3bb2b8d76dc617aa16 -
Branch / Tag:
refs/tags/v1.0.4 - Owner: https://github.com/Mat-O-Lab
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
pypi.yml@e02583afd1d0d9f3e9b3dd3bb2b8d76dc617aa16 -
Trigger Event:
release
-
Statement type: