A library for mapping CKAN metadata <=> Frictionless metadata.
Project description
Frictionless CKAN Mapper
A library for mapping CKAN metadata <=> Frictionless metadata.
The library has zero dependencies (not even on Data Package libs). You can use it directly or use it for inspiration. Detailed outline of the algorithm is in the docs or you can read the code.
Installation
- Python: install Python. The library is compatible with both Python 2.7+ and Python 3.3+.
pip install frictionless-ckan-mapper
Note: The package is installed as frictionless-ckan-mapper
and then imported as frictionless_ckan_mapper
.
Getting started
CKAN => Frictionless
# get a CKAN metadata item
ckan_dataset = {
"name": "my-dataset",
"title": "My awesome dataset",
"url": "http://www.example.com/data.csv"
}
# or load from an API e.g.
# ckan_dataset = json.load(urllib.urlopen(
# https://demo.ckan.org/api/3/package_show?id=my_dataset
# ))
from frictionless_ckan_mapper import ckan_to_frictionless as converter
# convert to frictionless
frictionless_package = converter.dataset(ckan_dict)
print(frictionless_package)
Frictionless => CKAN
frictionless = {
'name': "f11s-dataset",
'path': "https://datahub.io/data.csv"
}
from frictionless_ckan_mapper import frictionless_to_ckan as f2c
ckanout = f2c.dataset(frictionless)
print(ckanout)
Reference
This package contains two modules:
frictionless_to_ckan
ckan_to_frictionless
You can import them directly like so:
from frictionless_ckan_mapper import ckan_to_frictionless
from frictionless_ckan_mapper import frictionless_to_ckan
ckan_to_frictionless
resource(ckandict)
from frictionless_ckan_mapper import ckan_to_frictionless as converter
# ... Some code with a CKAN dictionary ...
output_frictionless_dict = converter.resource(ckan_dictionary)
dataset(ckandict)
from frictionless_ckan_mapper import ckan_to_frictionless as converter
# ... Some code with a CKAN dictionary ...
output_frictionless_dict = converter.dataset(ckan_dictionary)
frictionless_to_ckan
resource(fddict)
from frictionless_ckan_mapper import frictionless_to_ckan as converter
# ... Some code with a Frictionless dictionary ...
output_ckan_dict = converter.resource(frictionless_dictionary)
package(fddict)
from frictionless_ckan_mapper import frictionless_to_ckan as converter
# ... Some code with a Frictionless dictionary ...
output_ckan_dict = converter.package(frictionless_dictionary)
Design
Frictionless <=> CKAN
--------------------------------------
Data Package <=> Package (Dataset)
Data Resource <=> Resource
Table Schema <=> Data Dictionary?? (datastore resources can have schemas)
CKAN reference
Summary:
- Class diagram below of key objects (without attributes)
- Objects with their attributes in this spreadsheet: https://docs.google.com/spreadsheets/d/1XdqGTFni5Jfs8AMbcbfsP7m11h9mOHS0eDtUZtqGVSg/edit#gid=1925460244
classDiagram
class Package
class Resource
class DataDictionary
Package *-- Resource
Resource o-- DataDictionary
Source for CKAN metadata structure:
- Dataset (Package): https://docs.ckan.org/en/2.8/api/index.html#ckan.logic.action.create.package_create
- Resource: https://docs.ckan.org/en/2.8/api/index.html#ckan.logic.action.create.resource_create
Algorithm: CKAN => Frictionless
For Package and Resource
- Expand extras into the dict.
- Map those attributes we have known mappings for (and parse items with values e.g. a schema will be un-jsonified).
- Drop anything we explicitly drop, e.g.
package_id
on a resource object. - Copy everything else over as is.
Optional extras:
- Add a profile field
Algorithm: Frictionless => CKAN
Developers
Install the source
-
Clone the repo:
git clone https://github.com/frictionlessdata/frictionless-ckan-mapper.git
-
And install it with pip:
pip install -e .
Run the tests
Use the excellent pytest
suite as follows:
pytest tests
To test under both Python 2 and Python 3 environments, we use tox
. You can run the following command:
make test
Note: Make sure that the necessary Python versions are in your environment PATH
(Python 2.7 and Python 3.6).
Building and publishing the package
To see a list of available commands from the Makefile
, execute:
make list
Build the distribution package
make dist
Alternatively, this command will accomplish the same to build packages for both Python 2 and Python 3:
python setup.py sdist bdist_wheel --universal
Test the package at test.pypy.org
python -m twine upload --repository testpypi dist/*
The package will be publicly available at https://test.pypi.org/project/frictionless-ckan-mapper/ and you will be able to pip install
it as usual.
Tag a new Git release and publish to PyPi
Note: You will need to publish from your personal PyPi account and be given access as a contributor to the project.
Make sure to update the version of the package in the file frictionless_ckan_mapper/VERSION
. Then:
make release
You can quickly review the version to release with make version
, which will print the current version stored in VERSION
.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for frictionless-ckan-mapper-1.0.2.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | cb4651f6cf5fe4f9384c3cf9b9c8fcb7594c225ecfa5f27c5d35a0e6b89fa323 |
|
MD5 | aba5804bf1c20aa852d8542721418e70 |
|
BLAKE2b-256 | 45cabdc1e3400a7a912b636a189db3b14b57ef7bdc07d580e69b4d8e7458cb90 |
Hashes for frictionless_ckan_mapper-1.0.2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 063610a0fd57d7b96e3d89813cbc4db6a1e7c74646730401222915d4965b1e10 |
|
MD5 | 77da8416332cbf79a83b55a180e77458 |
|
BLAKE2b-256 | cf8e86a5f31f407e1794bf3362867ecf5800513fb942af0e5c93368f89ec5115 |