A generic pipeline that can be used to map raw data to RDF.
data2rdf
A pipeline for generating RDF representations of raw data provided in ASCII, CSV, JSON or Excel format.
https://data2rdf.readthedocs.io/en/latest/
Coverage Report
File | Stmts | Miss | Cover |
---|---|---|---|
data2rdf | | | |
__init__.py | 5 | 0 | 100% |
config.py | 19 | 0 | 100% |
utils.py | 8 | 1 | 88% |
warnings.py | 2 | 0 | 100% |
data2rdf/models | | | |
__init__.py | 3 | 0 | 100% |
base.py | 54 | 5 | 91% |
graph.py | 100 | 7 | 93% |
mapping.py | 39 | 1 | 97% |
utils.py | 58 | 14 | 76% |
data2rdf/modes | | | |
__init__.py | 4 | 0 | 100% |
data2rdf/parsers | | | |
__init__.py | 6 | 0 | 100% |
base.py | 135 | 11 | 92% |
csv.py | 168 | 20 | 88% |
excel.py | 174 | 17 | 90% |
json.py | 205 | 37 | 82% |
utils.py | 88 | 10 | 89% |
data2rdf/pipelines | | | |
__init__.py | 2 | 0 | 100% |
main.py | 82 | 9 | 89% |
data2rdf/qudt | | | |
__init__.py | 0 | 0 | 100% |
utils.py | 42 | 12 | 71% |
TOTAL | 1194 | 144 | 88% |
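The Cover column follows from the other two: it is the share of statements that were executed, rounded to a whole percent. As a quick check of the figures in the report, the sketch below recomputes two rows. The function name `cover_pct` is made up for this example, and treating a file with zero statements as 100% covered is an assumption that matches the empty `qudt` package row.

```shell
#!/bin/sh
# Hypothetical helper: coverage = (statements - missed) / statements,
# rounded to a whole percent. A file with no statements counts as 100%.
cover_pct() {
    awk -v stmts="$1" -v miss="$2" 'BEGIN {
        if (stmts == 0) { print "100%"; exit }
        printf "%.0f%%\n", (stmts - miss) / stmts * 100
    }'
}

cover_pct 1194 144   # TOTAL row: prints 88%
cover_pct 205 37     # parsers json.py row: prints 82%
```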
Installation
Install for using the package
Install the package from PyPI:
pip install data2rdf
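If you prefer to keep the package out of your system Python, the same install can be done inside a virtual environment. The following is only a sketch: the function name `install_in_venv` and the `.venv` directory are assumptions, not something the project prescribes, and the `bin/python` path applies to Linux and macOS.

```shell
#!/bin/sh
# Hypothetical helper: install a package into a fresh virtual environment.
# Usage: install_in_venv [ENV_DIR] [PACKAGE]
install_in_venv() {
    env_dir="${1:-.venv}"
    package="${2:-data2rdf}"
    python3 -m venv "$env_dir" || return 1
    # Use the environment's own interpreter so nothing is installed system-wide
    "$env_dir/bin/python" -m pip install "$package" || return 1
    # Print the installed version as a quick sanity check
    "$env_dir/bin/python" -c "import importlib.metadata as m, sys; print(m.version(sys.argv[1]))" "$package"
}
```

Calling `install_in_venv` with no arguments creates `.venv` in the current directory and installs `data2rdf` into it.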
Install for development
Install the package from the source code:
git clone git@github.com:MI-FraunhoferIWM/data2rdf.git
cd data2rdf
pip install -e .
Unit tests
Before running the unit tests, please install the needed packages:
pip install data2rdf[tests]
Afterwards, run the unit tests with:
pytest
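To get a statement coverage table like the one in the coverage report, pytest can be run with coverage options. This sketch assumes the `pytest-cov` plugin is installed (whether the `tests` extra pulls it in is not stated here); the wrapper name `run_coverage` is made up for illustration.

```shell
#!/bin/sh
# Assumes the pytest-cov plugin is installed; it provides the --cov options.
run_coverage() {
    # --cov=data2rdf: measure the data2rdf package
    # --cov-report=term-missing: print a table that also lists missed lines
    pytest --cov=data2rdf --cov-report=term-missing "$@"
}
```

Run `run_coverage` from the project root. Extra arguments are passed through to pytest, for example a path to a single test file.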
Building the docs locally
HTML
A local server builds the docs, serves them and regenerates them whenever the source files change. You can run it either with Docker or by installing the development environment directly on your machine. Installation guides for Docker and Linux follow.
Docker
First, build the Docker image by running the following command:
$ docker build -f docs.Dockerfile -t data2rdf-docs .
Then start the docs server in a container by running:
$ docker run -it --rm -v $PWD:/app -p 8000:8000 data2rdf-docs
Linux
First install the system-level dependencies (these commands work on Debian Linux):
$ sudo apt install pandoc graphviz default-jre
$ sudo apt-get install texlive-latex-recommended \
texlive-latex-extra \
texlive-fonts-recommended \
latexmk
Then install the Python dependencies:
$ pip install .[docs]
Now you can start the server and render the docs:
$ sphinx-autobuild docs/source docs/build/html
The documentation will be available at http://127.0.0.1:8000.
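The server can take a moment to finish the first build. If you want to confirm from a script that it is answering, something along these lines may help. It is a sketch that assumes `curl` is installed; the helper name `wait_for_docs` and the retry count are invented for this example.

```shell
#!/bin/sh
# Hypothetical helper (not part of data2rdf): poll a URL until it answers.
# Usage: wait_for_docs [URL] [TRIES]
wait_for_docs() {
    url="${1:-http://127.0.0.1:8000}"
    tries="${2:-30}"
    i=1
    while [ "$i" -le "$tries" ]; do
        # -f: fail on HTTP errors, -s: silent, -o /dev/null: discard the body
        if curl -fs --max-time 2 -o /dev/null "$url"; then
            echo "docs are up at $url"
            return 0
        fi
        i=$((i + 1))
        sleep 1
    done
    echo "no response from $url after $tries attempts" >&2
    return 1
}
```

With no arguments, `wait_for_docs` checks the default address once per second for up to 30 attempts and returns a non-zero status if the server never responds.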
PDF (LaTeX)
To generate a PDF of the documentation, simply run (from the root project folder):
make -C docs latexpdf
The generated PDF can be found under docs/build/latex/data2rdf_docs.pdf
Download files
File details
Details for the file data2rdf-2.2.1.tar.gz.
File metadata
- Download URL: data2rdf-2.2.1.tar.gz
- Upload date:
- Size: 39.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest |
---|---|
SHA256 | b2d7c9243215b2b8836453eb8387a8421307496c36e546abd5fc23dcb49ae9ad |
MD5 | 79d2a3fcbf2eb5bbf3ba9cf175532f8b |
BLAKE2b-256 | fbf8dd0dda179917ce8ab15997311aee64279575b6a3b5466ee26fbdff969f9a |
File details
Details for the file data2rdf-2.2.1-py3-none-any.whl.
File metadata
- Download URL: data2rdf-2.2.1-py3-none-any.whl
- Upload date:
- Size: 66.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest |
---|---|
SHA256 | 7467e457f63b14aad1a1b83490bb71af9af528ef163404459d8c38076efd1aaf |
MD5 | 0bd19e8ec6c04694859041a932ab38c8 |
BLAKE2b-256 | 8fb7d700bf40e3b5623f7f6d4c901c25aaedcc9c92afb50b11dc4de518ba7438 |