Skip to main content

A generic pipeline that can be used to map raw data to RDF.

Project description

data2rdf

A pipeline for generating data representation in RDF out of raw data given in ASCII, CSV, JSON or EXCEL format.

https://data2rdf.readthedocs.io/en/latest/

Coverage

Coverage Report
FileStmtsMissCoverMissing
data2rdf
   init.py50100% 
   config.py2711 96%
   utils.py1222 83%
   warnings.py30100% 
data2rdf/models
   init.py30100% 
   base.py5355 91%
   graph.py14488 94%
   mapping.py4711 98%
   utils.py581414 76%
data2rdf/modes
   init.py40100% 
data2rdf/parsers
   init.py60100% 
   base.py1531717 89%
   csv.py1682020 88%
   excel.py1831717 91%
   json.py2293939 83%
   utils.py901010 89%
data2rdf/pipelines
   init.py20100% 
   main.py951616 83%
data2rdf/qudt
   init.py00100% 
   utils.py551616 71%
TOTAL133716688% 

Installation

Install for using the package

Either install the package from the pypi

pip install data2rdf

Install for development

Install the package from the source code:

git clone git@github.com:MI-FraunhoferIWM/data2rdf.git
cd data2rdf
pip install -e .

Unit tests

Before running the unit tests, please install the needed packages:

pip install data2rdf[tests]

Afterwards, run the unittest with:

pytest

Building the docs locally

HTML

A server will start, generate the docs and listen for changes in the source files. This can be done by using docker or installing the development environment directly on the you machine. Next are installation guides for Docker and Linux OS.

Docker

First, build the Docker image by running the following command:

$ docker build -f docs.Dockerfile -t data2rdf-docs .

Then, start the program by running:

$ docker run -it --rm -v $PWD:/app -p 8000:8000 data2rdf-docs

Linux

At an OS level (these commands work on Linux Debian):

$ sudo apt install pandoc graphviz default-jre
$ sudo apt-get install texlive-latex-recommended \
                       texlive-latex-extra \
                       texlive-fonts-recommended \
                       latexmk

The python dependencies:

$ pip install .[docs]

Now you can start the server and render the docs:

$ sphinx-autobuild docs/source docs/build/html

The documentation will be available on http://127.0.0.1:8000.

PDF (LaTeX)

To generate a PDF of the documentation, simply run (from the root project folder):

make -C docs latexpdf

The generated PDF can be found under docs/build/latex/data2rdf_docs.pdf

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

data2rdf-2.4.2.tar.gz (47.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

data2rdf-2.4.2-py3-none-any.whl (75.4 kB view details)

Uploaded Python 3

File details

Details for the file data2rdf-2.4.2.tar.gz.

File metadata

  • Download URL: data2rdf-2.4.2.tar.gz
  • Upload date:
  • Size: 47.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.15

File hashes

Hashes for data2rdf-2.4.2.tar.gz
Algorithm Hash digest
SHA256 4c6b3b7bfa5732a281eec759b99ad7cbaef87920f14a5e204d3f67adf710e0fc
MD5 b7eff3f6081c31cc143d72ed8360c7a1
BLAKE2b-256 3f8ee9c0a90ea25a4edda00b048d2804d76b797096c7cdcec34111ca94fb157f

See more details on using hashes here.

File details

Details for the file data2rdf-2.4.2-py3-none-any.whl.

File metadata

  • Download URL: data2rdf-2.4.2-py3-none-any.whl
  • Upload date:
  • Size: 75.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.15

File hashes

Hashes for data2rdf-2.4.2-py3-none-any.whl
Algorithm Hash digest
SHA256 7ebe5e3e0166dd0bc212d2cf92f7e587b401647b1d70e1ba052dd5339fad85bf
MD5 63df86fa0cbc8ab080d500c690e0a2e8
BLAKE2b-256 fd48a291f533d1824a8951646ff415e7f37eca2fd97f7e92648af2a838240939

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page