
Knowledge Graph Creator: converts text to RDF triples


Knowledge Graph Creator command line tool: kgcreator


Knowledge Graph Creator: converts text to RDF triples and to Neo4J and Memgraph graph database data

The Knowledge Graph Creator (kgcreator) is a tool for automating the generation of RDF data for Knowledge Graphs from raw text data read from an input directory path.

The Knowledge Graph Creator creates an output file containing RDF triples suitable for loading into any linked data/semantic web data store.
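To make the idea of RDF triples concrete, here is a minimal sketch of writing subject/predicate/object triples in N-Triples syntax. It is not the code kgcreator uses. The function names, the example.com URIs, and the sample data are all hypothetical and chosen only for illustration.

```python
def escape_literal(text: str) -> str:
    """Escape backslashes, quotes, and line breaks for an N-Triples literal."""
    return (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def to_ntriple(subject_uri: str, predicate_uri: str, obj: str, obj_is_uri: bool = False) -> str:
    """Format one triple as a single N-Triples line."""
    obj_term = f"<{obj}>" if obj_is_uri else f'"{escape_literal(obj)}"'
    return f"<{subject_uri}> <{predicate_uri}> {obj_term} ."


# Hypothetical example data: placeholder URIs, not actual kgcreator output.
triples = [
    ("http://example.com/doc/1", "http://example.com/ontology/mentionsPerson", "Jane Doe", False),
    (
        "http://example.com/doc/1",
        "http://www.w3.org/1999/02/22-rdf-syntax-ns#type",
        "http://example.com/ontology/Document",
        True,
    ),
]

lines = [to_ntriple(*triple) for triple in triples]
print("\n".join(lines))
```

Each printed line is one complete statement, which is why this kind of file can be loaded by most linked data stores.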

This Python command line utility is one of the example programs in my book Practical Python Artificial Intelligence Programming, which is available on Leanpub.com. The book is still under development; in the future it will also be free to read on my web site https://markwatson.com.

Installation

Install this tool using pip:

pip install kgcreator
pip install spacy
python -m spacy download en_core_web_sm
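After running the commands above, a quick way to confirm that everything is importable is a small check like the following. This is only a sketch using the standard library; the helper name missing_packages is hypothetical and is not part of kgcreator.

```python
import importlib.util


def missing_packages(names):
    """Return the names that cannot be found as importable top-level packages."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# The tool itself, spaCy, and the spaCy English model (installed as a package).
required = ["kgcreator", "spacy", "en_core_web_sm"]

missing = missing_packages(required)
if missing:
    print("Missing:", ", ".join(missing))
else:
    print("All required packages are installed.")
```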

Usage

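For help, run:

kgcreator --help

To process the text files in an input directory and write both RDF and Neo4J Cypher output files, run:

kgcreator --inputdir=test_data --outputfile=out.rdf --outputfileneo4j=out.cypher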
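If you want to drive the command line tool from a Python script, one option is to build the same argument list and hand it to subprocess. The sketch below uses only the three options shown above; the helper names build_command and run_kgcreator are hypothetical, and the actual call is left commented out because it requires kgcreator to be installed.

```python
import subprocess


def build_command(input_dir: str, rdf_file: str, cypher_file: str) -> list:
    """Build the kgcreator argument list using the options shown in this README."""
    return [
        "kgcreator",
        f"--inputdir={input_dir}",
        f"--outputfile={rdf_file}",
        f"--outputfileneo4j={cypher_file}",
    ]


def run_kgcreator(input_dir: str, rdf_file: str, cypher_file: str) -> subprocess.CompletedProcess:
    """Run kgcreator and raise CalledProcessError if it exits with an error."""
    return subprocess.run(build_command(input_dir, rdf_file, cypher_file), check=True)


command = build_command("test_data", "out.rdf", "out.cypher")
print(" ".join(command))

# Uncomment to run the tool (assumes kgcreator is installed and on the PATH):
# run_kgcreator("test_data", "out.rdf", "out.cypher")
```

Passing the arguments as a list avoids shell quoting problems when directory or file names contain spaces.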

You can also use:

python -m kgcreator --help

Development

To contribute to this tool, first check out the code. Then create a new virtual environment:

cd kgcreator
python -m venv venv
source venv/bin/activate

Now install the dependencies and test dependencies:

pip install -e '.[test]'

To run the tests:

pytest

Notes on generating Neo4J graph data

Start by identifying:

  • Names of entities (node labels).
  • Names of relationships.
  • Names of properties for nodes and relationships.
  • Constraints to be defined.
  • Indexes required.
  • The most important queries.
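As an illustration of how the items in this checklist map onto Cypher, here is a minimal sketch that generates constraint, index, node, and relationship statements as text. It is not the Cypher that kgcreator writes. The labels (Person, Organization), the name property, the WORKS_AT relationship, and all helper names are hypothetical, and the constraint and index syntax assumes a recent Neo4J release.

```python
def cypher_string(value: str) -> str:
    """Quote a Python string as a double-quoted Cypher string literal."""
    return '"' + value.replace("\\", "\\\\").replace('"', '\\"') + '"'


def unique_constraint(label: str, prop: str) -> str:
    """Constraint: at most one node with this label per property value."""
    name = f"{label.lower()}_{prop}_unique"
    return f"CREATE CONSTRAINT {name} IF NOT EXISTS FOR (n:{label}) REQUIRE n.{prop} IS UNIQUE;"


def node_index(label: str, prop: str) -> str:
    """Index: speeds up lookups of nodes by this property."""
    name = f"{label.lower()}_{prop}_index"
    return f"CREATE INDEX {name} IF NOT EXISTS FOR (n:{label}) ON (n.{prop});"


def merge_node(label: str, name: str) -> str:
    """Node: created only if no matching node already exists."""
    return f"MERGE (:{label} {{name: {cypher_string(name)}}});"


def merge_relationship(from_label: str, from_name: str, rel_type: str, to_label: str, to_name: str) -> str:
    """Relationship between two existing nodes, matched by label and name."""
    return (
        f"MATCH (a:{from_label} {{name: {cypher_string(from_name)}}}), "
        f"(b:{to_label} {{name: {cypher_string(to_name)}}}) "
        f"MERGE (a)-[:{rel_type}]->(b);"
    )


# Hypothetical example data, for illustration only.
statements = [
    unique_constraint("Person", "name"),
    node_index("Organization", "name"),
    merge_node("Person", "Jane Doe"),
    merge_node("Organization", "Example Corp"),
    merge_relationship("Person", "Jane Doe", "WORKS_AT", "Organization", "Example Corp"),
]
print("\n".join(statements))
```

Using MERGE rather than CREATE keeps the script safe to run more than once, since existing nodes and relationships are reused instead of duplicated.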


