Library to process dumps of knowledge graphs (Wikipedia, DBpedia, Wikidata)
Project description
kgdata
KGData is a library to process dumps of Wikipedia, Wikidata. What it can do:
- Clean up the dumps to ensure the data is consistent (resolve redirect, remove dangling references)
- Create embedded key-value databases to access entities from the dumps.
- Extract Wikidata ontology.
- Extract Wikipedia tables and convert the hyperlinks to Wikidata entities.
- Create Pyserini indices to search Wikidata’s entities.
- and more
For a full documentation, please seethe website.
Installation
From PyPI (using pre-built binaries):
pip install kgdata
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
kgdata-3.0.3.tar.gz
(59.7 kB
view hashes)
Built Distribution
kgdata-3.0.3-py3-none-any.whl
(84.2 kB
view hashes)