Library to process dumps of knowledge graphs (Wikipedia, DBpedia, Wikidata)
Project description
kgdata
KGData is a library to process dumps of Wikipedia, Wikidata. What it can do:
- Clean up the dumps to ensure the data is consistent (resolve redirect, remove dangling references)
- Create embedded key-value databases to access entities from the dumps.
- Extract Wikidata ontology.
- Extract Wikipedia tables and convert the hyperlinks to Wikidata entities.
- Create Pyserini indices to search Wikidata’s entities.
- and more
For a full documentation, please see the website.
Installation
From PyPI (using pre-built binaries):
pip install kgdata[spark] # omit spark to manually specify its version if your cluster has different version
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
kgdata-7.0.0.tar.gz
(147.6 kB
view hashes)
Built Distributions
Close
Hashes for kgdata-7.0.0-pp310-pypy310_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a58270797f0154cc300c15c442551969946ba8717be69054e07449c858d6bd7b |
|
MD5 | aeec260ce82146a851b5680ffca021c1 |
|
BLAKE2b-256 | a2d1196f9096c6ab4c567a859ed6991d8bc6a383a5182c87328a261a27f614b4 |
Close
Hashes for kgdata-7.0.0-pp39-pypy39_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9b6be28386bef3be96f74b298a53a94c97d55a3e716b1d6da2397587c7d9d516 |
|
MD5 | 4c454085274970ce7c28e4d2243dc568 |
|
BLAKE2b-256 | b22e5bd2ad8bdc123734c47558f17b36ed812efdccb49e7bc6e82c6947c4f4e2 |
Close
Hashes for kgdata-7.0.0-cp312-none-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9148c8f21e08fc36629a3777c63607c64e6b1083c0e91fd49e2a6ea8b653df09 |
|
MD5 | e715897ed8ba2c7600635a0a578f6832 |
|
BLAKE2b-256 | e634b2bb0d176423cb657b4751885b6a7d0d4fe5948497a2e8bc97631951bc47 |
Close
Hashes for kgdata-7.0.0-cp312-cp312-manylinux_2_35_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e11231d31d06331450e38c057a928e63ae410c9796ca01412227897b490d4277 |
|
MD5 | 0ea686b184c52d6f55df1c26c022378b |
|
BLAKE2b-256 | 77ee76a07ba2035c6eef682e2643c983afeee3cf39ecbabaec80d558d26b768c |
Close
Hashes for kgdata-7.0.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9be40f3de9efe04acc99777b7a587e4bbbe7be04f288a9fa490639ce27195439 |
|
MD5 | 5da4d912dd176b34b7610e7b14466641 |
|
BLAKE2b-256 | 92a1f8827546ac0e5b61a7b33aff21937617598d71306d77afb7bd5e4e3f1692 |
Close
Hashes for kgdata-7.0.0-cp312-cp312-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 50f2d7dd3243a5e094ac8377c64c7b6ce44e6dc0170c445a06affe9fd2fb2cef |
|
MD5 | 04b940cee64ed330b9bf3fcabd74880b |
|
BLAKE2b-256 | 5588d4e1ce9f9222adace126885f58eec34b76a66dbd04c72e884f095f378760 |
Close
Hashes for kgdata-7.0.0-cp311-none-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a50601598db613e90f7b0067bc05c85fd776c8151296e73b27e67a236a690e7c |
|
MD5 | a692023af7a1c043d9f1561f6e7cb03c |
|
BLAKE2b-256 | 36f13b31eacfe91c470988261b9f2d51ab8d8e7b3a29b4b1719b775c92f93852 |
Close
Hashes for kgdata-7.0.0-cp311-cp311-manylinux_2_35_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | bee04e7d699f45555b5ccc22f7dd14d9e1e74e8612d6ce801d77c1e89fbcebb0 |
|
MD5 | 64a77e15d022f5c9f910f983a7744d4b |
|
BLAKE2b-256 | f82ff343332d27d130d27e4c67ef84043d2bcb74cf54eb2f76341ab5a2479226 |
Close
Hashes for kgdata-7.0.0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | de9fe17d462306dcc08791a9777b6a84898ba471d36052ecb2be0cfa395b5d73 |
|
MD5 | dc495d6b9312f0b13be5354cf3d0dd79 |
|
BLAKE2b-256 | fcaeb103c78ce9d2294bd3d5e3f75503bc98eeb6345c78cf52110fcecbb21b8c |
Close
Hashes for kgdata-7.0.0-cp311-cp311-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0ee76a248fc32795f056ffdc26218128157c73d18cc770d855042be03f89726f |
|
MD5 | c1a834a49ea3fd2449708494bc8d8220 |
|
BLAKE2b-256 | 84a5b350b0a4255ec1ed0d7710659daa452579d323d175c26ff87e5611034cda |
Close
Hashes for kgdata-7.0.0-cp310-none-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 990c7deaaa9c1da5438a1acae2d64dfc3474a93572f6c1aa8c8f6187504cf961 |
|
MD5 | 68ac7f9b74bdfbe41788ffd198c13adb |
|
BLAKE2b-256 | 1d2264fc9f2e8023b1a8d74ec08f0cbfdb3ad362e057710919df081195275b7a |
Close
Hashes for kgdata-7.0.0-cp310-cp310-manylinux_2_35_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e47cbd0714c057ecbf97648cd8edef15d08a3050e3b4f45e24c1cc81334bd00f |
|
MD5 | 75e02ed94d8257778b8e5fdc2db511f1 |
|
BLAKE2b-256 | 5ea633f4276894bb53f86941f1cef20413d8d3b96942ccc6910c473d1989d7f2 |
Close
Hashes for kgdata-7.0.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 13b6833c4ec4cbff63195248e7f6b27ca72811339d6db8a9525cc357fc3a3cfb |
|
MD5 | 0f05465b925ccc44f5bf6a3d48fd378d |
|
BLAKE2b-256 | 27457d7b2546b7bd3255d46ecf80604dfc5ca48a53120031debd186bc25d1f88 |
Close
Hashes for kgdata-7.0.0-cp310-cp310-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e8f9111b29411bf0a9916a41b31bc52244a227be7929e8259944243cbf5924c7 |
|
MD5 | 59f9b17845e2fa411c9c8d15e35be1a3 |
|
BLAKE2b-256 | d42fccda83b623c4ffe91a29a7511cafe66e66ccda769bf041660ffcca3bb78a |
Close
Hashes for kgdata-7.0.0-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b1d6728b16067e99c54a58ae23bbe2af0c1a26c1f0197a87b0d4076be5e1e6d3 |
|
MD5 | bdb0cd4778509f21c1c4bc8195501f30 |
|
BLAKE2b-256 | 2698f6453c29658c3e0f9a1f937f0c6bb19741b67a641f3c898741434f6e0a15 |