Skip to main content

Library to process dumps of knowledge graphs (Wikipedia, DBpedia, Wikidata)

Project description

kgdata PyPI Documentation

KGData is a library to process dumps of Wikipedia, Wikidata. What it can do:

  • Clean up the dumps to ensure the data is consistent (resolve redirect, remove dangling references)
  • Create embedded key-value databases to access entities from the dumps.
  • Extract Wikidata ontology.
  • Extract Wikipedia tables and convert the hyperlinks to Wikidata entities.
  • Create Pyserini indices to search Wikidata’s entities.
  • and more

For a full documentation, please see the website.

Installation

From PyPI (using pre-built binaries):

pip install kgdata[spark]   # omit spark to manually specify its version if your cluster has different version

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kgdata-7.0.4.tar.gz (150.3 kB view details)

Uploaded Source

Built Distributions

kgdata-7.0.4-pp310-pypy310_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (4.0 MB view details)

Uploaded PyPy manylinux: glibc 2.17+ x86-64

kgdata-7.0.4-pp39-pypy39_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (4.0 MB view details)

Uploaded PyPy manylinux: glibc 2.17+ x86-64

kgdata-7.0.4-cp313-cp313t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (4.0 MB view details)

Uploaded CPython 3.13t manylinux: glibc 2.17+ x86-64

kgdata-7.0.4-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (4.0 MB view details)

Uploaded CPython 3.13 manylinux: glibc 2.17+ x86-64

kgdata-7.0.4-cp312-none-win_amd64.whl (2.2 MB view details)

Uploaded CPython 3.12 Windows x86-64

kgdata-7.0.4-cp312-cp312-manylinux_2_35_x86_64.whl (3.3 MB view details)

Uploaded CPython 3.12 manylinux: glibc 2.35+ x86-64

kgdata-7.0.4-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (4.0 MB view details)

Uploaded CPython 3.12 manylinux: glibc 2.17+ x86-64

kgdata-7.0.4-cp312-cp312-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl (5.5 MB view details)

Uploaded CPython 3.12 macOS 10.14+ universal2 (ARM64, x86-64) macOS 10.14+ x86-64 macOS 11.0+ ARM64

kgdata-7.0.4-cp311-none-win_amd64.whl (2.3 MB view details)

Uploaded CPython 3.11 Windows x86-64

kgdata-7.0.4-cp311-cp311-manylinux_2_35_x86_64.whl (3.3 MB view details)

Uploaded CPython 3.11 manylinux: glibc 2.35+ x86-64

kgdata-7.0.4-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (4.0 MB view details)

Uploaded CPython 3.11 manylinux: glibc 2.17+ x86-64

kgdata-7.0.4-cp311-cp311-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl (5.5 MB view details)

Uploaded CPython 3.11 macOS 10.14+ universal2 (ARM64, x86-64) macOS 10.14+ x86-64 macOS 11.0+ ARM64

kgdata-7.0.4-cp310-none-win_amd64.whl (2.3 MB view details)

Uploaded CPython 3.10 Windows x86-64

kgdata-7.0.4-cp310-cp310-manylinux_2_35_x86_64.whl (3.3 MB view details)

Uploaded CPython 3.10 manylinux: glibc 2.35+ x86-64

kgdata-7.0.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (4.0 MB view details)

Uploaded CPython 3.10 manylinux: glibc 2.17+ x86-64

kgdata-7.0.4-cp310-cp310-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl (5.5 MB view details)

Uploaded CPython 3.10 macOS 10.14+ universal2 (ARM64, x86-64) macOS 10.14+ x86-64 macOS 11.0+ ARM64

kgdata-7.0.4-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (4.0 MB view details)

Uploaded CPython 3.9 manylinux: glibc 2.17+ x86-64

File details

Details for the file kgdata-7.0.4.tar.gz.

File metadata

  • Download URL: kgdata-7.0.4.tar.gz
  • Upload date:
  • Size: 150.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.5.1

File hashes

Hashes for kgdata-7.0.4.tar.gz
Algorithm Hash digest
SHA256 13eb9ec6b781c201dd6607d19940b37f739f568f65c4654aa373a383d7f45219
MD5 646ed05148d6d0c8589deeed03400592
BLAKE2b-256 80c8b64411a2bc1bd4b7cb6801badd40e0d1f2fdc461c786a3fb0480cb34ef2a

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-pp310-pypy310_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-pp310-pypy310_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 867abaa9f2f0db21bb40c7b98e7567de5bd855f9fbb55141ef884f769fa6de16
MD5 2d6f69b32957ed5c55ad12d033f6b831
BLAKE2b-256 2f7b4841cc2edd6a1ec3226de74e6c0238603ee799652239c0fee658d20e1453

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-pp39-pypy39_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-pp39-pypy39_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 a982abc19eb9dc1aaf50c961173c18519d2af61b5d503805dd14473d7f8dd90f
MD5 731e7bb18a91c69ed82b40e585ed7ee7
BLAKE2b-256 373b9fedb7962c550992b672ab230d320f45253c0ab048be8317fb388b662e04

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp313-cp313t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp313-cp313t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 0d939acdaad42e23afeefde1f464e0e51016c6a4b03d71602e83974433eab0e4
MD5 c2be1f2a2d1b366742f1014fd29c0f1c
BLAKE2b-256 cdc7f54b19221da70a16e66e02151d2b2ed863d56121632f068ab24cd752a161

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 96d5929fb7669580efd490e79235297c72a764d8f2b4e407c6e860b84644ccad
MD5 75d5f2bd5d7c0c27f313b86cc1cbc3ec
BLAKE2b-256 ae53dcfa1b0cb8c64d86432083215319392e470aa7316c473ff45ec2d659f912

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp312-none-win_amd64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp312-none-win_amd64.whl
Algorithm Hash digest
SHA256 7a4641bcb36c0003456bf459b3ace8259b489fa3f63e083ae8dd0c2d762d3972
MD5 55e723cb8711c6331bee0c62fc1d1a0f
BLAKE2b-256 2e9306c174b74a827e6e2e6f05b031219557b128ed0c07a91db1f9ff22d435cb

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp312-cp312-manylinux_2_35_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp312-cp312-manylinux_2_35_x86_64.whl
Algorithm Hash digest
SHA256 341bcb519e71060ef694dfa65841f6821e9577ce0f50319c79e5cfa3ee2ca762
MD5 a53867ad53efee9f07f1305feb6370a0
BLAKE2b-256 26fac4c3a0adbbc3f967578d4aceffedde9509cb8806b655f4d6ea17eaafaf7c

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 a0e5d194721779c4f34f9c32fbf5d03693b0ca0feacae8ee327160086b997dd3
MD5 9bc506b408c940d3c0bcd21dc6584ba5
BLAKE2b-256 a92de4d3b7b92e9501402ba7f5adfa0003d250fac434e9c67bed12a29b731896

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp312-cp312-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp312-cp312-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl
Algorithm Hash digest
SHA256 53929186d9420d2637805ded26271461232e1ad663d285fad2ce4a8ad0fb91a5
MD5 f9def8e89ce99352b01858618bb76185
BLAKE2b-256 eeefc525b57214eaf35f84ebbb537c45aac0fddabe92b0c1e9a155ba4cfec51d

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp311-none-win_amd64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp311-none-win_amd64.whl
Algorithm Hash digest
SHA256 29981188670e8d964f4a58b22ac5c25647401c227bdb30fff0ef1b3120e14ce2
MD5 27c2b4f7f215bf696619398898382712
BLAKE2b-256 ecddd8d65e5a6646775a34e4b82c7767a88c77d131d36de766b6f6bc0ddb3953

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp311-cp311-manylinux_2_35_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp311-cp311-manylinux_2_35_x86_64.whl
Algorithm Hash digest
SHA256 733e184e6d4d2bca07923dea0c1ebe7a4f5bf661fd7a98588236b41730ac41e6
MD5 9bced9b9582d357bc7c8e9f2433d1372
BLAKE2b-256 c3e1bf70d9fa5f45232c6df98da4606dc6bd825b5319acbc628b2ef0769ac688

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 35aa4ce4eaf02c7ecb4a88166f2b197026d99ac88bd2380f5373f6452dbaa74d
MD5 5b4475028cda9d97017e1d212158e376
BLAKE2b-256 170afbf91adba83fe4bf0385260d1ac4cac8edf3ad7e3cac4779be3743b25ed1

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp311-cp311-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp311-cp311-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl
Algorithm Hash digest
SHA256 7ca4b6ac228b2ed1a3f0e51a8fde91bf7741377f81a9f446158d7a27f85fee49
MD5 df6e95cd54376179dfc067463e573e45
BLAKE2b-256 4a9cb4ac35bc95f5e4e0431476a0c2eaf63ebf6a481e7245ddedbb746de7f1fd

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp310-none-win_amd64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp310-none-win_amd64.whl
Algorithm Hash digest
SHA256 0d6dc8ffec6d4197141e25ed81ad32849a09ceb1450206848f264baac940cf5d
MD5 3a0c76e04b6bb46ff2caa3c2484128ea
BLAKE2b-256 c829304cf7fa3249da664627fb6c75ae7375c49302c44ca8de5757108dc8ac07

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp310-cp310-manylinux_2_35_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp310-cp310-manylinux_2_35_x86_64.whl
Algorithm Hash digest
SHA256 a3240b8b08d8dd9c89c2ea9f670ddbe3e5b9ee43e4fb5aba41e8682aadbf6141
MD5 c4ee051784dbcec70b93ccfc23ec4d1f
BLAKE2b-256 de28acd3d0bf61f44e8697456198a862ec1f90cd0687d18cba6f0ba8a1436255

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 d346320f23670b7cb18503076124a68d07be360042579ff36abaffe622fe8c9f
MD5 348cde176b575f8d95839810740ba59d
BLAKE2b-256 592e0385f2cfb33dcc4d718d0334fbec17a560c752e08f91b33a462e48c7ec6b

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp310-cp310-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp310-cp310-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl
Algorithm Hash digest
SHA256 26155cd09d9fb89459cfbc68d0ebe61689ae58dcaf37f708b55465b1af4dc223
MD5 3f4ef5ac0d23f13ea253d44cb0678e9e
BLAKE2b-256 c139e53cc4f5c43879a7513fe110c5383d5abd597d3d446f1f16789e17dd6fb6

See more details on using hashes here.

File details

Details for the file kgdata-7.0.4-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.4-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 9ddc3f342f7da134a608b2ac49a642844f530acf1f2935331926e9f709821423
MD5 38dc2e13deaa8a13f115212e30b001d7
BLAKE2b-256 fa27468d61eed6561e0cc942b28a55f258a576ab47ba50bdd2d69e61962afb48

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page