Library to process dumps of knowledge graphs (Wikipedia, DBpedia, Wikidata)

Project description

KGData is a library for processing dumps of Wikipedia and Wikidata. What it can do:

  • Clean up the dumps to ensure the data is consistent (resolve redirects, remove dangling references).
  • Create embedded key-value databases to access entities from the dumps.
  • Extract Wikidata ontology.
  • Extract Wikipedia tables and convert the hyperlinks to Wikidata entities.
  • Create Pyserini indices to search Wikidata’s entities.
  • and more
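
To make the clean-up step concrete, here is a minimal sketch of what resolving redirects and removing dangling references can look like on toy data. It does not use the kgdata API; the function names `resolve_redirect` and `clean_references` and the dictionary layout are assumptions made only for illustration.

```python
from typing import Dict, List


def resolve_redirect(entity_id: str, redirects: Dict[str, str]) -> str:
    """Follow a redirect chain to its final target, stopping if a cycle is found."""
    seen = set()
    while entity_id in redirects and entity_id not in seen:
        seen.add(entity_id)
        entity_id = redirects[entity_id]
    return entity_id


def clean_references(
    entities: Dict[str, List[str]], redirects: Dict[str, str]
) -> Dict[str, List[str]]:
    """Rewrite references through redirects, then drop references to unknown entities."""
    cleaned = {}
    for entity_id, refs in entities.items():
        resolved = (resolve_redirect(ref, redirects) for ref in refs)
        # A reference is "dangling" if its resolved target is not in the dump.
        cleaned[entity_id] = [ref for ref in resolved if ref in entities]
    return cleaned


# Toy data (hypothetical): Q2 redirects to Q1, and Q99 does not exist in the dump.
entities = {"Q1": ["Q3"], "Q3": ["Q2", "Q99"]}
redirects = {"Q2": "Q1"}
cleaned = clean_references(entities, redirects)
print(cleaned)
```

In this toy example the reference to Q2 is rewritten to Q1 and the reference to Q99 is dropped, so every remaining reference points to an entity that is present.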

For full documentation, please see the website.

Installation

From PyPI (using pre-built binaries):

pip install kgdata[spark]   # omit the [spark] extra to install Spark yourself, e.g. when your cluster runs a different version
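
After installing, it can be useful to confirm which versions ended up in the environment, especially when Spark is installed separately to match a cluster. The sketch below uses only the standard library; the helper name `installed_version` is hypothetical, and it assumes that Spark is provided by the `pyspark` distribution.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


# "pyspark" is an assumption about which distribution provides Spark.
for name in ("kgdata", "pyspark"):
    print(name, installed_version(name) or "not installed")
```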

Project details


This version

7.0.7

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release. See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

kgdata-7.0.7-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB)

Tags: PyPy, manylinux: glibc 2.17+ x86-64

kgdata-7.0.7-pp310-pypy310_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB)

Tags: PyPy, manylinux: glibc 2.17+ x86-64

kgdata-7.0.7-cp313-cp313t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB)

Tags: CPython 3.13t, manylinux: glibc 2.17+ x86-64

kgdata-7.0.7-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB)

Tags: CPython 3.12, manylinux: glibc 2.17+ x86-64

kgdata-7.0.7-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB)

Tags: CPython 3.11, manylinux: glibc 2.17+ x86-64

kgdata-7.0.7-cp310-cp310-win_amd64.whl (2.3 MB)

Tags: CPython 3.10, Windows x86-64

kgdata-7.0.7-cp310-cp310-manylinux_2_35_x86_64.whl (3.3 MB)

Tags: CPython 3.10, manylinux: glibc 2.35+ x86-64

kgdata-7.0.7-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB)

Tags: CPython 3.10, manylinux: glibc 2.17+ x86-64

kgdata-7.0.7-cp310-cp310-macosx_11_0_x86_64.macosx_11_0_arm64.macosx_11_0_universal2.whl (5.5 MB)

Tags: CPython 3.10, macOS 11.0+ ARM64, macOS 11.0+ universal2 (ARM64, x86-64), macOS 11.0+ x86-64

kgdata-7.0.7-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB)

Tags: CPython 3.9, manylinux: glibc 2.17+ x86-64
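
The wheel file names above follow the standard pattern `{distribution}-{version}(-{build})?-{python tag}-{abi tag}-{platform tag}.whl`, where several tags can be joined with a dot. As a rough aid for choosing a file, the sketch below splits a name into those parts. The `WheelName` type and `parse_wheel_filename` helper are hypothetical names for illustration, not part of kgdata or pip.

```python
from typing import NamedTuple, Optional


class WheelName(NamedTuple):
    distribution: str
    version: str
    build: Optional[str]
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_filename(filename: str) -> WheelName:
    """Split a wheel file name into its dash-separated components."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        dist, ver, py, abi, plat = parts
        build = None
    elif len(parts) == 6:
        # The optional build tag sits between the version and the Python tag.
        dist, ver, build, py, abi, plat = parts
    else:
        raise ValueError(f"unexpected number of fields in {filename!r}")
    return WheelName(dist, ver, build, py, abi, plat)


wheel = parse_wheel_filename(
    "kgdata-7.0.7-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl"
)
# A dotted platform tag lists every platform the wheel is compatible with.
print(wheel.python_tag, wheel.abi_tag, wheel.platform_tag.split("."))
```

For a real installer-grade parser, the `packaging` library is the safer choice; this sketch only shows how the fields line up.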

File details

Details for the file kgdata-7.0.7-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File hashes

Hashes for kgdata-7.0.7-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 7cf18902982813ca1f62d2bd20391cb06388a27858bc6e6dd503fc3128e16296
MD5 9cfd909680ba3022183840696343d475
BLAKE2b-256 183aa8dcd6209841464ee0ba60650b52932f2ffedf09876a35ca13f480ddd98a

See more details on using hashes here.
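
The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using the standard `hashlib` module; the helper names `sha256_of_file` and `matches_published_hash` and the local file path are assumptions for illustration.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_sha256: str) -> bool:
    """Compare a file's digest with a published one, ignoring case and whitespace."""
    return sha256_of_file(path) == expected_sha256.strip().lower()


# Hypothetical local path; the digest is the SHA256 value listed above for this wheel.
wheel_path = Path(
    "kgdata-7.0.7-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl"
)
expected = "7cf18902982813ca1f62d2bd20391cb06388a27858bc6e6dd503fc3128e16296"
if wheel_path.exists():
    print("hash OK" if matches_published_hash(wheel_path, expected) else "hash MISMATCH")
```

Reading in chunks keeps memory use flat regardless of the wheel's size, which matters little for a 3 MB file but costs nothing.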

File details

Details for the file kgdata-7.0.7-pp310-pypy310_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File hashes

Hashes for kgdata-7.0.7-pp310-pypy310_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 6d76ceb484bf89a058978dc79ab1bfd59a3549913b07cff7abd79f2241d50a8a
MD5 437433bed36a410b4f31011d32b07d37
BLAKE2b-256 bdc94d9f109f0222224f6cd2c9258edbc3f7c994ad61efa1849d04def04c3fb7

File details

Details for the file kgdata-7.0.7-cp313-cp313t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File hashes

Hashes for kgdata-7.0.7-cp313-cp313t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 91b5960bed4fa7aedf7195757a4dcfe1578a6176acf581ab57b210a9d77515d7
MD5 6a11a189b0954ef470472408e7ac0d34
BLAKE2b-256 9bd8607e91170b56c7b199e9b79fdf0ea90dc8a5dcd8ebd83678ad6313a5acdb

File details

Details for the file kgdata-7.0.7-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File hashes

Hashes for kgdata-7.0.7-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 d03f89f5a56ed1f99b8b155083ead55e520d786e94e6c437616be9b7957346c6
MD5 2e6bab7770cca924b074e86a74cb0399
BLAKE2b-256 509a12dc88ffa1d5d660dd77864c8a4237ed1c689c395af1fc83e8c9574b16fd

File details

Details for the file kgdata-7.0.7-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File hashes

Hashes for kgdata-7.0.7-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 55791a6905350d7fe3b7f9f362dbcb76cf7595a6432f9bcb5fd9e3d775ce9003
MD5 7ffa27e7d4809a62fc11d97b9e003caf
BLAKE2b-256 040840e60c0b96611382318325c782e070e77d9e5dea2939ebd13d8a70ebd66f

File details

Details for the file kgdata-7.0.7-cp310-cp310-win_amd64.whl.

File metadata

  • Download URL: kgdata-7.0.7-cp310-cp310-win_amd64.whl
  • Upload date:
  • Size: 2.3 MB
  • Tags: CPython 3.10, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for kgdata-7.0.7-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 5149881966434bab3528211bf693d21b9c290332e5ae081aacc6ca013eb1f9b1
MD5 963f9becfd86f41f67ab1935333d43c3
BLAKE2b-256 fac815217fd43a16d4d107043a65a817656bafebc6da091dfa148ad6cd68702e

File details

Details for the file kgdata-7.0.7-cp310-cp310-manylinux_2_35_x86_64.whl.

File hashes

Hashes for kgdata-7.0.7-cp310-cp310-manylinux_2_35_x86_64.whl
Algorithm Hash digest
SHA256 9e593360797d76a40de7b6332e96b15274170f0e7c1ceab26ea524d6208a5fb3
MD5 00cd364929e910b96ede826246c1b615
BLAKE2b-256 4c601a450771db5e40e4a5fee620066ae07e029aac94f78a75c2b9c34f9938c2

File details

Details for the file kgdata-7.0.7-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File hashes

Hashes for kgdata-7.0.7-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 9c693dd1e9eb42154a0aad9e877632dfaeed40fac1042b76bf88e314bca8b0dc
MD5 983f0cf2359a0360c21402f47b409b62
BLAKE2b-256 d4ea5ecc8aca5ee552c14c7a965be59c7e9be04c56a72d09d209453ab06b9e9f

File details

Details for the file kgdata-7.0.7-cp310-cp310-macosx_11_0_x86_64.macosx_11_0_arm64.macosx_11_0_universal2.whl.

File hashes

Hashes for kgdata-7.0.7-cp310-cp310-macosx_11_0_x86_64.macosx_11_0_arm64.macosx_11_0_universal2.whl
Algorithm Hash digest
SHA256 17bca32779d255b3fccb57a0bd43ef5c3b5d403348e0a5231befb74d5fe3923c
MD5 0a3c874861a1af2f29a9934f9f33ba6e
BLAKE2b-256 a890626c32a03521d89bb5b4d247145b78c1dd6860fe22ac5998ce90db8f342e

File details

Details for the file kgdata-7.0.7-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File hashes

Hashes for kgdata-7.0.7-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 eb17ac5967e8268501eb9247468e4248633d96650bb5f1a703e737a04ed1147d
MD5 467fabfcd88a174a2c6cadf291084903
BLAKE2b-256 7b0371f26c163c65426ca28c9585c8b18528cd9daa2e9c55ee0f89f31ca30086
