Skip to main content

Library to process dumps of knowledge graphs (Wikipedia, DBpedia, Wikidata)

Project description

kgdata PyPI Documentation

KGData is a library to process dumps of Wikipedia, Wikidata. What it can do:

  • Clean up the dumps to ensure the data is consistent (resolve redirect, remove dangling references)
  • Create embedded key-value databases to access entities from the dumps.
  • Extract Wikidata ontology.
  • Extract Wikipedia tables and convert the hyperlinks to Wikidata entities.
  • Create Pyserini indices to search Wikidata’s entities.
  • and more

For a full documentation, please see the website.

Installation

From PyPI (using pre-built binaries):

pip install kgdata[spark]   # omit spark to manually specify its version if your cluster has different version

Project details


Release history Release notifications | RSS feed

This version

7.0.6

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

kgdata-7.0.6-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB view details)

Uploaded PyPymanylinux: glibc 2.17+ x86-64

kgdata-7.0.6-pp310-pypy310_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB view details)

Uploaded PyPymanylinux: glibc 2.17+ x86-64

kgdata-7.0.6-cp313-cp313t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB view details)

Uploaded CPython 3.13tmanylinux: glibc 2.17+ x86-64

kgdata-7.0.6-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

kgdata-7.0.6-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

kgdata-7.0.6-cp310-cp310-win_amd64.whl (2.3 MB view details)

Uploaded CPython 3.10Windows x86-64

kgdata-7.0.6-cp310-cp310-manylinux_2_35_x86_64.whl (3.3 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.35+ x86-64

kgdata-7.0.6-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.17+ x86-64

kgdata-7.0.6-cp310-cp310-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl (5.5 MB view details)

Uploaded CPython 3.10macOS 10.14+ universal2 (ARM64, x86-64)macOS 10.14+ x86-64macOS 11.0+ ARM64

kgdata-7.0.6-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB view details)

Uploaded CPython 3.9manylinux: glibc 2.17+ x86-64

File details

Details for the file kgdata-7.0.6-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.6-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 5dc508a3e0cb4add2a0f597c5dca3fa26602f9b3eb6323560d876bda9bf152cf
MD5 1b0e5d5ee4f8e3acd638f0775bfcee33
BLAKE2b-256 51bcb31da11773837dedb557c4f35aef82c45a1b872828c6507ee4428d297cc4

See more details on using hashes here.

File details

Details for the file kgdata-7.0.6-pp310-pypy310_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.6-pp310-pypy310_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 b108c5b9a91ffd87f73e1065c1e85ca0b4548b3badc8b0e3cb8c87e5ecfce504
MD5 0cbc140ab83415f87ed7b914684a48f4
BLAKE2b-256 4b52eb48542fe7c5674f5de91d371b8ca2c1ff4d6f8ae42288f4c3a24e56ede3

See more details on using hashes here.

File details

Details for the file kgdata-7.0.6-cp313-cp313t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.6-cp313-cp313t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 16329637df021d59e1a23480a82a034a898f931324d25c8ac043a3ddb1df58c7
MD5 8380ac909b5d0bdb58283262d774dff4
BLAKE2b-256 2ad73cea9b9443d46d3368644ae02ba28327add656d464ae2b74dbf137c669f7

See more details on using hashes here.

File details

Details for the file kgdata-7.0.6-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.6-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 9e9287937292c5f728f1463a3672ec6e8cde694c1d02d71f040cb4276413f4a5
MD5 8a61a51c8e1f070bae86c4181cd40b60
BLAKE2b-256 8af5ec33a249794d31149a940fe6f857bbc38d61b5442854b5fab1db7d7073eb

See more details on using hashes here.

File details

Details for the file kgdata-7.0.6-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.6-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 5fdfb2087a3977cbf0126d30db9d5c7bfc3bd038054832d492dccc6686007f7c
MD5 a2c36f566ed0b4c1033c6c72298a852f
BLAKE2b-256 629aba43b72937602662981f6be4eb9187a64158fc7d3c6bef1e0f12c1692bea

See more details on using hashes here.

File details

Details for the file kgdata-7.0.6-cp310-cp310-win_amd64.whl.

File metadata

  • Download URL: kgdata-7.0.6-cp310-cp310-win_amd64.whl
  • Upload date:
  • Size: 2.3 MB
  • Tags: CPython 3.10, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for kgdata-7.0.6-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 4bdc2d89286882c03127077d216f770f354956b9dc662702156d80403ccdd536
MD5 dd51911cd5018ba772919010001661e0
BLAKE2b-256 a5440b727ec1c4e2a85e36bc8d39f853ecf82d27c6241c7e5d330c0dcbdbabf0

See more details on using hashes here.

File details

Details for the file kgdata-7.0.6-cp310-cp310-manylinux_2_35_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.6-cp310-cp310-manylinux_2_35_x86_64.whl
Algorithm Hash digest
SHA256 b411bfb0ab99995d9195dc51b168a4336a30699141fb9a37b2f0d77b413cc625
MD5 3286eadf4ee4385a33dbd5a2fe1c0831
BLAKE2b-256 d61702378c443fd65cdbf22c2bd47ed6079647a8b43d958ad523dba3ddfd2b10

See more details on using hashes here.

File details

Details for the file kgdata-7.0.6-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.6-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 5f2888e22bacb7e6363e83c359dc5aafeebfa2db62307f8b53dd41ed84b2d868
MD5 d0d08227f9027fa98e85ab52304d6ee9
BLAKE2b-256 6bad3dcf809486806d3da046995ddafc519a2ffb684e24d7ae1cc6b47c451f9a

See more details on using hashes here.

File details

Details for the file kgdata-7.0.6-cp310-cp310-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl.

File metadata

File hashes

Hashes for kgdata-7.0.6-cp310-cp310-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl
Algorithm Hash digest
SHA256 3bfc1ea4014c53348da2380a19b6c447597a76a2373433d1079a0619be21e9eb
MD5 b52767f578af684b05e365854814fdea
BLAKE2b-256 f5fc6083f0b90e852694cd44c4fa6b5c4588d1a3a6f448ed673129639afe98df

See more details on using hashes here.

File details

Details for the file kgdata-7.0.6-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for kgdata-7.0.6-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 f49a5a4f8bcf9f030c1c8e28f871dee879009e879e6045c8b799d45aae7cbd75
MD5 c9e7b1e72867608dd13c9202ef92185b
BLAKE2b-256 baf053c3d37dd7d190b7bc8d4ca21a2eec5240a41e5e749e615927b86801adf1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page