Library to process dumps of knowledge graphs (Wikipedia, DBpedia, Wikidata)
Project description
kgdata
KGData is a library to process dumps of Wikipedia, Wikidata. What it can do:
- Clean up the dumps to ensure the data is consistent (resolve redirect, remove dangling references)
- Create embedded key-value databases to access entities from the dumps.
- Extract Wikidata ontology.
- Extract Wikipedia tables and convert the hyperlinks to Wikidata entities.
- Create Pyserini indices to search Wikidata’s entities.
- and more
For a full documentation, please see the website.
Installation
From PyPI (using pre-built binaries):
pip install kgdata[spark] # omit spark to manually specify its version if your cluster has different version
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
kgdata-7.0.1.tar.gz
(150.1 kB
view hashes)
Built Distributions
Close
Hashes for kgdata-7.0.1-pp310-pypy310_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 925cb4d485fd9e08abd5bf0968bc297006c248c348516a8fde461d11ea502b17 |
|
MD5 | 6fdb77d72642e1596cfab8a642a0458f |
|
BLAKE2b-256 | fe7136d1cb23b620b10eb8ef5fe51742e8e25fa871faa31794e9b34d99301470 |
Close
Hashes for kgdata-7.0.1-pp39-pypy39_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | fc77434d7b3e32f87c779a379196deb962374adaf06e845e087f65a6b1e4c562 |
|
MD5 | e7799af1593e8a00b8290a6a21a7af60 |
|
BLAKE2b-256 | c89dd49a58997d091ebac15d4586c147d7aaf6957840f05c91d6df45b2c52a70 |
Close
Hashes for kgdata-7.0.1-cp312-none-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1ebc82e0c7af6a0661585c9826d52383783967a5fdb7a1255901bc45a4100aab |
|
MD5 | a7d048d27584e5a5e639aa7efd058ea0 |
|
BLAKE2b-256 | 620b3e21bfb6a4be8db95254728f18017b9320c8e49d2dde525b661d9ebb1d74 |
Close
Hashes for kgdata-7.0.1-cp312-cp312-manylinux_2_35_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b5ee5fb77b8fb253a3f09990ea026fadadc165df87dd36942dd460a9c28f12c3 |
|
MD5 | 06bc0b351e46c1e4936885a173d22151 |
|
BLAKE2b-256 | 87755a102894a67141cdad5ca4e830eae8055dc785b8b4bf7c813f272d8f81bc |
Close
Hashes for kgdata-7.0.1-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 04e40ddd2ae7537327cb81b18970c72793ddc151ae09eda5d4af56ea8ef46956 |
|
MD5 | c4b7b2843c58cbcff80973a487eb3044 |
|
BLAKE2b-256 | 54eb28c0218727d43fc52a932ad17b98dad4be72cd0cdb799589b48790597835 |
Close
Hashes for kgdata-7.0.1-cp312-cp312-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4557f95da135b7cb22165b526671ce4631b8d44cab0321499174a3c488ccc175 |
|
MD5 | 88593d91d71606c12583ca92adb66f08 |
|
BLAKE2b-256 | 7e4d9f2bda4daa1e9fe785399e7b5417ace4ee9baeb80e1215b6d1b6c2f5cb98 |
Close
Hashes for kgdata-7.0.1-cp311-none-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9deeb0ccd096976d29fcbd59a0bdec35fc67b618b13c4969d26b1b8cac633626 |
|
MD5 | 4d7d4152998f3118e92ca687af77521b |
|
BLAKE2b-256 | 1be59e153914c599f2c14178bf6b7eebd04e77e116e0d32f698e861c76e5dbc6 |
Close
Hashes for kgdata-7.0.1-cp311-cp311-manylinux_2_35_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8c82afea6c81319273402afc16aa9995391e644061a690ff66121e69a8108627 |
|
MD5 | 503bcc19050c208cb62478b1c0641324 |
|
BLAKE2b-256 | e2c4c0f491e43d6adf14b642dbf6b24dacc43a3589f921a22c9b9db00ba80cfb |
Close
Hashes for kgdata-7.0.1-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | bd5e6a76f2cf2adcaa52ea421629196e47153a3f2b3ec0443343e6004448112a |
|
MD5 | 62aac918c27e74206eaae8c62f82a229 |
|
BLAKE2b-256 | 3c6d39a07392d241354307bc07512be50ff1fa9760a2e3c19d72df5568521a04 |
Close
Hashes for kgdata-7.0.1-cp311-cp311-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 963c7cfdcf9de93e0639617ad19fd0478ce5b7f151710108a981b3883b19c83a |
|
MD5 | 34d9486f3c3602d35f571b477a709b0b |
|
BLAKE2b-256 | 02757608907d789f2b9562bdb0156ac6e99856ca97d200f1b033e38f394c7fde |
Close
Hashes for kgdata-7.0.1-cp310-none-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ea44fc7687837b41c980eeffa044669b4dfbb5bf256ac1030f5181377b9b3dd3 |
|
MD5 | cd064e94b338a9ab2e243f34c9d47db7 |
|
BLAKE2b-256 | 8fe27c9fa8c6d702dd75f42753678dc88be08776ee6686cd016e5d1b27748318 |
Close
Hashes for kgdata-7.0.1-cp310-cp310-manylinux_2_35_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 809a39fa289c0256dfc0a845f8f457b789c1f618e7ac8ab67e2e82f050a8697c |
|
MD5 | b257fef04b93b47f78153c290dce249b |
|
BLAKE2b-256 | 1f307fbced35364fef44d0cf30918443fa49abe129f117a260870db0c66087af |
Close
Hashes for kgdata-7.0.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3255476f3675d6887272683ffc4985e7617dbb2bcd036c4fa00cbe516af08e13 |
|
MD5 | edee53e4363b7bdae619400f1df9ed96 |
|
BLAKE2b-256 | 19631287267b9747be3ec257d843bb1f04c172b8935df33d5b5f75c3dbf49850 |
Close
Hashes for kgdata-7.0.1-cp310-cp310-macosx_10_14_x86_64.macosx_11_0_arm64.macosx_10_14_universal2.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a3460d774e1983e510b6fe8c17402731a35a2f2bd291e2ebb0912e4e3ed18f12 |
|
MD5 | 17ec779662e453181bd5ea61919f2c43 |
|
BLAKE2b-256 | ace4a27917f652d67eb02a7b8528444decafcf6d6303581dc5094a867a8ea89e |
Close
Hashes for kgdata-7.0.1-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 70d2ccbc6e56d16500c7a826366109590fec62639c819d90a082656bf0330ab6 |
|
MD5 | 8039459fe1b804f77f32b754b3447e92 |
|
BLAKE2b-256 | 202497963033949936a2e09cbe3f3bf94c29bc7c415e07a4d7bca687bc0543a5 |