Pythonic wrapper around the Wikidata SPARQL API for quick creation of datasets from Wikidata
wikidataloader
Easy pythonic wrapper around the Wikidata SPARQL API for quick creation of datasets from Wikidata.
It supports only simple, non-recursive queries; for complex queries, use the SPARQL API provided by Wikidata directly.
Complex operators (ordering, datetime conversion, string or numeric filtering, etc.) are not supported either, because the same effect can be achieved by processing the dataset in Python after retrieval.
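As a rough illustration of that kind of processing after retrieval, the sketch below converts date strings, filters by year, and sorts the rows in plain Python. The records, the column name `born`, and the helper names are made up for this example; they assume the retrieved dataset has been turned into a list of dictionaries (for a pandas dataframe, `DataFrame.to_dict("records")` produces that shape) and that dates arrive as ISO-style strings ending in `Z`.

```python
from datetime import datetime


def parse_wikidata_date(text):
    """Parse a timestamp such as '1875-01-09T00:00:00Z' into a datetime."""
    return datetime.strptime(text, "%Y-%m-%dT%H:%M:%SZ")


def postprocess(records, min_year):
    """Convert dates, keep rows born in or after min_year, sort by birth date."""
    kept = []
    for row in records:
        born = parse_wikidata_date(row["born"])  # datetime conversion
        if born.year >= min_year:                # numeric filtering
            kept.append({**row, "born": born})
    return sorted(kept, key=lambda r: r["born"])  # ordering


# Made-up records, only to show the shape of the data (not real Wikidata output).
records = [
    {"item": "Person C", "born": "1901-05-02T00:00:00Z"},
    {"item": "Person A", "born": "1850-11-20T00:00:00Z"},
    {"item": "Person B", "born": "1875-01-09T00:00:00Z"},
]

for row in postprocess(records, min_year=1860):
    print(row["item"], row["born"].date())
```

With a dataframe, the same steps would more likely be written with pandas operations; the plain-Python version is used here only to keep the sketch free of dependencies.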
Usage
Look up the IDs of properties (e.g. P31) and items (e.g. Q5) using the search on Wikidata.
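If you prefer to look up IDs from a script rather than on the website, one option is Wikidata's `wbsearchentities` API action. The sketch below is independent of wikidataloader: the function names and the `User-Agent` string are placeholders chosen for this example, and `search_ids` needs network access.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

API_URL = "https://www.wikidata.org/w/api.php"


def build_search_url(term, entity_type="item", language="en", limit=5):
    """Build a wbsearchentities URL; entity_type is 'item' or 'property'."""
    params = {
        "action": "wbsearchentities",
        "search": term,
        "type": entity_type,
        "language": language,
        "limit": limit,
        "format": "json",
    }
    return f"{API_URL}?{urlencode(params)}"


def search_ids(term, entity_type="item"):
    """Return (id, label) pairs for the search term. Requires network access."""
    request = Request(
        build_search_url(term, entity_type),
        headers={"User-Agent": "id-lookup-example/0.1"},  # placeholder value
    )
    with urlopen(request) as response:
        payload = json.load(response)
    return [(hit["id"], hit.get("label", "")) for hit in payload.get("search", [])]


# Only builds the URL; call search_ids("instance of", "property") to query the API.
print(build_search_url("instance of", entity_type="property"))
```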

    from wikidataloader import WikidataQuery

    # Linguists from Germany with birth places and gender
    results = WikidataQuery.search(
        # {instance_of: human, country_of_citizenship: Germany, occupation: linguist}
        filters={"P31": "Q5", "P27": "Q183", "P106": "Q14467526"},
        # selects the properties "Gender" and "Birth Place" as columns in the dataframe and names them "Gender" and "City of Birth"
        select=[("P21", "Gender"), ("P19", "City of Birth")],
        # returns a maximum of 5 results
        limit=5,
        # retrieves labels in English, if available
        default_language="en"
    ).to_pandas()
    results
    >>>                   item  Gender        City_of_Birth
    >>> 0       Hermann Weller    male     Schwäbisch Gmünd
    >>> 1            Hans Wehr    male              Leipzig
    >>> 2      Theodor Haecker    male            Mulfingen
    >>> 3  Gottfried Bernhardy    male  Gorzów Wielkopolski
    >>> 4   Wilhelm Streitberg    male   Rüdesheim am Rhein

For more examples, see example.ipynb
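For readers who want to see what a simple, non-recursive query of this kind looks like in SPARQL, the sketch below builds a query string from the same `filters`, `select`, and `limit` values. It is an illustration only: `build_sparql` is a hypothetical helper, and the query wikidataloader actually generates may differ. It relies on the `wd:`, `wdt:`, `wikibase:`, and `bd:` prefixes that the Wikidata Query Service predefines.

```python
def build_sparql(filters, select, limit=None, language="en"):
    """Build a simple, non-recursive SPARQL query string (illustrative only)."""
    variables = ["?item", "?itemLabel"]
    patterns = []

    # Each filter becomes one required triple pattern on ?item.
    for prop, value in filters.items():
        patterns.append(f"?item wdt:{prop} wd:{value} .")

    # Each selected property becomes an OPTIONAL pattern, so items that
    # lack the property are still returned (an assumption of this sketch).
    for prop, _column_name in select:
        var = f"?{prop.lower()}"
        variables.append(f"{var}Label")
        patterns.append(f"OPTIONAL {{ ?item wdt:{prop} {var} . }}")

    # The label service fills the ...Label variables in the chosen language.
    patterns.append(
        'SERVICE wikibase:label { bd:serviceParam wikibase:language "%s" . }'
        % language
    )

    body = "\n  ".join(patterns)
    query = f"SELECT {' '.join(variables)} WHERE {{\n  {body}\n}}"
    if limit is not None:
        query += f"\nLIMIT {int(limit)}"
    return query


print(
    build_sparql(
        filters={"P31": "Q5", "P27": "Q183", "P106": "Q14467526"},
        select=[("P21", "Gender"), ("P19", "City of Birth")],
        limit=5,
    )
)
```

The printed query can be pasted into the Wikidata Query Service to compare its results with the dataframe above.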
Install
Install using pip:

    pip install wikidataloader

Limitations
- Does not support recursive queries
- Does not support labels for Lexeme queries
Download files
File details
Details for the file wikidataloader-0.0.4.tar.gz.
File metadata
- Download URL: wikidataloader-0.0.4.tar.gz
- Upload date:
- Size: 17.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | c4caa1501349930142fe0216b86f1595aca8b655f616c9ef92ff7bb6cbbe5d99 |
| MD5 | cbac9125219d6a5edcc887588bde1caf |
| BLAKE2b-256 | 7b1d0dc5f2b30335d9984459d7c5b8094fe0bd3cee073b5b603afbec1eda3fef |
File details
Details for the file wikidataloader-0.0.4-py3-none-any.whl.
File metadata
- Download URL: wikidataloader-0.0.4-py3-none-any.whl
- Upload date:
- Size: 17.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | f5416d2c381ab5edbe512eeab352e3b81c8f7cb160beb58a9ea69c67dcea4662 |
| MD5 | 2dd5727436fefa3e1e5732e40d211b1a |
| BLAKE2b-256 | ce4b49e04bdc9ecfd9313ccd51adfd72d2b595993b10bea62abfeb53f93939fd |