Use MediaWiki Wiki page content as read-only database
Project description
wiki_as_base-py
[MVP] Use MediaWiki Wiki page content as read-only database. Python library implementation. See https://github.com/fititnt/openstreetmap-serverless-functions/tree/main/function/wiki-as-base
Installing
pip install wiki_as_base --upgrade
Usage
Environment variables
Customize for your needs. They're shared between command line and the library.
export WIKI_API='https://wiki.openstreetmap.org/w/api.php'
export WIKI_INFOBOXES='ValueDescription\nKeyDescription'
export WIKI_DATA_LANGS='yaml\nturtle'
Command line
wiki_as_base --help
## Use remote storage (defined on WIKI_API)
wiki_as_base --page-title 'User:EmericusPetro/sandbox/Wiki-as-base'
# The output is JSON-LD. Feel free to further filter the data
wiki_as_base --page-title 'User:EmericusPetro/sandbox/Wiki-as-base' | jq .data[0]
## Example of, instead of use WIKI_API, parse Wiki markup directly. Output JSON- LD
cat tests/data/multiple.wiki.txt | wiki_as_base --input-stdin
## Output zip file instead of JSON-LD. --verbose also adds wikiasbase.jsonld to file
cat tests/data/chatbot-por.wiki.txt | wiki_as_base --input-stdin --verbose --output-zip-file tests/temp/chatbot-por.zip
Library
Disclaimer / Trivia
The wiki_as_base allows no-as-complete data extraction from MediaWiki markup text directly by its API or direct input, without need to install server extensions.
Check also the wikimedia/Wikibase, a full server version (which inspired the name).
License
Public domain
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
wiki_as_base-0.5.1.tar.gz
(11.1 kB
view hashes)
Built Distribution
Close
Hashes for wiki_as_base-0.5.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9371eba0b65c5d90271e42c5adfefd8adf33f83169516d3da6f3bfbd49c308a8 |
|
MD5 | bfbaf54e1d326d8b46a7df52abad1709 |
|
BLAKE2b-256 | 805108ecb225e8dd8bcacbdde3e90ecb5161d3d274eedead25088e8283dee85d |