Convert MediaWiki language names and codes
Project description
MediaWiki langcodes
Convert MediaWiki language names and language codes.
Build dependencies
-
php
-
wget
Build
$ python -m venv .venv
$ source .venv/bin/activate.fish
$ python -m pip install -U pip build
$ make
Update extra language names
Extract language names from Lua modules in the Wiktionary dump file, use code from https://github.com/tatuylonen/wiktextract/blob/master/languages/get_data.py
$ python languages/get_data.py en --db-file ../en_20231220.db
$ python languages/get_data.py zh --db-file ../zh_20231220.db
then in this project folder:
// download files
$ make lang_files
// remove old extra files
$ rm build_script/extra_names/*.json
// create a db file without extra languages
$ python build_script/main.py
$ python build_script/add_extra.py en path_to_en_extra_json
// add extra languages from the English Wiktionary
$ python build_script/main.py
$ python build_script/add_extra.py zh path_to_zh_extra_json
Usage
from mediawiki_langcodes import code_to_name, name_to_code
code_to_name("fr") # return "français"
code_to_name("fr", "en") # return "French"
name_to_code("français") # return "fr"
name_to_code("français", "fr") # return "fr"
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for mediawiki_langcodes-0.2.1.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | fdb6c114bcbb32ef4819ca12128f1168e03c222d9347b5acac019daed2df7c2c |
|
MD5 | 76aeba732706c68e79a026cc8514bc36 |
|
BLAKE2b-256 | 30cdb2dcfd5a97f8a9d542219ad44b3bdd22ed397f43061d52fde78419f77167 |
Close
Hashes for mediawiki_langcodes-0.2.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8ae4fb9c00e46ef79153ec5565929438a28a8d52a201824aa1546b0caeace2d6 |
|
MD5 | a8ad1795b4c7864015a289c7f1ae14e6 |
|
BLAKE2b-256 | f69c9ee3f364c0996bcdc01547610ee0ab8cfecb0599ea12bc37efec46051c05 |