Skip to main content

Convert MediaWiki language names and codes

Project description

MediaWiki langcodes

Convert MediaWiki language names and language codes.

Build dependencies

  • php

  • wget

Build

$ python -m venv .venv
$ source .venv/bin/activate.fish
$ python -m pip install -U pip build
$ make

Update extra language names

Extract language names from Lua modules in the Wiktionary dump file, use code from https://github.com/tatuylonen/wiktextract/blob/master/languages/get_data.py

$ python languages/get_data.py en --db-file ../en_20231220.db
$ python languages/get_data.py zh --db-file ../zh_20231220.db

then in this project folder:

// download files
$ make lang_files
// remove old extra files
$ rm build_script/extra_names/*.json
// create a db file without extra languages
$ python build_script/main.py
$ python build_script/add_extra.py en path_to_en_extra_json
// add extra languages from the English Wiktionary
$ python build_script/main.py
$ python build_script/add_extra.py zh path_to_zh_extra_json

Usage

from mediawiki_langcodes import code_to_name, name_to_code

code_to_name("fr")  # return "français"
code_to_name("fr", "en")  # return "French"
name_to_code("français")  # return "fr"
name_to_code("français", "fr")  # return "fr"

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mediawiki_langcodes-0.2.0.tar.gz (4.5 MB view hashes)

Uploaded Source

Built Distribution

mediawiki_langcodes-0.2.0-py3-none-any.whl (4.5 MB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page