Unicode support libraries

jkUnicode

A Python module for Unicode, glyph name, and orthography information.

The orthography functions can be used via the command line script ortho. The Unicode info for one or more codepoints can be shown via the command uniinfo.

For using the module from inside Python, see the docs.

uniinfo

uniinfo – Show information about Unicode codepoints.

Usage

uniinfo [-h] codepoint [codepoint ...]

Codepoints can be given in decimal (e.g. 7838), hexadecimal (e.g. 0x1e9e), or Unicode (U+1E9E) notation.
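The three notations above all map to the same integer. As a minimal sketch of how such arguments could be normalized (the helper `parse_codepoint` is hypothetical and is not part of jkUnicode; only the standard library is used):

```python
import unicodedata


def parse_codepoint(text: str) -> int:
    """Parse decimal ("7838"), hexadecimal ("0x1e9e"), or U+ ("U+1E9E") notation.

    Hypothetical helper for illustration; not jkUnicode's own parser.
    """
    value = text.strip()
    if value.upper().startswith("U+"):
        return int(value[2:], 16)
    if value.lower().startswith("0x"):
        return int(value, 16)
    return int(value, 10)


for arg in ("7838", "0x1e9e", "U+1E9E"):
    cp = parse_codepoint(arg)
    # unicodedata.name() takes a default for codepoints without a name
    print(f"{arg} -> U+{cp:04X} {unicodedata.name(chr(cp), '<unnamed>')}")
```

All three arguments resolve to U+1E9E, LATIN CAPITAL LETTER SHARP S.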

ortho

ortho – Query fonts about orthographic support.

Usage

ortho [-h] [-b] [-f] [-i] [-k] [-m] [-p] [-n NEAR_MISS] [-s SUPPORT] font [font ...]

Options

-b

-b | --bcp47

Output orthographies as BCP47 language subtags instead of friendly names.

-f

-f | --full-only

When called without any options, ortho determines the orthographic support of the supplied font(s) by checking only the required characters of each orthography. With the -f option, it lists only those orthographies for which all required and all optional characters are present in the font.

Example
$ ortho ComicJens.ttf 
The font supports 104 orthographies:
Afrikaans
Albanian
Asu
Azeri
Basque
Bemba
Bena
Bosnian
Catalan
[...]
Zulu

$ ortho -f ComicJens.ttf
The font supports 98 orthographies:
Afrikaans
Albanian
Asu
Azeri
Basque
Bemba
Bena
Bosnian
Catalan
[...]
Zulu
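The difference between the default check and -f can be pictured as two subset tests. The following is only a sketch under stated assumptions: the `Orthography` class, the `supported` function, and the sample data ("Alpha", "Beta") are invented for illustration and do not reflect jkUnicode's internal data model or real orthography definitions.

```python
from dataclasses import dataclass, field


@dataclass
class Orthography:
    """Hypothetical stand-in for one orthography's character sets."""
    name: str
    required: set[int]
    optional: set[int] = field(default_factory=set)


def supported(cmap: set[int], orthographies: list[Orthography],
              full_only: bool = False) -> list[str]:
    """Return the names of orthographies covered by the codepoints in cmap.

    Default: every required character must be present.
    full_only=True: required and optional characters must all be present.
    """
    names = []
    for o in orthographies:
        needed = (o.required | o.optional) if full_only else o.required
        if needed <= cmap:  # subset test
            names.append(o.name)
    return sorted(names)


# Invented sample data, not real orthography definitions
cmap = {ord(c) for c in "abc"}
orthographies = [
    Orthography("Alpha", required={ord("a"), ord("b")}, optional={ord("é")}),
    Orthography("Beta", required={ord("b"), ord("c")}),
]

print(supported(cmap, orthographies))                  # required characters only
print(supported(cmap, orthographies, full_only=True))  # required and optional
```

In this sample, "Alpha" passes the default check but drops out under `full_only=True` because its optional character is missing, which mirrors how the -f list above is shorter than the default list.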

-i

-i | --minimum-inclusive

Prints a list of orthographies for which at least all characters from the basic category are present in the font.

Example
$ ortho -i ComicJens-Italic.ttf
The font has minimal or better support for 123 orthographies:
Afrikaans
Albanian
Asu
Azeri
[...]
Zulu

-k

-k | --kill-list

Output a list of letters that don't appear together in any supported orthography.
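One possible reading of this description is "pairs of letters that never occur within the same orthography". The sketch below implements that reading only; the function `never_together` and the sample data are hypothetical, and the actual selection logic and output format of ortho -k may differ.

```python
from itertools import combinations


def never_together(orthography_chars: dict[str, set[str]]) -> list[tuple[str, str]]:
    """Return letter pairs that share no orthography.

    orthography_chars maps an orthography name to its set of letters.
    Hypothetical illustration; not jkUnicode's implementation.
    """
    all_chars = sorted(set().union(*orthography_chars.values()))
    together = set()
    for chars in orthography_chars.values():
        # Sorting keeps each pair in the same order as pairs built from all_chars
        together.update(combinations(sorted(chars), 2))
    return [pair for pair in combinations(all_chars, 2) if pair not in together]


# Invented sample data, not real orthography definitions
sample = {
    "Sample A": {"a", "ß"},
    "Sample B": {"a", "ð"},
}
print(never_together(sample))
```

Here "ß" and "ð" form the only pair that no single sample orthography contains, so a pair like this would not need to be considered together, for example when kerning.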

-m

-m | --minimum

Report orthographies that have only basic support, i.e. no optional characters and no punctuation present.

-p

-p | --punctuation

Prints a list of orthographies for which all letter category characters are present in the font, but some punctuation category characters are missing. For each missing character, the codepoint, glyph name, and Unicode name are reported.

Example
$ ortho -p ComicJens.ttf
Orthographies which can be supported by adding punctuation characters:

Scottish Gaelic
    0x204A	uni204A	Tironian Sign Et

-n

-n NEAR_MISS | --near-miss NEAR_MISS

Prints a list of orthographies that the font would support if at most NEAR_MISS additional characters were added. For each missing character, the codepoint, glyph name, and Unicode name are reported.

Example
$ ortho -n 1 ComicJens.ttf
Orthographies which can be supported with max. 1 additional character:

Breton
    0x02BC	uni02BC	Modifier Letter Apostrophe

Hawaiian
    0x02BB	uni02BB	Modifier Letter Turned Comma

Quechua
    0x02BC	uni02BC	Modifier Letter Apostrophe

Tongan
    0x02BB	uni02BB	Modifier Letter Turned Comma
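The near-miss idea amounts to a set difference with an upper bound on its size. A minimal sketch, assuming only required characters are counted (the text does not say which categories ortho considers here); `near_misses` and the sample data are hypothetical:

```python
def near_misses(cmap: set[int], required_by_name: dict[str, set[int]],
                max_missing: int) -> dict[str, list[int]]:
    """Map orthography name to its missing codepoints, for orthographies
    that lack between 1 and max_missing required characters.

    Hypothetical illustration; not jkUnicode's implementation.
    """
    report = {}
    for name, required in required_by_name.items():
        missing = required - cmap
        if 0 < len(missing) <= max_missing:
            report[name] = sorted(missing)
    return report


# Invented sample data, not real orthography definitions
cmap = {ord(c) for c in "abc"}
base = {ord("a"), ord("b")}
required_by_name = {
    "Sample A": base,                     # already supported, not a near miss
    "Sample B": base | {0x02BB},          # one character short
    "Sample C": base | {0x02BB, 0x02BC},  # two characters short
}

for name, missing in near_misses(cmap, required_by_name, 1).items():
    print(name, [f"0x{cp:04X}" for cp in missing])
```

With `max_missing=1` only "Sample B" is reported; raising the limit to 2 would also include "Sample C".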

-s

-s SUPPORT | --support SUPPORT

Prints a report of the characters that are missing from the font for the orthography specified by the supplied BCP47 language subtag. For each missing character, the codepoint, glyph name, and Unicode name are reported. Can be combined with -m to show only missing required characters, or with -p to show only missing punctuation.

Example
$ ortho -p -s gd ComicJens.ttf
Scottish Gaelic
    0x204A	uni204A	Tironian sign et

$ ortho -m -s agq ComicJens.ttf
Aghem
    0x0186	uni0186	Latin capital letter Open O
    0x0190	uni0190	Latin capital letter Open E
    0x0197	uni0197	Latin capital letter I with stroke
    0x0244	uni0244	Latin capital letter U Bar
    0x0254	uni0254	Latin small letter open o
    0x025B	uni025B	Latin small letter open e
    0x0268	uni0268	Latin small letter i with stroke
    0x0289	uni0289	Latin small letter u bar
    0x0294	uni0294	Latin letter glottal stop
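The report lines in the examples follow a tab-separated pattern of codepoint, glyph name, and Unicode name. The sketch below reproduces that layout with the standard library only. It is an approximation: `glyph_name` and `report_line` are hypothetical helpers, the glyph name is always built with the generic uniXXXX / uXXXXX convention (jkUnicode may print friendlier names for many characters), and the name casing is simply title case.

```python
import unicodedata


def glyph_name(cp: int) -> str:
    """Generic fallback glyph name: uniXXXX in the BMP, uXXXXX above it."""
    return f"uni{cp:04X}" if cp <= 0xFFFF else f"u{cp:05X}"


def report_line(cp: int) -> str:
    """Format one missing character as: codepoint, glyph name, Unicode name."""
    name = unicodedata.name(chr(cp), "<unnamed>").title()
    return f"    0x{cp:04X}\t{glyph_name(cp)}\t{name}"


for cp in (0x204A, 0x02BC):
    print(report_line(cp))
```

For U+204A this yields the same line as the Scottish Gaelic entry in the -p example above.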

Download files

Source Distribution

jkUnicode-2.0.4.tar.gz (505.2 kB view details)

Uploaded Source

Built Distribution

jkUnicode-2.0.4-py2.py3-none-any.whl (522.0 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file jkUnicode-2.0.4.tar.gz.

File metadata

  • Download URL: jkUnicode-2.0.4.tar.gz
  • Upload date:
  • Size: 505.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.4

File hashes

Hashes for jkUnicode-2.0.4.tar.gz
Algorithm Hash digest
SHA256 cc192ca188c7036a46f2906e18f84267556c9d9bf3f06644dc1af341d88c44a2
MD5 24c7a2a86ab92f13c84821623409c1ed
BLAKE2b-256 19a1a49fbeaa6330692e985aada2a6f8d4afe48ace42c833675d48689b4e9248

File details

Details for the file jkUnicode-2.0.4-py2.py3-none-any.whl.

File metadata

  • Download URL: jkUnicode-2.0.4-py2.py3-none-any.whl
  • Upload date:
  • Size: 522.0 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.4

File hashes

Hashes for jkUnicode-2.0.4-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 b285f213f95ed49d376c52ec4a48cb6dfd95f944385e4688e4c26cc4c14660e6
MD5 1a2c9d49cb6d9d67ecb1ba73feb09d70
BLAKE2b-256 8cac15fe2e020ce9c7aa080ca5df847e645e1d6996cfe0035e7abdd3a31b5092
