Command-line tool to display Unicode character information
Project description
uni
NAME
uni - display Unicode character information
SYNOPSIS
uni [option]... [character]...
DESCRIPTION
Displays information about Unicode characters.
In the absence of a name match option, each character option can independently be any one of:
- a single character;
- an integer code point number;
- a character name;
- the form
U+hexdigits.
NAME MATCH OPTIONS
--egrep, -e
The character arguments are extended regular expressions, matched against character names.
--glob, -g
The character arguments are shell-style patterns, matched against character names.
--html, -H
The character arguments are HTML entity names.
Surrounding the name with & and ; is optional.
--match, -m
The character arguments are text that can match anywhere in a character name.
--string, -S
The character arguments are treated as sequences of individual characters.
For example, if the argument is tilde, instead of reporting the single
character ~, the result will be the five characters t, i, l, d,
and e.
--word, -w
The character arguments are full words that must appear in a character name.
BLOCK MATCH OPTION
--block=block, -b block
Limits matches to characters within the given block(s). (Unlike other match
options, this is always restrictive even with --any.) A block can be a
Unicode block name. It can also be any of the character forms, designating
the block containing that character.
PROPERTY MATCH OPTIONS
Currently, these are not user-friendly; the argument must exactly match
the form returned by Python unicodedata.
--bidi=bidi--category=category--combining=combining--decimal=decimal--decomposition=decomposition--digit=digit--mirrored=mirrored--numeric=numeric--width=width
GENERAL MATCH OPTIONS
--all
Given multiple match conditions, all must apply in order for a character to be selected. This is the default.
--any, -a
Given multiple match conditions, any single match is sufficient for a character to be selected.
FORMAT OPTIONS
--format=format, -f format
Print according to a format string. The string may contain keywords surrounded by curly braces, and other text printed as-is.
The keywords are:
{bidirectional}- the character's bidirectional class.{block}- the name of the block containing the character.{category}- the character's category.{char}- the character.{combining}- the character's combining value.{decimal}- the character's decimal value.{decomposition}- the character's decomposition.{digit}- the character's digit value.{eol}- the end-of-line character (newline unless changed by command-line options).{html}- the HTML entity name for the character.{id}- the character name with blanks and hyphends replaced by_.{mirrored}- whether the character is mirrored in bidirectional text.{name}- the character's name.{nfc}- the NFC normalization form, as character names.{NFC}- the NFC normalization form.{nfd}- the NFD normalization form, as character names.{NFD}- the NFD normalization form.{nfkc}- the NFKC normalization form, as character names.{NFKC}- the NFKC normalization form.{nfkd}- the NFKD normalization form, as character names.{NFKD}- the NFKD normalization form.{numeric}- the character's numeric value.{ordinal}- the character's code point number.{utf8}- the UTF-8 encoding of the character, as a sequence of two-digit hexadecimal numbers.{utf16}- the UTF-16 encoding of the character, as one or two four-digit hexadecimal numbers.{u}- the code point number, in the formU+hexdigits.{un}- as{u}, but zero-padded to at least n digits.{v}- the code point number, in decimal.{width}- the character's width class (‘East Asian width’).{x}- the code point number, in hexdecimal (with no prefix).
--char
Print the character alone.
Equivalent to --format '{char}'.
--compose
Print in a form useful in XCompose files.
Equivalent to --format ': "{char}" U{x} # {name}'.
--long
Print full details about the character
--name
Print the character name.
Equivalent to --format '{name}'.
--short
Print the character, code, and name. This is the default.
Equivalent to --format '{char} {u} {name}'.
SEPARATOR OPTIONS
--nonewline, -n
Do not print a newline between output for different characters.
--separator=eol, -s eol
Print eol between output for different characters.
--null, -0
Print an ASCII NUL between output for different characters.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file uniccin-1.9.5.tar.gz.
File metadata
- Download URL: uniccin-1.9.5.tar.gz
- Upload date:
- Size: 28.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fb9e872501ceae040429f33cc8939c5134cd6e05411ad09702597ca8b2b3dbd7
|
|
| MD5 |
387d080190b848e1ec3a82353b98bd03
|
|
| BLAKE2b-256 |
57610d1a11d6ec71c0c23a01d2f8f5cd2e4edc93826f8c0cd832bfd125aa98be
|
File details
Details for the file uniccin-1.9.5-py3-none-any.whl.
File metadata
- Download URL: uniccin-1.9.5-py3-none-any.whl
- Upload date:
- Size: 27.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3ae649378fed59e3d8d7ba682aafd78b71b411d15588c7584fe8aa0c85530e20
|
|
| MD5 |
cbf15f4f7b9b18da3aa8832419c1993c
|
|
| BLAKE2b-256 |
04c0e9a6da0cc9099b7aeec7490ef53fbe27678ead04c4f555f2462a3043cf81
|