Command-line tool to display Unicode character information

uni

NAME

uni - display Unicode character information

SYNOPSIS

uni [option]... [character]...

DESCRIPTION

Displays information about Unicode characters.

In the absence of a name match option, each character argument can independently be any one of:

  • a single character;
  • an integer code point number;
  • a character name;
  • the form U+hexdigits.
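The four argument forms above can be sketched in Python. This is only an illustration, not the tool's own code: the helper name parse_character, the order in which the forms are tried, and the acceptance of 0x-prefixed integers are all assumptions.

```python
import unicodedata


def parse_character(arg: str) -> str:
    """Resolve one argument to a single character (hypothetical helper)."""
    if len(arg) == 1:
        # A single character stands for itself.
        return arg
    if arg.upper().startswith("U+"):
        # The form U+hexdigits.
        return chr(int(arg[2:], 16))
    try:
        # An integer code point number. Base 0 also accepts 0x... input,
        # which is an assumption about the accepted syntax.
        return chr(int(arg, 0))
    except ValueError:
        pass
    # A character name; unicodedata.lookup raises KeyError if it is unknown.
    return unicodedata.lookup(arg)
```

Under these assumptions, "~", "126", "U+007E", and "TILDE" all resolve to the same character.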

NAME MATCH OPTIONS

--egrep, -e

The character arguments are extended regular expressions, matched against character names.

--glob, -g

The character arguments are shell-style patterns, matched against character names.

--html, -H

The character arguments are HTML entity names. Surrounding the name with & and ; is optional.
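A minimal Python sketch of such an entity lookup, assuming the HTML5 entity table from the standard library's html.entities module (the table uni actually uses is not stated here). The helper name entity_to_text is hypothetical.

```python
from html.entities import html5


def entity_to_text(arg: str) -> str:
    """Map an HTML entity name to its text (hypothetical helper)."""
    # The surrounding & and ; are optional, so strip them if present.
    name = arg.removeprefix("&").removesuffix(";")
    # Keys in the html5 table end with ";"; unknown names raise KeyError.
    return html5[name + ";"]
```

Both entity_to_text("eacute") and entity_to_text("&eacute;") give the same result with this sketch.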

--match, -m

The character arguments are text that can match anywhere in a character name.

--string, -S

The character arguments are treated as sequences of individual characters. For example, if the argument is tilde, instead of reporting the single character ~, the result will be the five characters t, i, l, d, and e.

--word, -w

The character arguments are full words that must appear in a character name.
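The difference between the --egrep, --glob, --match, and --word styles can be illustrated with a rough Python sketch over the ASCII range. Several points are assumptions rather than documented behavior: matching is case-insensitive, the glob is matched against the whole name, words are split on blanks and hyphens, and Python's re module stands in for extended regular expressions. All function names are hypothetical.

```python
import fnmatch
import re
import unicodedata


def names(limit: int = 0x80):
    """Yield (character, name) pairs for named code points below limit."""
    for cp in range(limit):
        name = unicodedata.name(chr(cp), "")  # unnamed code points give ""
        if name:
            yield chr(cp), name


def egrep(pattern: str) -> list:
    # Regular expression, searched anywhere in the name.
    rx = re.compile(pattern, re.IGNORECASE)
    return [c for c, n in names() if rx.search(n)]


def glob(pattern: str) -> list:
    # Shell-style pattern, here matched against the whole name.
    return [c for c, n in names() if fnmatch.fnmatchcase(n, pattern.upper())]


def match(text: str) -> list:
    # Plain text found anywhere in the name.
    return [c for c, n in names() if text.upper() in n]


def word(text: str) -> list:
    # Text equal to one whole word of the name.
    return [c for c, n in names() if text.upper() in re.split(r"[ -]", n)]
```

With this sketch, match("sig") finds names containing SIGN, while word("sig") finds nothing because SIG is not a complete word in any name.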

BLOCK MATCH OPTION

--block=block, -b block

Limits matches to characters within the given block(s). (Unlike other match options, this is always restrictive even with --any.) A block can be a Unicode block name. It can also be any of the character forms, designating the block containing that character.

PROPERTY MATCH OPTIONS

Currently, these are not user-friendly; the argument must exactly match the form returned by Python's unicodedata module.

  • --bidi=bidi
  • --category=category
  • --combining=combining
  • --decimal=decimal
  • --decomposition=decomposition
  • --digit=digit
  • --mirrored=mirrored
  • --numeric=numeric
  • --width=width
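To see which value forms the unicodedata module returns, a short Python session such as the following may help. The pairing of each option with a unicodedata function is inferred from the option names and is an assumption; how uni compares an argument against numeric values is not stated here.

```python
import unicodedata

# Sample values as returned by the standard unicodedata module.
properties = {
    "bidi": unicodedata.bidirectional("A"),               # 'L'
    "category": unicodedata.category("A"),                # 'Lu'
    "combining": unicodedata.combining("\u0301"),         # 230
    "decimal": unicodedata.decimal("7"),                  # 7
    "decomposition": unicodedata.decomposition("\u00e9"), # '0065 0301'
    "digit": unicodedata.digit("7"),                      # 7
    "mirrored": unicodedata.mirrored("("),                # 1
    "numeric": unicodedata.numeric("\u00bd"),             # 0.5
    "width": unicodedata.east_asian_width("A"),           # 'Na'
}

for option, value in properties.items():
    print(f"--{option}={value}")
```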

GENERAL MATCH OPTIONS

--all

Given multiple match conditions, all must apply in order for a character to be selected. This is the default.

--any, -a

Given multiple match conditions, any single match is sufficient for a character to be selected.
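The two combination modes correspond to Python's all() and any() over a list of conditions. The sketch below uses two made-up conditions (a name containing LETTER, and category Lu) purely for illustration; the names conditions and selected are hypothetical.

```python
import unicodedata

# Two example match conditions, each a predicate on one character.
conditions = [
    lambda ch: "LETTER" in unicodedata.name(ch, ""),
    lambda ch: unicodedata.category(ch) == "Lu",
]


def selected(ch: str, any_mode: bool = False) -> bool:
    """Return True if ch is selected under --all (default) or --any."""
    combine = any if any_mode else all
    return combine(cond(ch) for cond in conditions)
```

Here "A" satisfies both conditions, while "a" satisfies only the first, so it is selected only when any_mode is true.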

FORMAT OPTIONS

--format=format, -f format

Print according to a format string. The string may contain keywords surrounded by curly braces, and other text printed as-is.

The keywords are:

  • {bidirectional} - the character's bidirectional class.
  • {block} - the name of the block containing the character.
  • {category} - the character's category.
  • {char} - the character.
  • {combining} - the character's combining value.
  • {decimal} - the character's decimal value.
  • {decomposition} - the character's decomposition.
  • {digit} - the character's digit value.
  • {eol} - the end-of-line character (newline unless changed by command-line options).
  • {html} - the HTML entity name for the character.
  • {id} - the character name with blanks and hyphens replaced by _.
  • {mirrored} - whether the character is mirrored in bidirectional text.
  • {name} - the character's name.
  • {nfc} - the NFC normalization form, as character names.
  • {NFC} - the NFC normalization form.
  • {nfd} - the NFD normalization form, as character names.
  • {NFD} - the NFD normalization form.
  • {nfkc} - the NFKC normalization form, as character names.
  • {NFKC} - the NFKC normalization form.
  • {nfkd} - the NFKD normalization form, as character names.
  • {NFKD} - the NFKD normalization form.
  • {numeric} - the character's numeric value.
  • {ordinal} - the character's code point number.
  • {utf8} - the UTF-8 encoding of the character, as a sequence of two-digit hexadecimal numbers.
  • {utf16} - the UTF-16 encoding of the character, as one or two four-digit hexadecimal numbers.
  • {u} - the code point number, in the form U+hexdigits.
  • {un} - as {u}, but zero-padded to at least n digits.
  • {v} - the code point number, in decimal.
  • {width} - the character's width class (‘East Asian width’).
  • {x} - the code point number, in hexadecimal (with no prefix).
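As a rough illustration of how several of these keywords could be computed and substituted, here is a Python sketch built on str.format and the standard unicodedata module. The helper name fields is hypothetical, and the letter case, zero-padding, and separators chosen below are assumptions; the tool's actual output may differ.

```python
import unicodedata


def fields(ch: str) -> dict:
    """Compute a subset of the format keywords for one character."""
    cp = ord(ch)
    name = unicodedata.name(ch, "")
    nfd = unicodedata.normalize("NFD", ch)
    utf16 = ch.encode("utf-16-be").hex()
    return {
        "char": ch,
        "name": name,
        "id": name.replace(" ", "_").replace("-", "_"),
        "category": unicodedata.category(ch),
        "ordinal": cp,
        "u": f"U+{cp:X}",   # padding and case are assumptions
        "v": str(cp),
        "x": f"{cp:x}",
        # UTF-8 as two-digit hexadecimal numbers, one per byte.
        "utf8": " ".join(f"{b:02x}" for b in ch.encode("utf-8")),
        # UTF-16 as one or two four-digit hexadecimal code units.
        "utf16": " ".join(utf16[i:i + 4] for i in range(0, len(utf16), 4)),
        "NFD": nfd,
        "nfd": ", ".join(unicodedata.name(c, "") for c in nfd),
    }


# A format string with keywords in curly braces; other text is kept as-is.
line = "{char} {x} {name}".format(**fields("\u00e9"))
```

Keeping the computed values in a dictionary lets any combination of keywords be requested in the format string without changing the code.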

--char

Print the character alone. Equivalent to --format '{char}'.

--compose

Print in a form useful in XCompose files. Equivalent to --format ': "{char}" U{x} # {name}'.

--long

Print full details about the character.

--name

Print the character name. Equivalent to --format '{name}'.

--short

Print the character, code, and name. This is the default. Equivalent to --format '{char} {u} {name}'.

SEPARATOR OPTIONS

--nonewline, -n

Do not print a newline between output for different characters.

--separator=eol, -s eol

Print eol between output for different characters.

--null, -0

Print an ASCII NUL between output for different characters.
