Skip to main content

Command-line tool to display Unicode character information

Project description

uni

NAME

uni - display Unicode character information

SYNOPSIS

uni [option]... [character]...

DESCRIPTION

Displays information about Unicode characters.

In the absence of a name match option, each character option can independently be any one of:

  • a single character;
  • an integer code point number;
  • a character name;
  • the form U+hexdigits.

NAME MATCH OPTIONS

--egrep, -e

The character arguments are extended regular expressions, matched against character names.

--glob, -g

The character arguments are shell-style patterns, matched against character names.

--html, -H

The character arguments are HTML entity names. Surrounding the name with & and ; is optional.

--match, -m

The character arguments are text that can match anywhere in a character name.

--string, -S

The character arguments are treated as sequences of individual characters. For example, if the argument is tilde, instead of reporting the single character ~, the result will be the five characters t, i, l, d, and e.

--word, -w

The character arguments are full words that must appear in a character name.

BLOCK MATCH OPTION

--block=block, -b block

Limits matches to characters within the given block(s). (Unlike other match options, this is always restrictive even with --any.) A block can be a Unicode block name. It can also be any of the character forms, designating the block containing that character.

PROPERTY MATCH OPTIONS

Currently, these are not user-friendly; the argument must exactly match the form returned by Python unicodedata.

  • --bidi=bidi
  • --category=category
  • --combining=combining
  • --decimal=decimal
  • --decomposition=decomposition
  • --digit=digit
  • --mirrored=mirrored
  • --numeric=numeric
  • --width=width

GENERAL MATCH OPTIONS

--all

Given multiple match conditions, all must apply in order for a character to be selected. This is the default.

--any, -a

Given multiple match conditions, any single match is sufficient for a character to be selected.

FORMAT OPTIONS

--format=format, -f format

Print according to a format string. The string may contain keywords surrounded by curly braces, and other text printed as-is.

The keywords are:

  • {bidirectional} - the character's bidirectional class.
  • {block} - the name of the block containing the character.
  • {category} - the character's category.
  • {char} - the character.
  • {combining} - the character's combining value.
  • {decimal} - the character's decimal value.
  • {decomposition} - the character's decomposition.
  • {digit} - the character's digit value.
  • {eol} - the end-of-line character (newline unless changed by command-line options).
  • {html} - the HTML entity name for the character.
  • {id} - the character name with blanks and hyphends replaced by _.
  • {mirrored} - whether the character is mirrored in bidirectional text.
  • {name} - the character's name.
  • {nfc} - the NFC normalization form, as character names.
  • {NFC} - the NFC normalization form.
  • {nfd} - the NFD normalization form, as character names.
  • {NFD} - the NFD normalization form.
  • {nfkc} - the NFKC normalization form, as character names.
  • {NFKC} - the NFKC normalization form.
  • {nfkd} - the NFKD normalization form, as character names.
  • {NFKD} - the NFKD normalization form.
  • {numeric} - the character's numeric value.
  • {ordinal} - the character's code point number.
  • {utf8} - the UTF-8 encoding of the character, as a sequence of two-digit hexadecimal numbers.
  • {utf16} - the UTF-16 encoding of the character, as one or two four-digit hexadecimal numbers.
  • {u} - the code point number, in the form U+hexdigits.
  • {un} - as {u}, but zero-padded to at least n digits.
  • {v} - the code point number, in decimal.
  • {width} - the character's width class (‘East Asian width’).
  • {x} - the code point number, in hexdecimal (with no prefix).

--char

Print the character alone. Equivalent to --format '{char}'.

--compose

Print in a form useful in XCompose files. Equivalent to --format ': "{char}" U{x} # {name}'.

--long

Print full details about the character

--name

Print the character name. Equivalent to --format '{name}'.

--short

Print the character, code, and name. This is the default. Equivalent to --format '{char} {u} {name}'.

SEPARATOR OPTIONS

--nonewline, -n

Do not print a newline between output for different characters.

--separator=eol, -s eol

Print eol between output for different characters.

--null, -0

Print an ASCII NUL between output for different characters.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

uniccin-1.9.4.tar.gz (28.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

uniccin-1.9.4-py3-none-any.whl (26.8 kB view details)

Uploaded Python 3

File details

Details for the file uniccin-1.9.4.tar.gz.

File metadata

  • Download URL: uniccin-1.9.4.tar.gz
  • Upload date:
  • Size: 28.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for uniccin-1.9.4.tar.gz
Algorithm Hash digest
SHA256 5565b172bb378cc7d0a7fd65115aa4a64a9f1add8fdfc853b77a388c7bff65e3
MD5 525081927126c46c432d69850fda0b34
BLAKE2b-256 a07f2f044e05a1b89d96236053145051d008f76064f16e59cc30fd1371c2dbda

See more details on using hashes here.

File details

Details for the file uniccin-1.9.4-py3-none-any.whl.

File metadata

  • Download URL: uniccin-1.9.4-py3-none-any.whl
  • Upload date:
  • Size: 26.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for uniccin-1.9.4-py3-none-any.whl
Algorithm Hash digest
SHA256 7518c86c744421cea928b08266df91e7f4ecb757407a99d8892391409e200536
MD5 22c91e5a77874d20439ac4c78d61a07c
BLAKE2b-256 bc4b3dbf9981b58b3d4a323bbade7f79edc285039bde6151d7d1d4f1565343ae

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page