
Unicodedata with extensions for additional properties.


unicodedataplus

Drop-in replacement for unicodedata with extensions for additional Unicode properties.
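
Because the package is described as a drop-in replacement, existing code that uses the standard unicodedata API should keep working after swapping the import. The following is a minimal sketch of that idea; the fallback to the standard library and the helper name summarize are illustrative assumptions, not part of the package.

```python
# Prefer unicodedataplus when it is installed; otherwise fall back to the
# standard library module, which offers the same core API.
try:
    import unicodedataplus as unicodedata
except ImportError:
    import unicodedata


def summarize(ch):
    """Return (name, general category) using only the standard unicodedata API.

    `summarize` is a hypothetical helper used for illustration.
    """
    return unicodedata.name(ch), unicodedata.category(ch)


print(summarize("A"))
```

With this import pattern, the additional property functions listed below are only available when unicodedataplus itself was imported.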

Currently supported additional Unicode properties:

  • Script: script(chr)
  • Script Extensions: script_extensions(chr)
  • Block: block(chr)
  • Indic Conjunct Break: indic_conjunct_break(chr)
  • Indic Positional Category: indic_positional_category(chr)
  • Indic Syllabic Category: indic_syllabic_category(chr)
  • Grapheme Cluster Break: grapheme_cluster_break(chr)
  • Vertical Orientation: vertical_orientation(chr)
  • Age: age(chr)
  • Total Strokes (CJK): total_strokes(chr, source='G')
    • source='G' = Simplified stroke count. source='T' = Traditional stroke count.
  • Emoji: is_emoji(chr)
  • Emoji Presentation: is_emoji_presentation(chr)
  • Emoji Modifier: is_emoji_modifier(chr)
  • Emoji Modifier Base: is_emoji_modifier_base(chr)
  • Emoji Component: is_emoji_component(chr)
  • Extended Pictographic: is_extended_pictographic(chr)
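
The functions listed above can be called with a single character, much like the standard unicodedata functions. The sketch below gathers a few of them into a dictionary; the helper names are hypothetical, the return types are not specified here, and the source argument of total_strokes is passed positionally as a cautious assumption about its signature.

```python
try:
    import unicodedataplus as ud
except ImportError:
    # Package not installed; the extension functions are unavailable.
    ud = None


def extended_properties(ch):
    """Collect some additional properties for one character (hypothetical helper)."""
    if ud is None:
        return None
    return {
        "script": ud.script(ch),
        "script_extensions": ud.script_extensions(ch),
        "block": ud.block(ch),
        "age": ud.age(ch),
        "is_emoji": bool(ud.is_emoji(ch)),
    }


def stroke_counts(ch):
    """Look up both stroke-count sources for a CJK character (hypothetical helper)."""
    if ud is None:
        return None
    # 'G' = Simplified stroke count, 'T' = Traditional stroke count.
    return {"G": ud.total_strokes(ch, "G"), "T": ud.total_strokes(ch, "T")}


print(extended_properties("A"))
print(stroke_counts("\u6c49"))  # U+6C49, a CJK ideograph
```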

Additionally, two dictionaries (property_value_aliases and property_value_by_alias) are provided for Property Value Alias lookup.
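
The exact key layout of the two dictionaries is not described here, so the following sketch only inspects their top-level keys rather than assuming a particular nesting. The helper name alias_table_overview is hypothetical.

```python
try:
    import unicodedataplus as ud
except ImportError:
    # Package not installed; the alias dictionaries are unavailable.
    ud = None


def alias_table_overview(limit=5):
    """Return the first few top-level keys of each alias dictionary."""
    if ud is None:
        return None
    return {
        "property_value_aliases": sorted(ud.property_value_aliases)[:limit],
        "property_value_by_alias": sorted(ud.property_value_by_alias)[:limit],
    }


print(alias_table_overview())
```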

The versions of this package match Unicode versions, so unicodedataplus==16.0.0 contains data from Unicode 16.0.0.
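
One way to confirm which Unicode version is in use at runtime is to read unidata_version, an attribute of the standard unicodedata module; this sketch assumes unicodedataplus, as a drop-in replacement, exposes the same attribute. The helper name unicode_major_minor is hypothetical.

```python
try:
    import unicodedataplus as unicodedata
except ImportError:
    import unicodedata


def unicode_major_minor():
    """Return the (major, minor) Unicode version of the loaded data tables."""
    parts = unicodedata.unidata_version.split(".")
    return int(parts[0]), int(parts[1])


print(unicode_major_minor())
```

With unicodedataplus==16.0.0 installed, the reported major version would be expected to be 16; with the standard library fallback it depends on the Python release.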

Forked from https://github.com/mikekap/unicodedata2

Changelog

16.0.0

  • Upgrade to Unicode 16.0.0
  • Fixed total_strokes() to allow selection by source (currently just 'G' and 'T')

15.1.0

  • Upgrade to Unicode 15.1.0
  • Add indic_conjunct_break property

15.0.0-2

  • Add age property

15.0.0-1

  • Add vertical orientation property

15.0.0

  • Upgrade to Unicode 15.0.0

14.0.0-2

  • Added Emoji properties

14.0.0-1

  • Added Python 3.10 and PyPy support

14.0.0

  • Upgrade to Unicode 14.0.0

13.0.0-2

  • Added grapheme cluster break property

13.0.0-1

  • Fixed pre-Python 3.8 compatibility problems

13.0.0

  • Upgrade to Unicode 13.0.0

12.1.0

  • Upgrade to Unicode 12.1.0
  • Added property value alias lookup
  • Added script extensions, Indic positional categories, and Indic syllabic categories lookup

12.0.0

  • Forked from unicodedata2
  • Added script, block and total strokes lookup

11.0.0

  • Upgrade to Unicode 11.0.0
  • Remove Python 3.3 support as wheel no longer supports Python 3.3.

10.0.0-2

  • Wheel for Python 3.6

10.0.0

  • Upgrade to Unicode 10.0.0
  • Remove Python 2.6 support as wheel no longer supports Python 2.6.

9.0.0-4

  • Re-releasing 9.0.0-3 because PyPI doesn't handle bad internet connections well.

9.0.0-3

  • Add binary packages for all platforms. Patch by Cosimo Lupo (anthrotype).

9.0.0-2

  • Python 3 support
  • Fix incorrect digitification caused by using the underlying Python's Unicode tables.

9.0.0

  • Upgrade to Unicode 9.0.0

8.0.0

  • Upgrade to Unicode 8.0.0

7.0.0-2

  • Compiles under Python 2.6 (and older 2.7). Patch by John Vandenberg. Fixes #2
  • Runs the regular unicodedata tests. Adds Travis and AppVeyor CI. Patch by John Vandenberg (jayvdb).

7.0.0

  • Initial release

