Skip to main content
This is a pre-production deployment of Warehouse. Changes made here affect the production instance of PyPI (pypi.python.org).
Help us improve Python packaging - Donate today!

Normalizes files or standard input using a Unicode normalization form.

Project Description
Usage:

unormalize [-f FORM] [-i EXT] [files...]
nfc [-i EXTENSION] [files...]
nfd [-i EXTENSION] [files...]
nfkc [-i EXTENSION] [files...]
nfkd [-i EXTENSION] [files...]

Options

-i EXTENSION -- Modify files inplace, saving back-ups with EXTENSION
-f FORM-- normalization form
Home-page: https://github.com/eddieantonio/unormalize
Author: Eddie Antonio Santos
Author-email: easantos@ualberta.ca
License: MIT
Download-URL: https://github.com/eddieantonio/unormalize
Description: **************************************************
unormalize - Filters that do Unicode normalization
**************************************************

Converts UTF-8 input to the desired UTF-8 in Unicode normalization form.

Read about the `Unicode Normalization Forms`_!

=====
Usage
=====

There are five executables included, that all have the exact same usage and
arguments:

- unormalize
- nfc
- nfd
- nfkc
- nfkd

You may either redirect or pipe input into `unormalize` (and its buddies), or
provide filenames as arguments.

-------
Options
-------

``-f FORM``/``--form=FORM``
Selects the normalization form: one of NFC, NFD, NFKC, or NFKD. The
equivalently named executables imply their respective normalization form;
``unormalize`` is equivilent to ``nfk`` without the ``--form`` arugment.

``-i EXTENSION``/``--in-place EXTENSION``
Filenames **must** be specified as arguments. If so, this opens them, and
converts them into the desired normalization form, in place. ``EXTENSION`` is
the extension given to back-ups of the original files.

========
Examples
========

Convert clipboard contents to NFC (OS X)::

$ pbpaste | nfc | pbcopy

Convert a file, in-place, to NFKD::

$ nfkd --in-place=.bak file.txt && rm file.txt.bak

Convert circled, variants, and half-widths to their compatible forms::

$ echo 'ℍ①カ' | nfkc
H1カ

=======
License
=======

© 2015 Eddie Antonio Santos. MIT Licensed.

.. _`Unicode Normalization Forms`: http://unicode.org/reports/tr15/

Platform: UNKNOWN
Classifier: Development Status :: 5 - Production/Stable
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 2
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3.3
Classifier: Programming Language :: Python :: 3.4
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Environment :: Console
Classifier: License :: OSI Approved :: MIT License
Classifier: Topic :: Text Processing
Classifier: Topic :: Utilities
Release History

Release History

This version
History Node

0.2.0

History Node

0.1.0

Download Files

Download Files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

File Name & Checksum SHA256 Checksum Help Version File Type Upload Date
unormalize-0.2.0-py2.py3-none-any.whl (5.5 kB) Copy SHA256 Checksum SHA256 py2.py3 Wheel Nov 10, 2017
unormalize-0.2.0.tar.gz (4.6 kB) Copy SHA256 Checksum SHA256 Source Nov 10, 2017

Supported By

WebFaction WebFaction Technical Writing Elastic Elastic Search Pingdom Pingdom Monitoring Dyn Dyn DNS Sentry Sentry Error Logging CloudAMQP CloudAMQP RabbitMQ Heroku Heroku PaaS Kabu Creative Kabu Creative UX & Design Fastly Fastly CDN DigiCert DigiCert EV Certificate Rackspace Rackspace Cloud Servers DreamHost DreamHost Log Hosting