Skip to main content

Hanzi Converter for Traditional and Simplified Chinese

Project description

This tool converts between simplified and traditional Chinese Characters. The package has two parts:

  • a command line tool, hanzi-convert
  • a python library: hanziconv

Installation

$ pip install hanziconv

Command Line Tool Usage

$ ./hanzi-convert --help
usage: hanzi-convert [-h] [-o OUTFILE] [-s] [-v] infile

Simplified and Traditional Chinese Character Conversion
Version 0.2 (By Bernard Yue)

Converting to Traditional Hanzi by default with no -s flag

positional arguments:
  infile                filename | "-", corresponds to stdin

optional arguments:
  -h, --help            show this help message and exit
  -o OUTFILE, --output OUTFILE
                        filename to save output, stdout if omitted
  -s, --simplified      convert to simplified characters
  -v, --version         show program's version number and exit

Conversion from stdin (press Ctrl-D to terminate input on POSIX systems)

$ ./hanzi-convert.py -
Typing away
Now write some chinese characters
繁简转换器
Press Ctrl-D after 2 new lines

Typing away
Now write some chinese characters
繁簡轉換器
Press Ctrl-D after 2 new lines

$

Python API

This module works for both Python 2.x and 3.x.

String Conversion

>>> from hanziconv import HanziConv
>>> print(HanziConv.toSimplified('繁簡轉換器'))
繁简转换器
>>> print(HanziConv.toTraditional('繁简转换器'))
繁簡轉換器
>>> print(HanziConv.toSimplified(u'繁簡轉換器'))
繁简转换器
>>> print(HanziConv.toTraditional(u'繁简转换器'))
繁簡轉換器
>>> print(HanziConv.toSimplified(u'mix English and Chinese. 繁簡轉換器')
mix English and Chinese. 繁简转换器
>>> print(HanziConv.toTraditional(u'mix English and Chinese. 繁简转换器'))
mix English and Chinese. 繁簡轉換器
>>> print(HanziConv.toSimplified('mix English and Chinese. 繁簡轉換器'))
mix English and Chinese. 繁简转换器
>>> print(HanziConv.toTraditional('mix English and Chinese. 繁简转换器'))
mix English and Chinese. 繁簡轉換器

Comparing String

>>> from hanziconv import HanziConv
>>> u'繁簡轉換器' ==  u'繁简转换器'
False
>>> HanziConv.same(u'繁簡轉換器', u'繁简转换器')
True
>>> str1 = 'mix English and Chinese. 繁簡轉換器'
>>> str2 = 'mix English and Chinese. 繁简转换器'
>>> str3 = 'mix Chinese and English. 繁简转换器'
>>> str4 = u'mix English and Chinese. 繁簡轉換器'
>>> HanziConv.same(str1, str2)
True
>>> HanziConv.same(str2, str3)
False
>>> HanziConv.same(str1, str4)
True

Testing

This module uses pytest. If you downloaded the source distribution, you can run the tests by just run the py.test command line tool from the directory hanziconv of the source code.

$ taz zxf hanziconv-<version>.tar.gz
$ cd hanziconv-<version>/hanziconv
$ py.test

License

This module is distributed under Apache License Version 2.0.

Project details


Release history Release notifications

History Node

0.3.2

History Node

0.3.1

History Node

0.3

History Node

0.2.2

History Node

0.2.1

This version
History Node

0.2

History Node

0.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
hanziconv-0.2.tar.gz (17.3 kB) Copy SHA256 hash SHA256 Source None Aug 13, 2014

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging CloudAMQP CloudAMQP RabbitMQ AWS AWS Cloud computing Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page