Skip to main content

Hanzi Converter for Traditional and Simplified Chinese

Project description

This tool converts between simplified and traditional Chinese Characters. It consists of two parts:

  • a command line tool: hanzi-convert

  • a python library: hanziconv

Build Status Latest Version Documentation License

Installation

$ pip install hanziconv

Command Line Tool

The tool requires Python 2.6+

Synopsis

$ ./hanzi-convert --help
usage: hanzi-convert [-h] [-o OUTFILE] [-s] [-v] infile

Simplified and Traditional Chinese Character Conversion
Version 0.2.3 (By Bernard Yue)

Converting to Traditional Hanzi by default with no -s flag

positional arguments:
  infile                filename | "-", corresponds to stdin

optional arguments:
  -h, --help            show this help message and exit
  -o OUTFILE, --output OUTFILE
                        filename to save output, stdout if omitted
  -s, --simplified      convert to simplified characters
  -v, --version         show program's version number and exit

Example

Conversion from stdin

$ ./hanzi-convert -
Press Crtl-D when finished
Typing away
Now write some chinese characters
繁简转换器
^D
Typing away
Now write some chinese characters
繁簡轉換器
$

Python API

This module requires Python 2.6+. See https://pythonhosted.org/hanziconv/ for full documentation.

String Conversion

>>> from hanziconv import HanziConv
>>> print(HanziConv.toSimplified('繁簡轉換器'))
繁简转换器
>>> print(HanziConv.toTraditional('繁简转换器'))
繁簡轉換器
>>> print(HanziConv.toSimplified(u'繁簡轉換器'))
繁简转换器
>>> print(HanziConv.toTraditional(u'繁简转换器'))
繁簡轉換器
>>> print(HanziConv.toSimplified(u'mix English and Chinese. 繁簡轉換器')
mix English and Chinese. 繁简转换器
>>> print(HanziConv.toTraditional(u'mix English and Chinese. 繁简转换器'))
mix English and Chinese. 繁簡轉換器
>>> print(HanziConv.toSimplified('mix English and Chinese. 繁簡轉換器'))
mix English and Chinese. 繁简转换器
>>> print(HanziConv.toTraditional('mix English and Chinese. 繁简转换器'))
mix English and Chinese. 繁簡轉換器

Comparing String

>>> from hanziconv import HanziConv
>>> u'繁簡轉換器' ==  u'繁简转换器'
False
>>> HanziConv.same(u'繁簡轉換器', u'繁简转换器')
True
>>> str1 = 'mix English and Chinese. 繁簡轉換器'
>>> str2 = 'mix English and Chinese. 繁简转换器'
>>> str3 = 'mix Chinese and English. 繁简转换器'
>>> str4 = u'mix English and Chinese. 繁簡轉換器'
>>> HanziConv.same(str1, str2)
True
>>> HanziConv.same(str2, str3)
False
>>> HanziConv.same(str1, str4)
True

Testing

You can either run the standalone runtests.py or standard python setup.py test

$ tar zxf hanziconv-0.2.3.tar.gz
$ cd hanziconv-0.2.3
$ python setup.py test

License

This module is distributed under Apache License Version 2.0.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hanziconv-0.3.tar.gz (273.7 kB view details)

Uploaded Source

File details

Details for the file hanziconv-0.3.tar.gz.

File metadata

  • Download URL: hanziconv-0.3.tar.gz
  • Upload date:
  • Size: 273.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for hanziconv-0.3.tar.gz
Algorithm Hash digest
SHA256 c4998094a4194db0355b395d94edf53908bcabcd374d7bbf7c7bdec242e5679c
MD5 fb95bd60e08d3b5a7d5104178ea746bd
BLAKE2b-256 aa78aa953b61c3b4a311728f22ae94ddbcb611b14a174603efd33511927a2ba7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page