Conversion between Traditional and Simplified Chinese

Open Chinese Convert 開放中文轉換

Introduction 介紹

This fork adds build logic for macOS arm64 wheel packages on top of the original repository.

OpenCC

Open Chinese Convert (OpenCC, 開放中文轉換) is an open-source project for conversion between Traditional Chinese, Simplified Chinese and Japanese Kanji (Shinjitai). It supports character-level and phrase-level conversion, character variant conversion, and regional idioms among Mainland China, Taiwan and Hong Kong. It is not a translation tool between Mandarin and Cantonese, etc.

中文簡繁轉換開源項目,支持詞彙級別的轉換、異體字轉換和地區習慣用詞轉換(中國大陸、臺灣、香港、日本新字體)。不提供普通話與粵語的轉換。

Discussion (Telegram): https://t.me/open_chinese_convert

Features 特點

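  • Strictly distinguishes "one Simplified to many Traditional" (一簡對多繁) from "one Simplified to many variants" (一簡對多異).
  • Fully compatible with variant characters, which can be substituted dynamically.
  • Entries where one Simplified form maps to several Traditional forms are strictly proofread, following the principle "split wherever possible, never merge" (能分則不合).
  • Supports conversion of character variants and regional idioms among Mainland China, Taiwan and Hong Kong, e.g. 「裏」 and 「裡」, 「鼠標」 and 「滑鼠」.
  • Dictionaries and the library are fully separated, so dictionaries can be freely modified, imported and extended.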
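
To illustrate why phrase-level conversion matters for one-to-many mappings, here is a toy sketch in Python. It is not OpenCC's implementation or dictionary format: the tiny `PHRASES` and `CHARS` tables and the `toy_convert` function are invented for this example, and the strategy (greedy longest phrase match, then a per-character fallback) is only an assumption about how such a converter could work.

```python
# Toy dictionaries invented for this example (not OpenCC data).
# The Simplified character 发 corresponds to 發 in some words and 髮 in others.
PHRASES = {"头发": "頭髮", "发展": "發展"}  # phrase-level entries
CHARS = {"头": "頭", "发": "發", "汉": "漢"}  # character-level fallback

MAX_PHRASE_LEN = max(len(phrase) for phrase in PHRASES)


def toy_convert(text: str) -> str:
    """Convert text with a greedy longest-phrase match, then per character."""
    out = []
    i = 0
    while i < len(text):
        # Try the longest candidate phrase first, down to length 2.
        for n in range(min(MAX_PHRASE_LEN, len(text) - i), 1, -1):
            chunk = text[i:i + n]
            if chunk in PHRASES:
                out.append(PHRASES[chunk])
                i += n
                break
        else:
            # No phrase matched here: fall back to the single character,
            # leaving characters without an entry unchanged.
            ch = text[i]
            out.append(CHARS.get(ch, ch))
            i += 1
    return "".join(out)


print(toy_convert("头发"))  # phrase entry picks 髮
print(toy_convert("发展"))  # phrase entry picks 發
```

With only the character table, both words would receive the same character for 发; the phrase table is what lets the two cases diverge.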

Installation 安裝

See Download.

Usage 使用

Online demo 線上轉換展示

Warning: This is NOT an API. You will be banned if you make calls programmatically.

https://opencc.byvoid.com/

Node.js

npm install opencc

JavaScript

const OpenCC = require('opencc');
const converter = new OpenCC('s2t.json');
converter.convertPromise("汉字").then(converted => {
  console.log(converted);  // 漢字
});

TypeScript

import { OpenCC } from 'opencc';
async function main() {
  const converter: OpenCC = new OpenCC('s2t.json');
  const result: string = await converter.convertPromise('汉字');
  console.log(result);  // 漢字
}
main();

See demo.js and ts-demo.ts.

Python

pip install ds_opencc (Windows, Linux, Mac)

import ds_opencc
converter = ds_opencc.OpenCC('s2t.json')
converter.convert('汉字')  # 漢字
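
The benchmark figures in this README suggest that creating a converter can take tens of milliseconds for some configurations, so it may be worth creating each converter once and reusing it. The following is a minimal sketch of that idea. `ConverterCache` and its `factory` parameter are hypothetical names introduced here, not part of `ds_opencc`; the only library calls assumed are `ds_opencc.OpenCC(config)` and `.convert(text)` as shown above.

```python
class ConverterCache:
    """Create one converter per configuration file and reuse it."""

    def __init__(self, factory=None):
        # `factory` is a hypothetical hook (useful for tests); when it is
        # None, ds_opencc.OpenCC is used, as in the example above.
        self._factory = factory
        self._converters = {}

    def get(self, config="s2t.json"):
        """Return the cached converter for `config`, creating it if needed."""
        if config not in self._converters:
            factory = self._factory
            if factory is None:
                # Imported lazily so this module loads without the package.
                import ds_opencc
                factory = ds_opencc.OpenCC
            self._converters[config] = factory(config)
        return self._converters[config]

    def convert(self, text, config="s2t.json"):
        """Convert `text` using the cached converter for `config`."""
        return self.get(config).convert(text)
```

With the package installed, `ConverterCache().convert('汉字')` should behave like the direct call shown above, while repeated calls reuse the same converter object.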

C++

#include "opencc.h"

int main() {
  const opencc::SimpleConverter converter("s2t.json");
  converter.Convert("汉字");  // 漢字
  return 0;
}

C

#include <string.h>  // strlen

#include "opencc.h"

int main() {
  opencc_t opencc = opencc_open("s2t.json");
  const char* input = "汉字";
  char* converted = opencc_convert_utf8(opencc, input, strlen(input));  // 漢字
  opencc_convert_utf8_free(converted);
  opencc_close(opencc);
  return 0;
}

Documentation 文檔: https://byvoid.github.io/OpenCC/

Command Line

  • opencc --help
  • opencc_dict --help
  • opencc_phrase_extract --help

Others (Unofficial)

Configurations 配置文件

Preset configuration files

  • s2t.json Simplified Chinese to Traditional Chinese 簡體到繁體
  • t2s.json Traditional Chinese to Simplified Chinese 繁體到簡體
  • s2tw.json Simplified Chinese to Traditional Chinese (Taiwan Standard) 簡體到臺灣正體
  • tw2s.json Traditional Chinese (Taiwan Standard) to Simplified Chinese 臺灣正體到簡體
  • s2hk.json Simplified Chinese to Traditional Chinese (Hong Kong variant) 簡體到香港繁體
  • hk2s.json Traditional Chinese (Hong Kong variant) to Simplified Chinese 香港繁體到簡體
  • s2twp.json Simplified Chinese to Traditional Chinese (Taiwan Standard) with Taiwanese idiom 簡體到繁體(臺灣正體標準)並轉換爲臺灣常用詞彙
  • tw2sp.json Traditional Chinese (Taiwan Standard) to Simplified Chinese with Mainland Chinese idiom 繁體(臺灣正體標準)到簡體並轉換爲中國大陸常用詞彙
  • t2tw.json Traditional Chinese (OpenCC Standard) to Taiwan Standard 繁體(OpenCC 標準)到臺灣正體
  • hk2t.json Traditional Chinese (Hong Kong variant) to Traditional Chinese 香港繁體到繁體(OpenCC 標準)
  • t2hk.json Traditional Chinese (OpenCC Standard) to Hong Kong variant 繁體(OpenCC 標準)到香港繁體
  • t2jp.json Traditional Chinese Characters (Kyūjitai) to New Japanese Kanji (Shinjitai) 繁體(OpenCC 標準,舊字體)到日文新字體
  • jp2t.json New Japanese Kanji (Shinjitai) to Traditional Chinese Characters (Kyūjitai) 日文新字體到繁體(OpenCC 標準,舊字體)
  • tw2t.json Traditional Chinese (Taiwan standard) to Traditional Chinese 臺灣正體到繁體(OpenCC 標準)
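
The preset names above appear to follow the pattern `<source>2<target>[p].json`, where the trailing `p` marks the two presets that also convert regional idioms. As a rough sketch under that assumption, the hypothetical helper `describe_config` below splits a preset name into its parts. The `PRESETS` set is copied from the list above; `REGIONS` and the function itself are illustrative only and are not provided by OpenCC.

```python
import re

# Preset file names copied from the list above.
PRESETS = {
    "s2t.json", "t2s.json", "s2tw.json", "tw2s.json", "s2hk.json",
    "hk2s.json", "s2twp.json", "tw2sp.json", "t2tw.json", "hk2t.json",
    "t2hk.json", "t2jp.json", "jp2t.json", "tw2t.json",
}

# Region codes as they appear in the preset names (labels are paraphrased).
REGIONS = {
    "s": "Simplified Chinese",
    "t": "Traditional Chinese (OpenCC Standard)",
    "tw": "Traditional Chinese (Taiwan Standard)",
    "hk": "Traditional Chinese (Hong Kong variant)",
    "jp": "New Japanese Kanji (Shinjitai)",
}

_NAME = re.compile(r"^(s|t|tw|hk|jp)2(s|t|tw|hk|jp)(p?)\.json$")


def describe_config(filename):
    """Split a preset file name into source, target and idiom flag."""
    match = _NAME.match(filename)
    if filename not in PRESETS or match is None:
        raise ValueError(f"not a listed preset: {filename!r}")
    source, target, idiom = match.groups()
    return {
        "source": REGIONS[source],
        "target": REGIONS[target],
        "idioms": idiom == "p",
    }


print(describe_config("s2twp.json"))
```

The helper only interprets names; it does not check that a configuration file is actually installed.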

Build 編譯

Build with CMake

Linux & Mac OS X

g++ 4.6+ or clang 3.2+ is required.

make

Windows Visual Studio:

build.cmd

Test 測試

Linux & Mac OS X

make test

Windows Visual Studio:

test.cmd

Benchmark 基準測試

make benchmark

Example results (from GitHub CI):

1: ------------------------------------------------------------------
1: Benchmark                        Time             CPU   Iterations
1: ------------------------------------------------------------------
1: BM_Initialization/hk2s        1.56 ms         1.56 ms          442
1: BM_Initialization/hk2t       0.144 ms        0.144 ms         4878
1: BM_Initialization/jp2t       0.260 ms        0.260 ms         2604
1: BM_Initialization/s2hk        23.8 ms         23.8 ms           29
1: BM_Initialization/s2t         25.6 ms         25.6 ms           28
1: BM_Initialization/s2tw        24.0 ms         23.9 ms           30
1: BM_Initialization/s2twp       24.6 ms         24.6 ms           28
1: BM_Initialization/t2hk       0.052 ms        0.052 ms        12897
1: BM_Initialization/t2jp       0.141 ms        0.141 ms         5012
1: BM_Initialization/t2s         1.30 ms         1.30 ms          540
1: BM_Initialization/tw2s        1.39 ms         1.39 ms          529
1: BM_Initialization/tw2sp       1.69 ms         1.69 ms          426
1: BM_Initialization/tw2t       0.089 ms        0.089 ms         7707
1: BM_Convert2M                   582 ms          582 ms            1
1: BM_Convert/100                1.07 ms         1.07 ms          636
1: BM_Convert/1000               11.0 ms         11.0 ms           67
1: BM_Convert/10000               113 ms          113 ms            6
1: BM_Convert/100000             1176 ms         1176 ms            1
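
As a quick reading aid for the `BM_Convert` rows, the arithmetic below divides each reported time by the benchmark argument. What that argument counts is not stated here, so "unit" is an assumption; the numbers are copied from the table above, and `ms_per_unit` is a name made up for this sketch.

```python
# (argument, time in ms) pairs copied from the BM_Convert rows above.
BM_CONVERT_MS = {100: 1.07, 1000: 11.0, 10000: 113.0, 100000: 1176.0}


def ms_per_unit(results):
    """Return the time in milliseconds per unit of the benchmark argument."""
    return {n: time_ms / n for n, time_ms in results.items()}


for n, per_unit in ms_per_unit(BM_CONVERT_MS).items():
    # Multiply by 1000 to show microseconds instead of milliseconds.
    print(f"BM_Convert/{n}: {per_unit * 1000:.2f} µs per unit")
```

In these sample results the per-unit time stays within roughly 10% across the four sizes, which is consistent with conversion time growing about linearly with input size.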

Projects using OpenCC 使用 OpenCC 的項目

License 許可協議

Apache License 2.0

Third Party Library 第三方庫

All these libraries are statically linked by default.

Change History 版本歷史

Links 相關鏈接

Contributors 貢獻者

Please feel free to update this list if you have contributed to OpenCC.

Download files

Source Distributions

No source distribution files are available for this release.

Built Distributions

  • ds_opencc-1.1.11-cp312-cp312-win_amd64.whl (715.0 kB): CPython 3.12, Windows x86-64
  • ds_opencc-1.1.11-cp312-cp312-manylinux1_x86_64.whl (780.1 kB): CPython 3.12
  • ds_opencc-1.1.11-cp312-cp312-macosx_12_0_x86_64.whl (735.7 kB): CPython 3.12, macOS 12.0+ x86-64
  • ds_opencc-1.1.11-cp312-cp312-macosx_12_0_arm64.whl (714.2 kB): CPython 3.12, macOS 12.0+ ARM64
  • ds_opencc-1.1.11-cp311-cp311-win_amd64.whl (714.9 kB): CPython 3.11, Windows x86-64
  • ds_opencc-1.1.11-cp311-cp311-manylinux1_x86_64.whl (780.1 kB): CPython 3.11
  • ds_opencc-1.1.11-cp311-cp311-macosx_12_0_x86_64.whl (735.1 kB): CPython 3.11, macOS 12.0+ x86-64
  • ds_opencc-1.1.11-cp311-cp311-macosx_12_0_arm64.whl (713.6 kB): CPython 3.11, macOS 12.0+ ARM64
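
The built distribution file names above follow the standard wheel naming scheme (`name-version[-build]-python-abi-platform.whl`). As a small sketch for picking the right file, the hypothetical helper `parse_wheel_filename` below splits a name into those fields; it is a simplified illustration, not a replacement for the tooling in `pip` or `packaging`.

```python
def parse_wheel_filename(filename):
    """Split a wheel file name into its dash-separated tags."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        name, version, python_tag, abi_tag, platform_tag = parts
        build = None
    elif len(parts) == 6:
        # The optional build tag sits between the version and the Python tag.
        name, version, build, python_tag, abi_tag, platform_tag = parts
    else:
        raise ValueError(f"unexpected number of fields in {filename!r}")
    return {
        "name": name,
        "version": version,
        "build": build,
        "python": python_tag,
        "abi": abi_tag,
        "platform": platform_tag,
    }


print(parse_wheel_filename("ds_opencc-1.1.11-cp312-cp312-macosx_12_0_arm64.whl"))
```

For the file in the example, the `python` field is `cp312` and the `platform` field is `macosx_12_0_arm64`, matching the CPython 3.12 / macOS 12.0+ ARM64 entry in the list.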

File hashes

Hashes for ds_opencc-1.1.11-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 5f37301a0716ade9db3c32d3de0dca993855b0eeadb326cd478090b44170c6d5
MD5 0775d7ecf0dff71f1f6687c9c97e620a
BLAKE2b-256 129385c3d197a695ac9a0e84ebc6628bd679aa0f337cfc5b8e529e55c4cb1d13

Hashes for ds_opencc-1.1.11-cp312-cp312-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 ab12491cef2d43c859246e3e8c5d87d709658c03af05d7c970b8e4bdfd851cad
MD5 e6efd16c97bbc10fac9781edf099d562
BLAKE2b-256 c7223582828668959a4ba5cd78886cf3e90b0e61b0ec321ad0bb58f36dc9a778

Hashes for ds_opencc-1.1.11-cp312-cp312-macosx_12_0_x86_64.whl
Algorithm Hash digest
SHA256 fa49ace4ea3217c08d391ddbd76121c1f3140369a0a54602fc5760500eb5463e
MD5 0537d39debe2a8ec18cc08d233bccf80
BLAKE2b-256 4100c285fd617aa47fb34239d6bd964cd4aa46e2badd41113c8e90e31485bd89

Hashes for ds_opencc-1.1.11-cp312-cp312-macosx_12_0_arm64.whl
Algorithm Hash digest
SHA256 c99545c510919ecb598df700a19488d5fd211771ff53324c93f7aa5ab31178d4
MD5 55e13a2e04f84d44a4ea13344a9e0f31
BLAKE2b-256 7d82b2b30c62b944a5e0229549e80a337f3c53ab7c6f0c8e7800d774f9299cac

Hashes for ds_opencc-1.1.11-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 ec254093e6344e2d5fe1f812273cef375e928036f6c2640c9aef2883d5c25212
MD5 55a5f3ba17556c3218dbc8df9f2935ad
BLAKE2b-256 ecbd2143b5cb4566f4f36c46dd4047becbea59a41a3a7354f5c9552b38f18a6b

Hashes for ds_opencc-1.1.11-cp311-cp311-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 ddc74a3d5e16c792464bf300bd732abe7cad1d342ce143780da2d571e094fe71
MD5 12d02ef431ece95a806517fa14741719
BLAKE2b-256 210d1d949618f1bbfcadb41f715b6fd68e41439cd94b59ddb30716e47d5e995d

Hashes for ds_opencc-1.1.11-cp311-cp311-macosx_12_0_x86_64.whl
Algorithm Hash digest
SHA256 9aad2fc21fe3b85316a9dbe643a05e8e06b428a499a6e4d8c870415832e04b0b
MD5 78159ced3c646ba20677bfb6cca8dc17
BLAKE2b-256 5dc39f813a638f7fad6c89b9d5ccdbc52de65b4e08b8e04e0a8f05dca737a49a

Hashes for ds_opencc-1.1.11-cp311-cp311-macosx_12_0_arm64.whl
Algorithm Hash digest
SHA256 6bbcf69dd0c83f42fe6168fdc5782397b8b285d74045126af26e0ea17065b819
MD5 2d6fa30f945f5a0d2db5c6b142403459
BLAKE2b-256 28c59877e95b19c12db07f26cdc6f7d1d3781d10831a395c5590a5fcd170acad
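
The SHA256 digests listed above can be used to check a downloaded wheel. The following is a minimal sketch using only Python's standard `hashlib`; the function names `sha256_of_file` and `verify_sha256` are made up for this example, and the file path in the usage note is a placeholder for wherever the wheel was saved.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_sha256(path, expected_hex):
    """Return True if the file's SHA256 digest matches the expected value."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, `verify_sha256("ds_opencc-1.1.11-cp311-cp311-macosx_12_0_arm64.whl", "6bbcf69dd0c83f42fe6168fdc5782397b8b285d74045126af26e0ea17065b819")` should return `True` for an intact copy of that file.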
