OpenCC dictionary data, configs, and test data

A collection of dictionary data, configs, and test data for Simplified and Traditional Chinese conversions from the OpenCC project.

Compatibility

This package is intended for opencc-js, opencc-py, and other OpenCC-compatible implementations that consume OpenCC dictionary and config data.

  • Strict Version Matching: Compatibility is only guaranteed when the version of opencc-data strictly matches the version of the consumer package.
  • Breaking Changes: We do not guarantee compatibility between different versions of opencc-data. Structure or file names may change to align with upstream OpenCC updates.
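The strict-matching rule above can be enforced at startup. The following is a minimal sketch, not part of this package: `versions_match` and `check_installed` are hypothetical helpers, and the consumer distribution name passed in is whatever your consumer package is published under (this page does not state it).

```python
from importlib.metadata import PackageNotFoundError, version


def versions_match(data_version: str, consumer_version: str) -> bool:
    """Return True only when the two version strings are identical."""
    return data_version.strip() == consumer_version.strip()


def check_installed(consumer_dist: str, data_dist: str = "opencc-data") -> bool:
    """Compare the installed versions of two distributions.

    `consumer_dist` is an assumption supplied by the caller. Returns False
    when either distribution is not installed.
    """
    try:
        return versions_match(version(data_dist), version(consumer_dist))
    except PackageNotFoundError:
        return False
```

An exact string comparison is deliberately strict: it rejects even compatible-looking patch differences, which is what the policy above asks for.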

Data Sync Policy

This package syncs dictionary .txt files and config .json files from OpenCC's generated resource zip, plus test/testcases/testcases.json from the upstream repository. Each sync keeps the upstream resource manifest as an internal baseline so future updates can detect substantive data changes by resource hash.
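As an illustration of hash-based change detection, here is a minimal sketch. It assumes a manifest is simply a mapping from relative file path to SHA-256 hex digest; the real upstream manifest format is not described on this page, and `build_manifest` and `diff_manifests` are hypothetical names.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Hash a file in chunks so large dictionaries are not read at once."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path) -> dict[str, str]:
    """Map each .txt/.json file under `root` to its SHA-256 digest."""
    return {
        p.relative_to(root).as_posix(): sha256_of(p)
        for p in sorted(root.rglob("*"))
        if p.is_file() and p.suffix in {".txt", ".json"}
    }


def diff_manifests(baseline: dict[str, str], current: dict[str, str]) -> dict[str, list[str]]:
    """Report which resources were added, removed, or changed in content."""
    shared = baseline.keys() & current.keys()
    return {
        "added": sorted(current.keys() - baseline.keys()),
        "removed": sorted(baseline.keys() - current.keys()),
        "changed": sorted(k for k in shared if baseline[k] != current[k]),
    }
```

A sync would count as substantive only when one of the three lists is non-empty.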

Usage

Use the config files shipped in data/config/ as the source of truth for dictionary order and conversion-chain behavior. Config contents can change between OpenCC versions, so consumers should load the matching config file for the package version they depend on instead of hard-coding dictionary lists from this README.

Dictionary text files are shipped in data/. Config files reference those dictionaries by file name and preserve OpenCC's stage/group ordering semantics.
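To read the dictionary order from a config rather than hard-coding it, a consumer could walk the parsed JSON and collect the referenced file names. This sketch assumes each dictionary entry stores its file name under a "file" key, as upstream OpenCC configs do; verify this against the config shipped with your version. `iter_dict_files` and `dictionary_files` are hypothetical helpers.

```python
import json
from pathlib import Path
from typing import Any, Iterator


def iter_dict_files(node: Any) -> Iterator[str]:
    """Yield every string stored under a "file" key, in document order."""
    if isinstance(node, dict):
        for key, value in node.items():
            if key == "file" and isinstance(value, str):
                yield value
            else:
                yield from iter_dict_files(value)
    elif isinstance(node, list):
        for item in node:
            yield from iter_dict_files(item)


def dictionary_files(config_path: str) -> list[str]:
    """Load a config file and list the dictionaries it references."""
    config = json.loads(Path(config_path).read_text(encoding="utf-8"))
    return list(iter_dict_files(config))
```

Because JSON arrays and Python dicts both preserve order, the result follows the order in which the config lists its stages and groups. It flattens that nesting, so a real converter still needs the config structure itself.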

Python consumers can install the opencc-data package from PyPI and use the opencc_data module to locate packaged resources:

import opencc_data

config_file = opencc_data.config_path("s2t.json")
dictionary_file = opencc_data.data_path("STCharacters.txt")
testcases_file = opencc_data.test_data_path("testcases.json")
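Once a dictionary file is located, its entries can be parsed. The sketch below assumes the OpenCC text dictionary layout of one entry per line, with the key separated from its candidates by a tab and candidates separated by spaces; `parse_dictionary` is a hypothetical helper, not part of opencc_data.

```python
from typing import Iterable


def parse_dictionary(lines: Iterable[str]) -> dict[str, list[str]]:
    """Parse "key<TAB>value1 value2" lines into a key -> candidates map."""
    entries: dict[str, list[str]] = {}
    for raw in lines:
        line = raw.rstrip("\r\n")
        # Skip blank lines and anything that is not a key/value entry.
        if not line or "\t" not in line:
            continue
        key, _, values = line.partition("\t")
        entries[key] = values.split()
    return entries
```

Assuming `opencc_data.data_path` returns something `open()` accepts, usage would look like `parse_dictionary(open(dictionary_file, encoding="utf-8"))`.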

Stable releases use the same version number on npm and PyPI, for example 1.3.2. Prereleases that carry the next identifier on npm are published to PyPI under the PEP 440 equivalent: for example, npm version 1.4.0-next.20260628 becomes 1.4.0.dev20260628 on PyPI.
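The version mapping described here can be sketched as a small conversion function. It covers only the two shapes shown in this README (a plain X.Y.Z release and an X.Y.Z-next.N prerelease); `npm_to_pep440` is a hypothetical helper, and any other prerelease form is rejected rather than guessed at.

```python
import re

# Matches "1.3.2" or "1.4.0-next.20260628"; nothing else is accepted.
_NEXT_RE = re.compile(r"^(\d+\.\d+\.\d+)(?:-next\.(\d+))?$")


def npm_to_pep440(npm_version: str) -> str:
    """Convert an npm version string to its PyPI (PEP 440) form."""
    match = _NEXT_RE.match(npm_version)
    if match is None:
        raise ValueError(f"unsupported version string: {npm_version!r}")
    release, next_stamp = match.groups()
    return release if next_stamp is None else f"{release}.dev{next_stamp}"
```

This is useful when a tool must compare an npm-side version with the PyPI-side one under the strict matching rule.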


Download files

Source Distribution

opencc_data-1.3.4.dev20260630.tar.gz (532.9 kB view details)

Uploaded Source

Built Distribution

opencc_data-1.3.4.dev20260630-py3-none-any.whl (542.9 kB view details)

Uploaded Python 3

File details

Details for the file opencc_data-1.3.4.dev20260630.tar.gz.

File metadata

  • Download URL: opencc_data-1.3.4.dev20260630.tar.gz
  • Upload date:
  • Size: 532.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for opencc_data-1.3.4.dev20260630.tar.gz
Algorithm    Hash digest
SHA256       05554c37f7ecc0ac7deda7139260b67cfc8d70a02599dbe5a7c37911cbcc556d
MD5          18d0e84b9a3172d689dfc13e56d2b81b
BLAKE2b-256  b05c89baae6b58a7d7058fd91bbe453fece0a675ea071f76933ad9bfe0ebac5f

Provenance

The following attestation bundles were made for opencc_data-1.3.4.dev20260630.tar.gz:

Publisher: pypi-publish.yml on nk2028/opencc-data

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file opencc_data-1.3.4.dev20260630-py3-none-any.whl.

File hashes

Hashes for opencc_data-1.3.4.dev20260630-py3-none-any.whl
Algorithm    Hash digest
SHA256       1d62f63273e05a3311d4db9a45c130b00061a1119c74e6e293880ed762cfec3e
MD5          83d264633352a3af3d6274d97dc41f8b
BLAKE2b-256  750652918d2453ac13dd0b3a95f96194af73c3e30b4093714a1de98cb9919ed3

Provenance

The following attestation bundles were made for opencc_data-1.3.4.dev20260630-py3-none-any.whl:

Publisher: pypi-publish.yml on nk2028/opencc-data

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
