Python implementation of StarCC
Project description
StarCC-Py
简繁转换 簡繁轉換 Python implementation of StarCC, the next generation of Simplified-Traditional Chinese conversion framework
安裝 Installation
pip install starcc
用法 Usage
簡轉繁 Simplified to Traditional
不轉換用詞 Without phrase conversion:
from StarCC import PresetConversion
convert = PresetConversion(src='cn', dst='hk', with_phrase=False) # change to `dst='tw'` for Taiwan mode
print(convert('阴天,山容便黯澹无聊,半隐入米家的水墨里去。'))
# 陰天,山容便黯澹無聊,半隱入米家的水墨裏去。
轉換用詞 With phrase conversion:
from StarCC import PresetConversion
convert = PresetConversion(src='cn', dst='tw', with_phrase=True)
print(convert('KB 大桥也被视为帕劳人的后花园。'))
# KB 大橋也被視為帛琉人的後花園。
繁轉簡 Traditional to Simplified
不轉換用詞 Without phrase conversion:
from StarCC import PresetConversion
convert = PresetConversion(src='hk', dst='cn', with_phrase=False) # change to `dst='tw'` for Taiwan mode
print(convert('盆地並不會永久被水覆蓋,而是反覆蒸發循環。'))
# 盆地并不会永久被水覆盖,而是反复蒸发循环。
轉換用詞 With phrase conversion:
from StarCC import PresetConversion
convert = PresetConversion(src='tw', dst='cn', with_phrase=True)
print(convert('在搜尋欄位使用萬用字元。'))
# 在搜索字段使用通配符。
高級用法 Advanced Usage
在簡轉繁時使用外部分詞 Use external segmentation tools when converting from Simplified to Traditional
此功能已預設開啓 This function is enabled by default
from StarCC import PresetConversion
convert = PresetConversion(src='cn', dst='hk', with_phrase=False, use_seg=True)
convert('拥有 116 年历史') # Correct: 擁有 116 年歷史
convert = PresetConversion(src='cn', dst='hk', with_phrase=False, use_seg=False)
convert('拥有 116 年历史') # Wrong: 擁有 116 年曆史
轉換模式一覽 Supported conversion modes
cn
: Simplified Chinese (Mainland China)hk
: Traditional Chinese (Hong Kong)tw
: Traditional Chinese (Taiwan)jp
: Japanese Shinjitai
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
starcc-0.0.3.tar.gz
(7.3 kB
view details)
Built Distribution
starcc-0.0.3-py3-none-any.whl
(478.9 kB
view details)
File details
Details for the file starcc-0.0.3.tar.gz
.
File metadata
- Download URL: starcc-0.0.3.tar.gz
- Upload date:
- Size: 7.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5c8f62732faf149858d3e2b81f450193a8204db0c68051705255f51776ce6bfd |
|
MD5 | 459a3132c2419d40449005ffc74fbd90 |
|
BLAKE2b-256 | 09b7df93b053b2dbe1cc8e8a8b7317f01ae1808d2e78cca2f0b9baddd4b84b5f |
Provenance
File details
Details for the file starcc-0.0.3-py3-none-any.whl
.
File metadata
- Download URL: starcc-0.0.3-py3-none-any.whl
- Upload date:
- Size: 478.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | cf16293dd16e590e1885b218b73c456c7eaef312dde9b8a795f9507968fbf88c |
|
MD5 | 6fbdf1d9da9f0c1b9e9545e75675421d |
|
BLAKE2b-256 | 17fccde5d9a7eb38b63d763d1953aff855cc7ca9c0b37db6cb0927df7f12a95c |