Paranoid text spacing for better readability: automatically inserts whitespace between CJK (Chinese, Japanese, Korean) characters and half-width characters (letters, digits, and symbols).
Project description
Pangu2
Adds spacing between English and Chinese text.
Installation
$ pip install -U pangu2
Usage
In Python
import pangu2
new_text = pangu2.spacing_text('當你凝視著bug,bug也凝視著你')
# new_text = '當你凝視著 bug,bug 也凝視著你'
new_content = pangu2.spacing_file('path/to/file.txt')
# new_content = '與 PM 戰鬥的人,應當小心自己不要成為 PM'
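To illustrate the basic idea behind the calls above, here is a minimal sketch that inserts a space wherever a CJK character touches an ASCII letter or digit. It is not pangu2's actual algorithm: the name naive_spacing and the reduced character ranges (Hiragana, Katakana, and CJK Unified Ideographs only) are assumptions made for this example, and symbols and punctuation are deliberately ignored.

```python
import re

# Assumption: a reduced CJK range (Hiragana, Katakana, CJK Unified Ideographs).
# The real library is expected to cover more blocks and many punctuation rules.
CJK = r"\u3040-\u30ff\u4e00-\u9fff"

CJK_THEN_ALNUM = re.compile(rf"([{CJK}])([A-Za-z0-9])")
ALNUM_THEN_CJK = re.compile(rf"([A-Za-z0-9])([{CJK}])")


def naive_spacing(text: str) -> str:
    """Insert a space at each boundary between CJK and ASCII letters/digits."""
    text = CJK_THEN_ALNUM.sub(r"\1 \2", text)  # e.g. 好w -> 好 w
    text = ALNUM_THEN_CJK.sub(r"\1 \2", text)  # e.g. d你 -> d 你
    return text


print(naive_spacing("你好world你好"))  # 你好 world 你好
```

Text that already contains spaces at these boundaries is left unchanged, because neither pattern matches across an existing space.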
In CLI
$ pangu2 "請使用uname -m指令來檢查你的Linux作業系統是32位元或是[敏感词已被屏蔽]位元"
請使用 uname -m 指令來檢查你的 Linux 作業系統是 32 位元或是 [敏感词已被屏蔽] 位元
$ python -m pangu2 "為什麼小明有問題都不Google?因為他有Bing"
為什麼小明有問題都不 Google?因為他有 Bing
$ echo "未來的某一天,Gmail配備的AI可能會得出一個結論:想要消滅垃圾郵件最好的辦法就是消滅人類" >> path/to/file.txt
$ pangu2 -f path/to/file.txt >> pangu_file.txt
$ cat pangu_file.txt
未來的某一天,Gmail 配備的 AI 可能會得出一個結論:想要消滅垃圾郵件最好的辦法就是消滅人類
$ echo "心裡想的是Microservice,手裡做的是Distributed Monolith" | pangu2
心裡想的是 Microservice,手裡做的是 Distributed Monolith
$ echo "你從什麼時候開始產生了我沒使用Monkey Patch的錯覺?" | python -m pangu2
你從什麼時候開始產生了我沒使用 Monkey Patch 的錯覺?
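The commands above take their input in three ways: as a positional argument, from a file given with -f, or from standard input. The sketch below shows one way such dispatch could be written with argparse. It is a hypothetical illustration, not pangu2's actual entry point; the names build_parser and read_input and the program name spacing-demo are invented for this example.

```python
import argparse
import sys


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical parser mirroring the three invocation styles shown above.
    parser = argparse.ArgumentParser(prog="spacing-demo")
    parser.add_argument("text", nargs="?", help="text to process")
    parser.add_argument("-f", dest="file", help="read the text from this file")
    return parser


def read_input(argv, stdin=None):
    """Return the text to process: -f file first, then argument, then stdin."""
    args = build_parser().parse_args(argv)
    if args.file:
        with open(args.file, encoding="utf-8") as fh:
            return fh.read()
    if args.text is not None:
        return args.text
    # No argument and no file: fall back to piped input.
    return (stdin or sys.stdin).read()
```

The returned string would then be passed to a spacing function such as pangu2.spacing_text() and the result printed. The priority order (file, then argument, then stdin) is a design choice of this sketch.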
History
5.0 (2019-08-02)
Remove spacing between letters and numbers and around parentheses
4.0.6.1 (2019-02-09)
Implement Paranoid Text Spacing algorithm v4
Support Python 3.7
Drop Python 2.7 support
3.3.0.1 (2018-01-20)
Support Python 3.6
Add a method: pangu.spacing_file()
Add a command-line tool: pangu
3.0.0 (2016-01-24)
Support Python 3.5
Refactoring
Rename text_spacing() to spacing_text()
2.5.6.3 (2015-05-18)
Add an alias to spacing(): text_spacing()
Fix unicode issue in Python 2.x
2.5.6.2 (2015-05-17)
Fix setup.py
2.5.6 (2015-05-17)
Synchronize version number with pangu.js
Improve Paranoid Text Spacing algorithm
1.0.0 (2014-02-12)
Hello World
Download files
Source Distribution
Built Distribution
File details
Details for the file pangu2-5.0.tar.gz.
File metadata
- Download URL: pangu2-5.0.tar.gz
- Upload date:
- Size: 6.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: Python-urllib/3.7
File hashes
Algorithm | Hash digest
---|---
SHA256 | 84f73bb3d07d2e0338814e4b908e01fcdce8fcf7102b07a94ebffa724c0cab92
MD5 | ec793a33b9c5489709c23cdb9fdd2720
BLAKE2b-256 | 004cf05718052ca54f4118c5f8f30ecbc3229e323be251f47f668ace15b39bc7
File details
Details for the file pangu2-5.0-py3.7.egg.
File metadata
- Download URL: pangu2-5.0-py3.7.egg
- Upload date:
- Size: 8.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: Python-urllib/3.7
File hashes
Algorithm | Hash digest
---|---
SHA256 | b8698a2b2eb72c12ddf8e8b9133ae5f5c6da30a7cba0ee46f2a42d79167c4f40
MD5 | 7eeb6a7efb06d00b53781ff3b648bc53
BLAKE2b-256 | b18842fbc9f4d78e073663ef9b33f0dd333412c1900ad96233cff3f0e58e664a