Paranoid text spacing for good readability: automatically inserts whitespace between CJK (Chinese, Japanese, Korean) characters and half-width characters (letters, digits, and symbols).

Project description

Pangu2

Splits English from Chinese.

Installation

$ pip install -U pangu2

Usage

In Python

import pangu2

new_text = pangu2.spacing_text('當你凝視著bug,bug也凝視著你')
# new_text = '當你凝視著 bug,bug 也凝視著你'

new_content = pangu2.spacing_file('path/to/file.txt')
# new_content = '與 PM 戰鬥的人,應當小心自己不要成為 PM'
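
To make the behaviour above concrete, here is a minimal sketch of the general idea. It is not pangu2's actual implementation: the name naive_spacing and the simplified character ranges are assumptions for illustration, and the real algorithm handles many more cases (symbols, quotes, brackets and so on).

```python
import re

# Assumption: a deliberately small set of CJK ranges (Hiragana/Katakana,
# CJK Extension A, CJK Unified Ideographs, Hangul syllables).
CJK = r"\u3040-\u30ff\u3400-\u4dbf\u4e00-\u9fff\uac00-\ud7af"

# A CJK character directly followed by an ASCII letter or digit, and the reverse.
CJK_THEN_ALNUM = re.compile(rf"([{CJK}])([A-Za-z0-9])")
ALNUM_THEN_CJK = re.compile(rf"([A-Za-z0-9])([{CJK}])")


def naive_spacing(text: str) -> str:
    """Insert one space at each CJK/alphanumeric boundary (hypothetical helper)."""
    text = CJK_THEN_ALNUM.sub(r"\1 \2", text)
    text = ALNUM_THEN_CJK.sub(r"\1 \2", text)
    return text


print(naive_spacing('當你凝視著bug,bug也凝視著你'))
# 當你凝視著 bug,bug 也凝視著你
```

Two passes are used because a single alternation would miss a lone character sandwiched between two boundaries, such as the "a" in "中a中".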

In CLI

$ pangu2 "請使用uname -m指令來檢查你的Linux作業系統是32位元或是[敏感词已被屏蔽]位元"
請使用 uname -m 指令來檢查你的 Linux 作業系統是 32 位元或是 [敏感词已被屏蔽] 位元

$ python -m pangu2 "為什麼小明有問題都不Google?因為他有Bing"
為什麼小明有問題都不 Google?因為他有 Bing

$ echo "未來的某一天,Gmail配備的AI可能會得出一個結論:想要消滅垃圾郵件最好的辦法就是消滅人類" >> path/to/file.txt
$ pangu2 -f path/to/file.txt >> pangu_file.txt
$ cat pangu_file.txt
未來的某一天,Gmail 配備的 AI 可能會得出一個結論:想要消滅垃圾郵件最好的辦法就是消滅人類

$ echo "心裡想的是Microservice,手裡做的是Distributed Monolith" | pangu2
心裡想的是 Microservice,手裡做的是 Distributed Monolith

$ echo "你從什麼時候開始產生了我沒使用Monkey Patch的錯覺?" | python -m pangu2
你從什麼時候開始產生了我沒使用 Monkey Patch 的錯覺?
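
The commands above take their input in three ways: as an argument, from a file via -f, or from standard input. A rough sketch of how a tool could dispatch between those three sources follows. It is an illustration only, not pangu2's source: run, naive_spacing and the program name are hypothetical, and the spacing step is a simplified stand-in.

```python
import argparse
import re
import sys

# Assumption: simplified CJK ranges, used only to keep this sketch self-contained.
_CJK = r"\u3040-\u30ff\u3400-\u4dbf\u4e00-\u9fff\uac00-\ud7af"


def naive_spacing(text):
    """Stand-in for the real spacing algorithm (hypothetical helper)."""
    text = re.sub(rf"([{_CJK}])([A-Za-z0-9])", r"\1 \2", text)
    return re.sub(rf"([A-Za-z0-9])([{_CJK}])", r"\1 \2", text)


def run(argv, stdin=None):
    """Pick the input source the way the examples above suggest, then space it."""
    parser = argparse.ArgumentParser(prog="spacing-sketch")
    parser.add_argument("text", nargs="?", help="text to add spacing to")
    parser.add_argument("-f", dest="file", help="read the text from this file")
    args = parser.parse_args(argv)

    if args.file:                      # spacing-sketch -f path/to/file.txt
        with open(args.file, encoding="utf-8") as fh:
            source = fh.read()
    elif args.text is not None:        # spacing-sketch "some text"
        source = args.text
    else:                              # echo "some text" | spacing-sketch
        source = (stdin or sys.stdin).read()

    # Drop trailing newlines so the caller's print() does not double them.
    return naive_spacing(source.rstrip("\n"))
```

A real entry point would call print(run(sys.argv[1:])); returning the string instead keeps the sketch easy to call from other code.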

History

5.0 (2019-08-02)

  • Remove Letter & Number and Paren Split

4.0.6.1 (2019-02-09)

  • Implement Paranoid Text Spacing algorithm v4

  • Support Python 3.7

  • Drop Python 2.7 support

3.3.0.1 (2018-01-20)

  • Support Python 3.6

  • Add a method: pangu.spacing_file()

  • Add a command-line tool: pangu

3.0.0 (2016-01-24)

  • Support Python 3.5

  • Refactoring

  • Rename text_spacing() to spacing_text()

2.5.6.3 (2015-05-18)

  • Add an alias to spacing(): text_spacing()

  • Fix unicode issue in Python 2.x

2.5.6.2 (2015-05-17)

  • Fix setup.py

2.5.6 (2015-05-17)

  • Synchronize version number with pangu.js

  • Improve Paranoid Text Spacing algorithm

1.0.0 (2014-02-12)

  • Hello World

Download files

Source Distribution

pangu2-5.0.tar.gz (6.0 kB view details)

Uploaded Source

Built Distribution

pangu2-5.0-py3.7.egg (8.3 kB view details)

Uploaded Source

File details

Details for the file pangu2-5.0.tar.gz.

File metadata

  • Download URL: pangu2-5.0.tar.gz
  • Upload date:
  • Size: 6.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: Python-urllib/3.7

File hashes

Hashes for pangu2-5.0.tar.gz

  • SHA256: 84f73bb3d07d2e0338814e4b908e01fcdce8fcf7102b07a94ebffa724c0cab92
  • MD5: ec793a33b9c5489709c23cdb9fdd2720
  • BLAKE2b-256: 004cf05718052ca54f4118c5f8f30ecbc3229e323be251f47f668ace15b39bc7
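
A downloaded archive can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the standard library; the function names and the local file path are assumptions for illustration.

```python
import hashlib

# SHA256 digest listed above for pangu2-5.0.tar.gz.
EXPECTED_SHA256 = "84f73bb3d07d2e0338814e4b908e01fcdce8fcf7102b07a94ebffa724c0cab92"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """True if the file's digest equals the expected one (case-insensitive)."""
    return sha256_of(path) == expected.lower()
```

For example, matches_expected("pangu2-5.0.tar.gz") should return True for an intact download saved under that (assumed) local name.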

File details

Details for the file pangu2-5.0-py3.7.egg.

File metadata

  • Download URL: pangu2-5.0-py3.7.egg
  • Upload date:
  • Size: 8.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: Python-urllib/3.7

File hashes

Hashes for pangu2-5.0-py3.7.egg

  • SHA256: b8698a2b2eb72c12ddf8e8b9133ae5f5c6da30a7cba0ee46f2a42d79167c4f40
  • MD5: 7eeb6a7efb06d00b53781ff3b648bc53
  • BLAKE2b-256: b18842fbc9f4d78e073663ef9b33f0dd333412c1900ad96233cff3f0e58e664a
