Skip to main content

Chinese Text Error corrector

Project description

corrector
=====
# corrector
中文错别字纠正工具。音似、形似错字(或变体字)纠正,可用于中文拼音、笔画输入法的错误纠正。python开发。

**corrector** 依据语言模型检测错别字位置,通过拼音音似特征、笔画五笔编辑距离特征及语言模型困惑度特征纠正错别字。

### 语言模型
* Kenlm(统计语言模型工具)
* RNN(TensorFlow、PaddlePaddle均有实现栈式双向LSTM的语言模型)

安装说明
========

代码对 Python 2/3 均兼容

- 全自动安装: ``easy_install corrector`` 或者 ``pip install corrector`` / ``pip3 install corrector``
- 手动安装:将 corrector 目录放置于当前目录或者 site-packages 目录
- 通过 ``import corrector`` 来引用

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pycorrector-0.0.1.tar.gz (3.0 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page