A tool that highlights inconsistencies in word segmentation.
Project description
space-diff
Description
space-diff is a tool that highlights inconsistencies in word segmentation within spaced texts (such as training corpora) for any spaceless orthography.
This project is Pure Python and requires Python 3+
Installation
pip install space-diff
Usage/Tutorial
Included in this repository are two toy corpora of segmented simplified Chinese.
Credits
License
GNU GPLv3
Contact
Blake Perry Smith perry.smithb@gmail.com
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
space-diff-0.0.1.tar.gz
(3.3 kB
view hashes)
Built Distribution
space_diff-0.0.1-py3-none-any.whl
(15.8 kB
view hashes)
Close
Hashes for space_diff-0.0.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 09c9eca2a0959c1b75d4d12b2b96f0899ce5672dbe88a648e3ce84961a1c5b55 |
|
MD5 | eab7426f9c1fd50f527b6f04abcfb26a |
|
BLAKE2b-256 | 6a86ca2a3f05fdf05d0311fbd92cff29f2ea0fdafdd4db4c449c3ed3e19455e5 |