A tool that highlights inconsistencies in word segmentation.
space-diff
Description
space-diff is a tool that highlights inconsistencies in word segmentation within spaced texts (such as training corpora) for any spaceless orthography.
This project is pure Python and requires Python 3.6+.
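To make the idea of a segmentation inconsistency concrete, here is a minimal sketch of one simple kind: a character sequence that appears as a single token in one place and as two adjacent tokens in another. This is not space-diff's API or its algorithm. The function name `find_split_merge_conflicts` and the three sample sentences are hypothetical, made up for illustration, and are not the toy corpora shipped with the repository.

```python
from collections import Counter


def find_split_merge_conflicts(lines):
    """Find sequences written both as one token and as two adjacent tokens.

    Hypothetical helper for illustration only; not part of space-diff.
    `lines` is an iterable of strings whose tokens are separated by whitespace.
    """
    token_counts = Counter()
    pair_counts = Counter()
    for line in lines:
        tokens = line.split()
        token_counts.update(tokens)
        # Count every pair of neighbouring tokens in the line.
        pair_counts.update(zip(tokens, tokens[1:]))

    conflicts = {}
    for (left, right), split_count in pair_counts.items():
        merged = left + right
        # A conflict exists when the joined pair also occurs as a single token.
        if merged in token_counts:
            entry = conflicts.setdefault(
                merged, {"merged": token_counts[merged], "split": {}}
            )
            entry["split"][f"{left} {right}"] = split_count
    return conflicts


# Made-up sample sentences (assumption, not data from the project).
corpus = [
    "我 在 北京大学 学习",
    "他 在 北京 大学 工作",
    "北京 大学 很 大",
]

conflicts = find_split_merge_conflicts(corpus)
for merged, info in conflicts.items():
    print(merged, info)
```

In this sample, 北京大学 is written once as a single token and twice as 北京 大学, so it is the only sequence reported. A real tool would likely also need to handle splits into more than two tokens and overlapping boundaries, which this sketch ignores.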
Installation
pip install space-diff
Usage/Tutorial
Included in this repository are two toy corpora of segmented simplified Chinese.
Credits
License
GNU GPLv3
Contact
Blake Perry Smith <perry.smithb@gmail.com>
Download files
Source Distribution
space-diff-0.0.2.tar.gz (3.6 kB)
Built Distribution
space_diff-0.0.2-py3-none-any.whl (18.1 kB)
Hashes for space_diff-0.0.2-py3-none-any.whl
Algorithm | Hash digest |
---|---|
SHA256 | a92df8fed0d48c3c6872c22b07833e0241ad091ab566c2cce1949c9bab4a52ca |
MD5 | 75dfec928073e534270251c775dc55b8 |
BLAKE2b-256 | 0429b9271d33f79b42159c0697725be43c30f0fccedd209635b12d2303bbfe4b |