A tool that highlights inconsistencies in word segmentation.

Project description

space-diff

Description

space-diff is a tool that highlights inconsistencies in word segmentation within spaced texts (such as training corpora) for any spaceless orthography.

This project is pure Python and requires Python 3.6 or later.
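
To make "inconsistency in word segmentation" concrete, here is a minimal sketch of one way such a check could work. It is not space-diff's actual implementation or API: the function name `find_inconsistencies` and the sample corpus are hypothetical, made up for illustration. The sketch assumes one sentence per line with words separated by whitespace, and it flags any string that appears as a single token in one place and as two adjacent tokens in another.

```python
from collections import Counter


def find_inconsistencies(lines):
    """Illustrative only (hypothetical helper, not part of space-diff).

    Reports every string that occurs both as one token and as a pair of
    adjacent tokens somewhere in the given space-segmented lines.
    """
    tokens = Counter()  # how often each token occurs on its own
    pairs = Counter()   # how often each adjacent (left, right) pair occurs
    for line in lines:
        words = line.split()
        tokens.update(words)
        pairs.update(zip(words, words[1:]))

    report = {}
    for (left, right), n_split in pairs.items():
        joined = left + right
        if joined in tokens:
            entry = report.setdefault(
                joined, {"joined": tokens[joined], "split": {}}
            )
            entry["split"][f"{left} {right}"] = n_split
    return report


# Hypothetical toy corpus: the same string is segmented two different ways.
corpus = [
    "我 是 中国人",
    "他 是 中国 人",
    "中国 人 很 多",
]

print(find_inconsistencies(corpus))
```

On this toy input the sketch reports a single inconsistent item, 中国人, which is written once as one word and twice as 中国 人. It only considers two-way splits; a real tool would likely need to handle splits into more than two parts.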

Installation

pip install space-diff

Usage/Tutorial

Included in this repository are two toy corpora of segmented simplified Chinese.

Credits

License

GNU GPLv3

Contact

Blake Perry Smith <perry.smithb@gmail.com>

Download files

Source Distribution

space-diff-0.0.2.tar.gz (3.6 kB)

Uploaded Source

Built Distribution

space_diff-0.0.2-py3-none-any.whl (18.1 kB)

Uploaded Python 3