Skip to main content

Python module for text segmentation.

Project description

The pytextseg package provides functions to wrap plain texts: fill() and wrap() are Unicode-aware alternatives for those of textwrap standard module; fold() and unfold() are functions mainly focus on plain text messages such as e-mail.

It also provides lower level interfaces for text segmentation: LineBreak class for line breaking; GCStr class for grapheme cluster segmentation.

Install

See INSTALL.

For Other Language

Perl

Unicode-LineBreak: http://search.cpan.org/dist/Unicode-LineBreak/

License

pytextseg is Copyright (C) 2012, by Hatuka*nezumi - IKEDA Soji.

It is free software; you can redistribute it and/or modify it under the terms of either:

  1. the GNU General Public License as published by the Free Software Foundation; either version 2, or (at your option) any later version, or

  2. the “Artistic License”.


See the COPYING and the ARTISTIC files for more details.

Author

Hatuka*nezumi - IKEDA Soji <hatuka(at)nezumi.nu>.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pytextseg-0.2.0.tar.gz (212.3 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page