Python module for text segmentation.
The pytextseg package provides functions to wrap plain text: fill() and wrap() are Unicode-aware alternatives to the functions of the same names in the standard textwrap module; fold() and unfold() are aimed mainly at plain-text messages such as e-mail.
It also provides lower-level interfaces for text segmentation: the LineBreak class for line breaking and the GCStr class for grapheme cluster segmentation.
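To illustrate why Unicode-aware wrapping and grapheme cluster segmentation matter, the sketch below uses only the Python standard library; it does not call pytextseg. The helper display_width() is a hypothetical name introduced here, and its width rule (2 columns for East Asian wide or fullwidth characters, 0 for combining marks, 1 otherwise) is a simplifying assumption, not the rule pytextseg applies.

```python
import textwrap
import unicodedata


def display_width(s):
    """Rough column count for s (hypothetical helper, simplified rule)."""
    total = 0
    for ch in s:
        if unicodedata.combining(ch):
            continue  # combining marks add no column of their own
        # "W" (wide) and "F" (fullwidth) characters take two columns
        total += 2 if unicodedata.east_asian_width(ch) in ("W", "F") else 1
    return total


# 24 wide characters with no spaces between them
text = "日本語のテキスト" * 3

# textwrap measures lines with len(), i.e. in code points, not columns
lines = textwrap.wrap(text, width=10)
code_points = [len(line) for line in lines]
columns = [display_width(line) for line in lines]

# One user-perceived character made of two code points:
# "e" followed by COMBINING ACUTE ACCENT
accented = "e\u0301"

print(code_points)  # code points per line
print(columns)      # approximate columns per line
print(len(accented), display_width(accented))
```

Each line stays within 10 code points but occupies about twice that many columns, and len() reports 2 for a string a reader sees as one character. These are the kinds of mismatches that column-aware wrapping and grapheme cluster segmentation are meant to address.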
Install
See INSTALL.
For Other Languages
- Perl
Unicode-LineBreak: http://search.cpan.org/dist/Unicode-LineBreak/
License
pytextseg is Copyright (C) 2012, by Hatuka*nezumi - IKEDA Soji.
It is free software; you can redistribute it and/or modify it under the terms of either:
the GNU General Public License as published by the Free Software Foundation; either version 2, or (at your option) any later version, or
the “Artistic License”.
See the COPYING and the ARTISTIC files for more details.