Skip to main content

Pure Python module to hyphenate text

Project description

Pyphen is a pure Python module to hyphenate text using existing Hunspell hyphenation dictionaries.

https://github.com/Kozea/Pyphen

This module is a fork of python-hyphenator, written by Wilbert Berendsen.

https://code.google.com/p/python-hyphenator/

Many dictionaries are included in pyphen, they come from the LibreOffice git repository and are distributed under GPL, LGPL and/or MPL. See the dictionaries and the libreoffice’s repository for more details.

http://cgit.freedesktop.org/libreoffice/dictionaries/tree/

Usage:

>>> import pyphen
>>> pyphen.language_fallback('nl_NL_variant1')
'nl_NL'
>>> 'nl_NL' in pyphen.LANGUAGES
True
>>> dic = pyphen.Pyphen(lang='nl_NL')
>>> dic.inserted('lettergrepen')
'let-ter-gre-pen'
>>> dic.wrap('autobandventieldopje', 11)
('autoband-', 'ventieldopje')
>>> for pair in dic.iterate('Amsterdam'):
...     print(pair)
...
('Amster', 'dam')
('Am', 'sterdam')
>>>

Features:

  • 100% pure Python with no dependencies

  • a lot of included dictionaries

  • caches dict files and hyphenated words

  • supports nonstandard hyphenation patterns

License:

Pyphen is released under the GPL 2.0+/LGPL 2.1+/MPL 1.1 tri-license. See COPYING.GPL, COPYING.LGPL and COPYING.MPL for more details.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Pyphen-0.9.4.tar.gz (1.1 MB view hashes)

Uploaded Source

Built Distribution

Pyphen-0.9.4-py2.py3-none-any.whl (1.9 MB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page