Skip to main content

Pure Python module to hyphenate text

Project description

Pyphen is a pure Python module to hyphenate text using existing Hunspell hyphenation dictionaries.

https://github.com/Kozea/Pyphen

This module is a fork of python-hyphenator, written by Wilbert Berendsen.

https://code.google.com/p/python-hyphenator/

Many dictionaries are included in pyphen, they come from the LibreOffice git repository and are distributed under GPL, LGPL and/or MPL. See the dictionaries and the libreoffice’s repository for more details.

http://cgit.freedesktop.org/libreoffice/dictionaries/tree/

Usage:

>>> import pyphen
>>> pyphen.language_fallback('nl_NL_variant1')
'nl_NL'
>>> 'nl_NL' in pyphen.LANGUAGES
True
>>> dic = pyphen.Pyphen(lang='nl_NL')
>>> dic.inserted('lettergrepen')
'let-ter-gre-pen'
>>> dic.wrap('autobandventieldopje', 11)
('autoband-', 'ventieldopje')
>>> for pair in dic.iterate('Amsterdam'):
...     print(pair)
...
('Amster', 'dam')
('Am', 'sterdam')
>>>

Features:

  • 100% pure Python with no dependencies
  • a lot of included dictionaries
  • caches dict files and hyphenated words
  • supports nonstandard hyphenation patterns

License:

Pyphen is released under the GPL 2.0+/LGPL 2.1+/MPL 1.1 tri-license. See COPYING.GPL, COPYING.LGPL and COPYING.MPL for more details.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for Pyphen, version 0.8
Filename, size File type Python version Upload date Hashes
Filename, size Pyphen-0.8.tar.gz (962.6 kB) File type Source Python version None Upload date Hashes View

Supported by

Pingdom Pingdom Monitoring Google Google Object Storage and Download Analytics Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page