Skip to main content

A simple but working Finnish language hyphenator.

Project description

A simple but working Finnish language hyphenator.

By Pyry Kontio a.k.a Drasa (Drasa@IRCnet, pyry.kontio@drasa.eu)

Hyphenates Finnish text with Unicode soft hyphens. (U+00AD) Mainly intended for server- side-hyphenation of web sites.

Allows to set hyphenation-preventing character margins for words so that they won’t break right at the start or the end. (For example, it’d be a bit silly - although certainly possible in Finnish language - to break a word like ‘erikoinen’ at ‘e-rikoinen’. With default margin of 2, it breaks more stylistically pleasingly, ‘eri-koinen’.)

Hyphenated html tags break web sites, so there’s the boolean argument skip_html. That enabled, it skips over all the words that are contained between “<” and “>” characters.

Usage: as a standalone script:

hyphenate_finnish.py [margin] [text]

or as a Python module:

from hyphenate_finnish import hyphenate; hyphenate(“some text but <html> isn’t gonna get hyphenated!”, margin=1, skip_html=True)

It’s that simple. By the way, written in Py3k, but it seems to work with 2.7 too.

Licensed with LGPL.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hyphenate_finnish-1.3.0.tar.gz (9.1 kB view details)

Uploaded Source

File details

Details for the file hyphenate_finnish-1.3.0.tar.gz.

File metadata

File hashes

Hashes for hyphenate_finnish-1.3.0.tar.gz
Algorithm Hash digest
SHA256 6eba9c892e6909ec4b5f3ad366948edd32ec79107754b06ac11a06fb40a023ab
MD5 5994d150af4efa4105d9a496294c1627
BLAKE2b-256 29c34adfd57a2e3fe75a0b833a82f861ac73f72e1ea735c3393862b9f77f98b5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page