A simple but working Finnish language hyphenator.
By Pyry Kontio a.k.a Drasa (Drasa@IRCnet, pyry.kontio@drasa.eu)
Hyphenates Finnish text with Unicode soft hyphens (U+00AD). Mainly intended for server-side hyphenation of web sites.
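Soft hyphens are invisible unless a line actually breaks at them, which makes hyphenated output awkward to inspect. The following is a minimal sketch, independent of this package, of two helpers for working with such text; the names `reveal_soft_hyphens` and `strip_soft_hyphens` are made up for illustration.

```python
SOFT_HYPHEN = "\u00ad"  # Unicode SOFT HYPHEN, invisible unless a line breaks there


def reveal_soft_hyphens(text, marker="-"):
    """Replace each soft hyphen with a visible marker, for debugging."""
    return text.replace(SOFT_HYPHEN, marker)


def strip_soft_hyphens(text):
    """Remove soft hyphens, e.g. before comparing or indexing strings."""
    return text.replace(SOFT_HYPHEN, "")


word = "eri" + SOFT_HYPHEN + "koinen"
print(len(word))                  # 10: the soft hyphen counts as a character
print(reveal_soft_hyphens(word))  # eri-koinen
print(strip_soft_hyphens(word))   # erikoinen
```

Stripping is worth remembering when hyphenated text is later searched or compared, since the extra character changes string equality and length.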
Lets you set a character margin that prevents hyphenation near the edges of a word, so that words won’t break right at the start or the end. (For example, it’d be a bit silly, although certainly possible in Finnish, to break a word like ‘erikoinen’ as ‘e-rikoinen’. With the default margin of 2, it breaks in a more stylistically pleasing way: ‘eri-koinen’.)
Hyphenated HTML tags break web sites, so there’s a boolean argument, skip_html. When it is enabled, the hyphenator skips all words between “<” and “>” characters.
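One way to picture the margin is as a filter on candidate break positions. This is only a sketch under stated assumptions, not this package's actual code: it assumes a break is kept when at least `margin` characters remain on both sides of it, and the candidate positions are hand-picked here rather than computed by a hyphenation algorithm. The function name `insert_breaks` is hypothetical.

```python
SOFT_HYPHEN = "\u00ad"


def insert_breaks(word, breaks, margin=2):
    """Insert soft hyphens at the candidate positions that respect the margin.

    `breaks` are indices into `word`; a break at index i falls between
    word[i - 1] and word[i]. Assumed rule: keep i only if at least
    `margin` characters remain on both sides of the break.
    """
    allowed = [i for i in sorted(breaks) if margin <= i <= len(word) - margin]
    pieces = []
    start = 0
    for i in allowed:
        pieces.append(word[start:i])
        start = i
    pieces.append(word[start:])
    return SOFT_HYPHEN.join(pieces)


candidates = [1, 3, 6]  # e-ri-koi-nen, chosen by hand for this example
print(insert_breaks("erikoinen", candidates, margin=2).replace(SOFT_HYPHEN, "-"))  # eri-koi-nen
print(insert_breaks("erikoinen", candidates, margin=1).replace(SOFT_HYPHEN, "-"))  # e-ri-koi-nen
```

Under this assumed rule, a margin of 2 drops the break after the single leading ‘e’ while leaving the later ones in place.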
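The idea of skipping tags can be sketched as follows. This is an illustrative approach, not necessarily how skip_html is implemented in this package: it splits the text on anything between “<” and “>” with a regular expression and applies a transformation only to the parts outside tags. The function name `transform_outside_tags` and the `transform` callable are hypothetical; `str.upper` stands in for a hyphenator so the effect is visible.

```python
import re

# Matches anything from "<" to the next ">"; the capturing group makes
# re.split keep the matched tags in the result list.
TAG_PATTERN = re.compile(r"(<[^>]*>)")


def transform_outside_tags(text, transform):
    """Apply `transform` to text outside of <...> spans, leaving tags untouched."""
    parts = TAG_PATTERN.split(text)
    # With one capturing group, even indices hold plain text and odd
    # indices hold the tags that separated them.
    for index in range(0, len(parts), 2):
        parts[index] = transform(parts[index])
    return "".join(parts)


print(transform_outside_tags("some <b>text</b>", str.upper))  # SOME <b>TEXT</b>
```

A regular expression like this is a simplification: it does not handle a literal “>” inside an attribute value, comments, or the contents of script and style elements.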
Usage: as a standalone script:
hyphenate_finnish.py [margin] [text]
or as a Python module:
from hyphenate_finnish import hyphenate; hyphenate("some text but <html> isn't gonna get hyphenated!", margin=1, skip_html=True)
It’s that simple. By the way, it is written for Python 3, but it seems to work with 2.7 too.
Licensed under the LGPL.
Source Distribution
File details
Details for the file hyphenate_finnish-1.3.0.tar.gz.
File metadata
- Download URL: hyphenate_finnish-1.3.0.tar.gz
- Upload date:
- Size: 9.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 6eba9c892e6909ec4b5f3ad366948edd32ec79107754b06ac11a06fb40a023ab |
| MD5 | 5994d150af4efa4105d9a496294c1627 |
| BLAKE2b-256 | 29c34adfd57a2e3fe75a0b833a82f861ac73f72e1ea735c3393862b9f77f98b5 |