A pip-installable library version of hfst-optimized-lookup from https://hfst.github.io/
hfst-optimized-lookup
A pip-installable library version of hfst-optimized-lookup, originally built for itwêwina.
Install
pip install hfst-optimized-lookup
This requires that the machine running pip have a working C++ compiler. If enough people ask us to upload binary ‘wheels’ so that you don’t need a compiler at install time, we could start doing so.
Usage
Import the library:
>>> import hfst_optimized_lookup
Then load an FST!
>>> fst = hfst_optimized_lookup.TransducerFile('../crk-relaxed-analyzer-for-dictionary.hfstol')
Hint: Download crk-relaxed-analyzer-for-dictionary.hfstol by cloning https://github.com/UAlbertaALTLab/cree-intelligent-dictionary/tree/main/src/crkeng/resources/fst to follow along! The file itself is stored in Git LFS, so it is tricky to link to directly.
Do an ordinary lookup, to get a list of concatenated analyses for a wordform:
>>> fst.lookup('atim')
['atim+N+A+Sg', 'atimêw+V+TA+Imp+Imm+2Sg+3SgO']
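If you need the lemma and tags separately from these concatenated strings, one option is to split them yourself. The sketch below is not part of the library: `split_analysis` is a hypothetical helper, and it assumes every tag follows the lemma and starts with `+`, as in the sample output above. Analyses with tags before the lemma would need different handling.

```python
from typing import List, Tuple


def split_analysis(analysis: str) -> Tuple[str, List[str]]:
    """Split a concatenated analysis into (lemma, tags).

    Assumption: all tags come after the lemma and begin with '+'.
    """
    lemma, *tags = analysis.split("+")
    # Put the leading '+' back so the tags look like the FST's own symbols
    return lemma, ["+" + tag for tag in tags]


# Sample output copied from the fst.lookup('atim') call above
analyses = ['atim+N+A+Sg', 'atimêw+V+TA+Imp+Imm+2Sg+3SgO']
parsed = [split_analysis(a) for a in analyses]
```

With a loaded transducer, the same helper could be applied to the result of `fst.lookup(...)` directly.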
Or get each parsed analysis of the wordform:
>>> analysis = fst.lookup_lemma_with_affixes('atim')[0]
>>> analysis.lemma
'atim'
>>> analysis.suffixes
('+N', '+A', '+Sg')
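Since `lookup_lemma_with_affixes` returns a sequence of analyses, you will often want to walk over all of them rather than only the first. The sketch below shows one way to do that. `format_analyses` is a hypothetical helper, and it relies only on the `lemma` and `suffixes` attributes shown above.

```python
def format_analyses(analyses):
    """Return one 'lemma: tag tag ...' string per analysis.

    Each item is expected to expose .lemma (a string) and
    .suffixes (a sequence of strings), as in the example above.
    """
    return [f"{a.lemma}: {' '.join(a.suffixes)}" for a in analyses]


# Intended usage with a loaded transducer (not run here):
#     for line in format_analyses(fst.lookup_lemma_with_affixes('atim')):
#         print(line)
```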
You can also look up the analyses with symbols separated:
>>> fst.lookup_symbols('atim')
[['a', 't', 'i', 'm', '+N', '+A', '+Sg'], ['a', 't', 'i', 'm', 'ê', 'w', '+V', '+TA', '+Imp', '+Imm', '+2Sg', '+3SgO']]
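The symbol lists make it easy to tell lemma characters apart from tags without any string parsing. As a rough sketch, the hypothetical helper `group_symbols` below treats every symbol that starts with `+` as a tag and everything else as part of the lemma. That rule is an assumption based on the output shown above, not something the library guarantees.

```python
def group_symbols(symbols):
    """Group one symbol list into (lemma, tags).

    Assumption: tag symbols start with '+'; all other symbols
    are characters of the lemma.
    """
    lemma = "".join(s for s in symbols if not s.startswith("+"))
    tags = [s for s in symbols if s.startswith("+")]
    return lemma, tags


# First analysis copied from the fst.lookup_symbols('atim') output above
sample = ['a', 't', 'i', 'm', '+N', '+A', '+Sg']
lemma, tags = group_symbols(sample)
```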
Releasing
Run ./release.py --help for details of the release process.
A typical release is:
- Make sure the “Unreleased” section of CHANGELOG.md is up-to-date
- Run the script:
  ./release.py --release-timezone=America/Edmonton --push --release
Source Distribution
Hashes for hfst-optimized-lookup-0.0.12.dev0.tar.gz
Algorithm | Hash digest
---|---
SHA256 | e71af4e48094a41ff42c5a16fcfbf97068fe219c6fcfd6eec8ab450d4235fda1
MD5 | 2ccb20529f6f012074af667cdf6090c4
BLAKE2b-256 | c515ad924541caa0dd997048d1c3d28abe8cb07a0eaf50f63e2364672aa6986e