framework for detecting old loanwords
loanpy is a tool for historical linguists. It extracts sound changes and constraints from an etymological dictionary, generates pseudo-roots for L1, pseudo- sound-substitutions for L2, searches for phonetically identical lexemes and ranks those according to semantic similarity.
$ python -m pip install loanpy
>>> from loanpy import loanfinder as lf
Download and unpack 3 Gigabytes of pretrained Google-News vectors. Move GoogleNews-vectors-negative300.bin to the folder “data”, the full path to which can be retrieved via:
>>> import os >>> print(os.path.dirname(lf.__file__)+r"\data")
Following code will compare a set of Gothic words (data/dfgot.csv) with Hungarian words (data/zaicz.csv) and evaluate which elements are the most likely candidates for loanwords. The result can be viewed in data/results/matches.csv:
Academic Free License (AFL) (Creative Commons Attribution 4.0 International)
Release history Release notifications | RSS feed
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
|Filename, size||File type||Python version||Upload date||Hashes|
|Filename, size loanpy-0.1.7.tar.gz (1.3 MB)||File type Source||Python version None||Upload date||Hashes View|