Skip to main content

framework for detecting old loanwords

Project description

loanpy
Build Status

loanpy is a tool for historical linguists. It extracts sound changes and constraints from an etymological dictionary, generates pseudo-roots for L1, pseudo- sound-substitutions for L2, searches for phonetically identical lexemes and ranks those according to semantic similarity.

Installation

$ python -m pip install loanpy

Getting started

>>> from loanpy import loanfinder as lf

Download and unpack 3 Gigabytes of pretrained Google-News vectors. Move GoogleNews-vectors-negative300.bin to the folder “data”, the full path to which can be retrieved via:

>>> import os
>>> print(os.path.dirname(lf.__file__)+r"\data")

Following code will compare a set of Gothic words (data/dfgot.csv) with Hungarian words (data/zaicz.csv) and evaluate which elements are the most likely candidates for loanwords. The result can be viewed in data/results/matches.csv:

>>> lf.loandf()

Data Sources

Gábor Zaicz’s Hungarian etymological dictionary from 2006

Gerhard Köbler’s Gothic database

Hungarian Academy of Science’s online version of Uralisches Etymologisches Wörterbuch

License

Academic Free License (AFL) (Creative Commons Attribution 4.0 International)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

loanpy-0.1.7.tar.gz (1.3 MB view details)

Uploaded Source

File details

Details for the file loanpy-0.1.7.tar.gz.

File metadata

  • Download URL: loanpy-0.1.7.tar.gz
  • Upload date:
  • Size: 1.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.8.1

File hashes

Hashes for loanpy-0.1.7.tar.gz
Algorithm Hash digest
SHA256 e036516c89eb60241d760c649f0ef5209b567c976e88284db6d33ac8629f1f89
MD5 8bc49cd6c46f5687a387af8a167cccfd
BLAKE2b-256 f858d413ff7f47c1ecc0a4306c8aef2e1d62b26204935b0994e1706cc8a0791b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page