# glem

GLEM is a lemmatizer for Ancient Greek.

It was created as part of the project [Unraveling the Language of Perspective](http://ncs.ruhosting.nl/perspective/), which is supported by the EU under FP7, ERC Starting Grant 338421-Perspective.

The paper ‘A memory-based lemmatizer for Ancient Greek’ describes how it works, what material it uses, and how accurate it is. The paper is available in the repository and at http://dl.acm.org/citation.cfm?id=3078100.

A webservice where you can upload texts for lemmatization is available at https://webservices.cls.ru.nl/. You can also host your own.

## Dependencies

The simple word-list-based lemmatizer requires only Python 3.

To add machine-learning-based lemmatization that also takes context into account, glem uses [Frog](https://languagemachines.github.io/frog/) via its [Python binding](https://github.com/proycon/python-frog).
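
To illustrate why context can matter, here is a minimal sketch of a pure word-list lookup. It is not GLEM's actual code: the toy word list, the function name `lookup_lemma`, and the convention of returning `None` for unresolved forms are all assumptions made for illustration only.

```python
from typing import Dict, List, Optional

# Hypothetical toy word list: inflected form -> candidate lemmas.
# A real word list would be far larger and loaded from a file.
WORD_LIST: Dict[str, List[str]] = {
    "λόγου": ["λόγος"],
    "λόγοι": ["λόγος"],
    "ἔλεγον": ["λέγω"],
    # Ambiguous form: a plain lookup cannot choose between these.
    "ἦν": ["εἰμί", "ἠμί"],
}


def lookup_lemma(form: str, word_list: Dict[str, List[str]] = WORD_LIST) -> Optional[str]:
    """Return the lemma if the word list gives exactly one candidate.

    Returns None for unknown or ambiguous forms, which is where a
    context-aware component would have to take over.
    """
    candidates = word_list.get(form, [])
    if len(candidates) == 1:
        return candidates[0]
    return None
```

In this sketch, unknown and ambiguous forms are left unresolved; those are the cases where a context-aware component could help.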

## Installation

Run `pip install .` from the repository root.

We recommend installing into a Python virtual environment. Alternatively, for a global installation, prepend `sudo` to the command.

## Example usage

Glem comes with a pretrained model for Herodotus, based on lemmas chosen by human annotators in the UiO PROIEL project (PI: Dag Haug). You can use it (with or without Frog) as follows:

` glem -f input.txt `

The files for this model can be found in `glem/pretrained_models/herodotus`.
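
If you want to call the command-line tool from a Python script, a thin wrapper around `subprocess` may be enough. The sketch below uses only the `-f` option shown above; the helper names `glem_command` and `run_glem` are hypothetical, and where glem writes its output is not specified here, so the wrapper does not try to capture or parse it.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List


def glem_command(input_file: Path) -> List[str]:
    """Build the command line `glem -f <input_file>` as an argument list."""
    return ["glem", "-f", str(input_file)]


def run_glem(input_file: Path) -> subprocess.CompletedProcess:
    """Run glem on one file; raises if glem is missing or exits with an error."""
    if shutil.which("glem") is None:
        raise RuntimeError("glem executable not found on PATH; install it first")
    # check=True raises CalledProcessError on a non-zero exit status.
    return subprocess.run(glem_command(input_file), check=True)
```

Calling `run_glem(Path("input.txt"))` is then equivalent to running the command above by hand.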

## Webservice

A Dockerfile is provided for deployment of the GLEM webservice in production environments.

From the repository root, build as follows:

`` $ docker build -t webglem . ``

Consult the [Dockerfile](Dockerfile) for various build-time parameters that you may want to set for your own production environment.

When running the container, mount the directory where you want user data to be stored. A directory `webglem-userdata` will be created there:

`` $ docker run -p 8080:80 -v /path/to/data/dir:/data webglem ``
