Skip to main content

Python wrapper for obtaining synonyms in the German language from OpenThesaurus

Project description

Python wrapper for obtaining synonyms in German language from OpenThesaurus

Codacy Badge License: MIT

When working in Natural Language Processing (NLP) area, synonyms can be an essential part of the data augmentation process. The task of obtaining synonyms for the German language is currently limited since there are no easily accessible lexical databases for the German language. Compared to the WordNet lexical database for the English language, which is available as an nltk package, GermaNet represents only one German lexical database alternative. However, to use GermaNet for further research purposes, it is necessary to obtain the license manually.

This repository represents a Python wrapper implementation for obtaining synonyms in a faster and easier way, using the German synonym database and API from OpenThesaurus.

Installation

The library can be installed from PyPI:

pip install py_openthesaurus

Usage

As a Python library:

from py_openthesaurus import OpenThesaurus

open_thesaurus = OpenThesaurus(word="München")

# to get the short version of synonyms as a list
synonyms = open_thesaurus.get_synonyms()

# to get the long version of synonyms as a list
synonyms_long = open_thesaurus.get_synonyms(form='long')

As a command-line tool:

usage: py_openthesaurus [-h] [--form {long,short}] --word WORD

Get synonyms of German words from www.openthesaurus.de

optional arguments:
  -h, --help           show this help message and exit
  --form {long,short}  Defaults to form=short which means that short versions
                       of synonyms will be returned, without nach/zu
                       prefixes/suffixes.On the other hand, form=long returns
                       the full versions of synonyms including nach/zu, sich
                       prefixes/suffixes

required arguments:
  --word WORD          A word from which synonyms will be obtained

Acknowledgments

  • OpenThesaurus for developing a German synonym database with API from which synonyms for the German language can be obtained

Licence

Even though this project is under MIT license, please check information about OpenThesaurus licensing and API limitations from the following link API, in the case your software will need an exhaustive amount of synonym requests in a short period of time.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

py_openthesaurus-1.0.0.tar.gz (4.1 kB view hashes)

Uploaded Source

Built Distribution

py_openthesaurus-1.0.0-py3-none-any.whl (6.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page