Skip to main content

Retrieving literal values from LOD

Project description

LODlit

Simplifying retrieval of literals from Linked Open Data

Different LOD-datasets are available online in different formats with diffferent user-friendliness levels. LODlit allows you to search over different linked open datasets in one place using keywords and outputs the search results in the same json structure convinient for further processing.

For example, LODlit retrieves labels, aliases, and descriptions of Wikidata entities by search terms in a specific language with optional search filtering. It is also possible to get literals in different languages by entity identifiers. Additionally, LODlit provides the functionality to make bag-of-words from literals for natural language processing, for example, to calculate cosine similarity between literals.

Currently, LODlit supports parsing of Wikidata, Getty Art & Architecture Thesaurus (AAT), Princeton WordNet (3.1), and Open Dutch WordNet (1.3).

Installation

pip install LODlit

LODlit is available on PyPI.

  • To parse Princeton WordNet 3.1: After NLTK is installed, download the wordnet31 corpus; Put the content of "wordnet31" to "wordnet" in "nltk_data/corpora" (it is not possible to import wordnet31 from nltk.corpus directly; see explanations on the WordNet website (retrieved on 10.02.2023): "WordNet 3.1 DATABASE FILES ONLY. You can download the WordNet 3.1 database files. Note that this is not a full package as those above, nor does it contain any code for running WordNet. However, you can replace the files in the database directory of your 3.0 local installation with these files and the WordNet interface will run, returning entries from the 3.1 database. This is simply a compressed tar file of the WordNet 3.1 database files"; Use pwn31.check_version() to ensure that WordNet 3.1 is imported;

  • To parse Open Dutch WordNet: Download the ODWN from https://github.com/cultural-ai/OpenDutchWordnet;

License

CC BY-SA 4.0.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

LODlit-0.6.0.tar.gz (2.5 kB view details)

Uploaded Source

Built Distribution

LODlit-0.6.0-py3-none-any.whl (2.7 kB view details)

Uploaded Python 3

File details

Details for the file LODlit-0.6.0.tar.gz.

File metadata

  • Download URL: LODlit-0.6.0.tar.gz
  • Upload date:
  • Size: 2.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.5

File hashes

Hashes for LODlit-0.6.0.tar.gz
Algorithm Hash digest
SHA256 8631bb0f8c551a6da9d2ed9fc390c93e496e21e7ed7ba2390cb913e40d274f81
MD5 a858089f5d632bc23f4adb57dc3cfe18
BLAKE2b-256 e4d201b7a651c8a6eeec0e6cffc4e287275e3fa5722ef0e3c60006d45730edbd

See more details on using hashes here.

File details

Details for the file LODlit-0.6.0-py3-none-any.whl.

File metadata

  • Download URL: LODlit-0.6.0-py3-none-any.whl
  • Upload date:
  • Size: 2.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.5

File hashes

Hashes for LODlit-0.6.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f99c61f81de71e43ffc01b1055d3404f454b9105873536db85a79956b7734cce
MD5 5793c02912cc7ba0e64ae2c2563cc510
BLAKE2b-256 13683626c5fa987202f4c91e4f131d973adae6bd0d74f227bb05d4f6c2791db2

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page