Library for using the Polish Wordnet in the plwnxml format
Project description
Polish Wordnet Python library
Simple, easy-to-use and reasonably fast library for using the Słowosieć (also known as PlWordNet) - a lexico-semantic database of the Polish language. PlWordNet can also be browsed here.
I created this library, because since version 2.9, PlWordNet cannot be easily loaded into Python (for example with nltk
), as it is only provided in a custom plwnxml
format.
Usage
Load wordnet from an XML file (this will take about 20 seconds), and print basic statistics.
import plwordnet
wn = plwordnet.load('plwordnet_4_2.xml')
print(wn)
Expected output:
PlWordnet
lexical units: 513410
synsets: 353586
relation types: 306
synset relations: 1477849
lexical relations: 393137
Find lexical units with name leśny
and print all relations, where where that unit is in the subject/parent position.
for lu in wn.lemmas('leśny'):
for s, p, o in wn.lexical_relations_where(subject=lu):
print(p.format(s, o))
Expected output:
leśny.2 tworzy kolokację z polana.1
leśny.2 jest synonimem mpar. do las.1
leśny.3 przypomina las.1
leśny.4 jest derywatem od las.1
leśny.5 jest derywatem od las.1
leśny.6 przypomina las.1
Print all relation types and their ids:
for id, rel in wn.relation_types.items():
print(id, rel.name)
Expected output:
10 hiponimia
11 hiperonimia
12 antonimia
13 konwersja
...
Installation
Note: plwordnet
requires at Python 3.7 or newer.
pip install plwordnet
Version support
This library should be able to read future versions of PlWordNet without modification, even if more relation types are added. Still, if you use this library with a version of PlWordNet that is not listed below, please consider contributing information if it is supported.
- PlWordNet 4.2
- PlWordNet 4.0
- PlWordNet 3.2
- PlWordNet 3.0
- PlWordNet 2.3
- PlWordNet 2.2
- PlWordNet 2.1
Documentation
See plwordnet/wordnet.py
for RelationType
, Synset
and LexicalUnit
class definitions.
Package functions
load(source)
: Reads PlWordNet, wheresrc
is a path to the wordnet XML file, or a path to the pickled wordnet object. Passed paths can point to files compressed with gzip or lzma.
Wordnet
instance properties
lexical_relations
: List of (subject, predicate, object) triplessynset_relations
: List of (subject, predicate, object) triplesrelation_types
: Mapping from relation type id to objectlexical_units
: Mapping from lexical unit id to unit objectsynsets
: Mapping from synset id to object(lexical|synset)_relations_(s|o|p)
: Mapping from id of subject/object/predicate to a set of matching lexical unit/synset relation idslexical_units_by_name
: Mapping from lexical unit name to a set of matching lexical unit ids
Wordnet
methods
lemmas(value)
: Returns a list ofLexicalUnit
, where the name is equal tovalue
lexical_relations_where(subject, predicate, object)
: Returns lexical relation triples, with matching subject or/and predicate or/and object. Subject, predicate and object arguments can be integer ids orLexicalUnit
andRelationType
objects.synset_relations_where(subject, predicate, object)
: Returns synset relation triples, with matching subject or/and predicate or/and object. Subject, predicate and object arguments can be integer ids orSynset
andRelationType
objects.dump(dst)
: Pickles theWordnet
object to opened filedst
or to a new file with pathdst
.
RelationType
methods
format(x, y, short=False)
: Substitutesx
andy
into theRelationType
display formatdisplay
. Ifshort
,x
andy
are separated by the short relation nameshortcut
.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file plwordnet-0.1.3.tar.gz
.
File metadata
- Download URL: plwordnet-0.1.3.tar.gz
- Upload date:
- Size: 5.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.58.0 CPython/3.9.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2667da90357d67d3d35bc831482e98966030ad9f3605d106581c9ad06ef7443b |
|
MD5 | 721eb4c099e975fcfad2402905490711 |
|
BLAKE2b-256 | 4a0611743f53b82a67a30f3f5eb0b3805e365c1ca66fecfc46322a16a4d50c91 |
File details
Details for the file plwordnet-0.1.3-py2.py3-none-any.whl
.
File metadata
- Download URL: plwordnet-0.1.3-py2.py3-none-any.whl
- Upload date:
- Size: 5.7 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.58.0 CPython/3.9.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4c93b768fe70853ee1aedb0e93e50299d0531c1dbb5bbcdc24882b11604a1d44 |
|
MD5 | cf3fbd60c2dab782670b86f816a345b1 |
|
BLAKE2b-256 | 133fc15d153234269246a1ffcf32926dbcac07a315af347c6fce06f8ed94ee00 |