Skip to main content
Donate to the Python Software Foundation or Purchase a PyCharm License to Benefit the PSF! Donate Now

A python wrapper for the IMS Word Sense Disambiguation tool (Zhong and Ng, 2010)

Project description

PyIMS

A simple python wrapper around the IMS (It Makes Sense) word-sense disambiguation toolkit, that integrates it with NLTK's WordNet interface. Requires IMS to be downloaded separately, which requires a working Java installation. Also requires NLTK with WordNet downloaded. Only tested in Linux, might not work in other OSes. Built for python 3.6+

Installation

First, follow the instructions in Section 3 of the IMS Readme to install the components of IMS into some directory; extract the models from here (step 3.e) into the same directory as in steps 3.b and 3.c. Then, run:

pip install pyims

Usage

from pyims import PyIMS

wsd = PyIMS("path/to/ims", "modelsDirName")
print(wsd.disambiguate("I am interested in the interest rates at the bank.", probs=True, synsets=False))

# If probs=True, returns a list of (token, probability_distribution) tuples where probability_distribution is a map of lemma to its probability
# If probs=False (default False), returns a list of (token, lemma) where lemma is the most probable word-sense in WordNet for the given token
# If synsets=True, lemmas are replaced with the Synsets to which they belong
# If synsets=False (default False), to access a lemma's synset, call lemma.synset()

References

  • Zhong, Zhi and Ng, Hwee Tou. 2010. It Makes Sense: A Wide-Coverage Word Sense Disambiguation System for Free Text. In Proceedings of the ACL 2010 System Demonstrations, pages 78--83, Uppsala, Sweden

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
pyims-0.1.2-py3-none-any.whl (3.3 kB) Copy SHA256 hash SHA256 Wheel py3
pyims-0.1.2.tar.gz (3.0 kB) Copy SHA256 hash SHA256 Source None

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page