Skip to main content

A comprehensive lexical discovery application that is useful for finding semantic relationships such as, the antonyms, synonyms, hypernyms, hyponyms, homophones and definitions for a specific word.

Project description

Primary Use Case

Textual analysis is a broad term for various research methodologies used to qualitatively describe, interpret and understand text data. These methodologies are mainly used in academic research to analyze content related to media and communication studies, popular culture, sociology, and philosophy. Textual analysis allows these researchers to quickly obtain relevant insights from unstructured data. All types of information can be gleaned from textual data, especially from social media posts or news articles. Some of this information includes the overall concept of the subtext, symbolism within the text, assumptions being made and potential relative value to a subject (e.g. data science). In some cases it is possible to deduce the relative historical and cultural context of a body of text using analysis techniques coupled with knowledge from different disciplines, like linguistics and semiotics.

Word frequency is the technique used in textual analysis to measure the frequency of a specific word or word grouping within unstructured data. Measuring the number of word occurrences in a corpus allows a researcher to garner interesting insights about the text. A subset of word frequency is the correlation between a given word and that word's relationship to either antonyms and synonyms within the specific corpus being analyzed. Knowing these relationships is critical to improving word frequencies and topic modeling.

Wordhoard was designed to assist researchers performing textual analysis to build more comprehensive lists of antonyms, synonyms, hypernyms, hyponyms and homophones.

Installation

Install the distribution via pip:

pip3 install wordhoard

General Package Utilization

Please reference the WordHoard Documentation for package usage guidance and parameters.

Sources

This package is designed to query these online sources for antonyms, synonyms, hypernyms, hyponyms and definitions:

  1. classicthesaurus.com
  2. merriam-webster.com
  3. synonym.com
  4. thesaurus.com
  5. wordhippo.com
  6. wordnet.princeton.edu

Dependencies

This package has these core dependencies:

  1. backoff
  2. BeautifulSoup
  3. deckar01-ratelimit
  4. deepl
  5. lxml
  6. requests
  7. urllib3

Additional details on this package's dependencies can be found here.

Development Roadmap

If you would like to contribute to the Wordhoard project please read the contributing guidelines.

Items currently under development:

  • English language word verification using the Python package pyenchant
  • Expanding the list of hypernyms, hyponyms and homophones
  • Adding part-of-speech filters in queries

Issues

This repository is actively maintained. Feel free to open any issues related to bugs, coding errors, broken links or enhancements.

You can also contact me at John Bumgarner with any issues or enhancement requests.

Sponsorship

If you would like to contribute financially to the development and maintenance of the Wordhoard project please read the sponsorship information.

License

The MIT License (MIT). Please see License File for more information.

Author

Copyright (c) 2020 John Bumgarner

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wordhoard-1.5.1.tar.gz (265.3 kB view details)

Uploaded Source

Built Distribution

wordhoard-1.5.1-py3-none-any.whl (300.0 kB view details)

Uploaded Python 3

File details

Details for the file wordhoard-1.5.1.tar.gz.

File metadata

  • Download URL: wordhoard-1.5.1.tar.gz
  • Upload date:
  • Size: 265.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.9.10

File hashes

Hashes for wordhoard-1.5.1.tar.gz
Algorithm Hash digest
SHA256 3c132b941b641ab868325e0ff921c75c2fdf49a2f18a9fa49399d7f361a0b003
MD5 b93821d05cf40a47501ce72815cfc9c8
BLAKE2b-256 92d7257cbf68eb87dc2c8344575c3bffa78ed8d4e8139f33545bc55443d24bf6

See more details on using hashes here.

File details

Details for the file wordhoard-1.5.1-py3-none-any.whl.

File metadata

  • Download URL: wordhoard-1.5.1-py3-none-any.whl
  • Upload date:
  • Size: 300.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.9.10

File hashes

Hashes for wordhoard-1.5.1-py3-none-any.whl
Algorithm Hash digest
SHA256 7a0d4eda04180f3c2fca53ff9ec6313d2824e253d87401585e8226145f24e854
MD5 54beb0739b229c5cbaf26594e69aa720
BLAKE2b-256 81c0d882dbda51022673263abe31689576b95774a0c44578d0206d135e3e9d23

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page