Skip to main content

Snowball stemming algorithms, for information retrieval

Project description

Stemming algorithms

PyStemmer provides access to efficient algorithms for calculating a “stemmed” form of a word. This is a form with most of the common morphological endings removed; hopefully representing a common linguistic base form. This is most useful in building search engines and information retrieval software; for example, a search with stemming enabled should be able to find a document containing “cycling” given the query “cycles”.

PyStemmer provides algorithms for several (mainly european) languages, by wrapping the libstemmer library from the Snowball project in a Python module.

It also provides access to the classic Porter stemming algorithm for english: although this has been superseded by an improved algorithm, the original algorithm may be of interest to information retrieval researchers wishing to reproduce results of earlier experiments.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PyStemmer-2.0.1.tar.gz (559.3 kB view details)

Uploaded Source

File details

Details for the file PyStemmer-2.0.1.tar.gz.

File metadata

  • Download URL: PyStemmer-2.0.1.tar.gz
  • Upload date:
  • Size: 559.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.8.3

File hashes

Hashes for PyStemmer-2.0.1.tar.gz
Algorithm Hash digest
SHA256 9b81c35302f1d2a5ad9465b85986db246990db93d97d3e8f129269ed7102788e
MD5 b9b2296b2eec30b06c62879484328e87
BLAKE2b-256 55b2c3aeebfe4a60256ddb72257e750a94c26c3085f017b7e58c860d5aa91432

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page